The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Search results for "HTML::LinkExtor"

HTML::LinkExtor - Extract links from an HTML document River stage four • 548 direct dependents • 3510 total dependents

*HTML::LinkExtor* is an HTML parser that extracts links from an HTML document. The *HTML::LinkExtor* is a subclass of *HTML::Parser*. This means that the document should be given to the parser by calling the $p->parse() or $p->parse_file() methods....

OALDERS/HTML-Parser-3.82 - 13 Mar 2024 20:11:51 UTC - Search in distribution

HTML::SimpleLinkExtor - Extract links from HTML River stage one • 3 direct dependents • 5 total dependents

This is a simple HTML link extractor designed for the person who does not want to deal with the intricacies of "HTML::Parser" or the de-referencing needed to get links out of "HTML::LinkExtor". You can extract all the links or some of the links (base...

NHORNE/HTML-SimpleLinkExtor-1.273 - 13 Nov 2020 23:31:59 UTC - Search in distribution

HTML::RelExtor - Extract "rel" and "rev" information from LINK and A tags. River stage zero No dependents

HTML::RelExtor is a HTML parser module to extract relationship information from "A" and LINK HTML tags....

MIYAGAWA/HTML-RelExtor-0.03 - 12 Apr 2009 03:20:12 UTC - Search in distribution

lwptut - An LWP Tutorial River stage four • 2163 direct dependents • 6200 total dependents

LWP (short for "Library for WWW in Perl") is a very popular group of Perl modules for accessing data on the Web. Like most Perl module-distributions, each of LWP's component modules comes with documentation that is a complete reference to its interfa...

OALDERS/libwww-perl-6.77 - 11 Mar 2024 00:58:25 UTC - Search in distribution

WWW::Comix - programmatically access comics on the web River stage zero No dependents

This modules eases the programmatical access to comic publishing sites. It deals with the differences in any of them, providing you with an abstraction layer that hides all the weird bits. New sites can be added easily by means of its plugin system. ...

POLETTIX/WWW-Comix-0.1.1 - 04 Sep 2008 01:05:40 UTC - Search in distribution

perlfaq9 - Web, Email and Networking River stage zero No dependents

This section deals with questions related to running web sites, sending and receiving email as well as general networking. Should I use a web framework? Yes. If you are building a web site with any level of interactivity (forms / users / databases), ...

ETHER/perlfaq-5.20240218 - 18 Feb 2024 17:37:36 UTC - Search in distribution

WWW::Robot - configurable web traversal engine (for web robots & agents) River stage one • 2 direct dependents • 2 total dependents

This module implements a configurable web traversal engine, for a *robot* or other web agent. Given an initial web page (*URL*), the Robot will get the contents of that page, and extract all links on the page, adding them to a list of URLs to visit. ...

KVENTIN/WWW-Robot-0.026 - 07 Aug 2009 13:21:26 UTC - Search in distribution

HTML::AsHTML - Return The same HTML document as was put in. River stage zero No dependents

The *HTML::AsHTML* an HTML parser that tries to return exactly what was parsed. In the process, it will do certain fixes to the HTML, (such as adding quotes to all values in start tags). As such, when it works on correct html, it's just a glorified w...

MIKEDLR/Link_Controller-0.037 - 09 Feb 2002 18:12:34 UTC - Search in distribution

IT::perlfaq9 River stage zero No dependents

ENRYS/POD2-IT-0.13 - 26 Aug 2009 16:35:44 UTC - Search in distribution

WWW::Find - Web Resource Finder River stage zero No dependents

WWW::Find simplifies the task of searching the web for specific types of information. The inspiration for this project came from the recursive website mirroring program, w3mir. WWW::Find is similar to w3mir, but with a more general feature set. In a ...

BROOM/WWW-Find-0.07 - 11 Mar 2004 18:01:04 UTC - Search in distribution

Bundle::OpenXPKI - A bundle to install modules required for OpenXPKI River stage zero No dependents

This bundle helps with the installation of modules required for OpenXPKI. OpenXPKI is an open source trust center software which aims to create an enterprise-scale PKI solution. For more information see http://www.openxpki.org...

ALECH/Bundle-OpenXPKI-0.06 - 21 Nov 2008 16:12:39 UTC - Search in distribution

HTML::Tagset - data tables useful in parsing HTML River stage four • 29 direct dependents • 3511 total dependents

This module contains several data tables useful in various kinds of HTML parsing operations. Note that all tag names used are lowercase. In the following documentation, a "hashset" is a hash being used as a set -- the hash conveys that its keys are t...

PETDANCE/HTML-Tagset-3.24 - 11 Mar 2024 17:32:52 UTC - Search in distribution

WWW::LinkRot - check web page link rot River stage zero No dependents

Scan HTML files for links, try to access the links, and make a report. The HTML files need to be in UTF-8 encoding. This module is intended for people who run web sites to run, for example, periodic checks over a large number of HTML files to find al...

BKB/WWW-LinkRot-0.02 - 22 Apr 2021 23:31:06 UTC - Search in distribution

Nagios::WebTransact - Class for generating Nagios service checks of Web transactions. River stage zero No dependents

This module implements a check of a Web Transaction. A Web transaction is a sequence of web pages, often fill out forms, that accomplishes an enquiry or an update. Common examples are database searches and registration activities. A Web transaction i...

HOPCROFTS/Nagios-WebTransact-0.16 - 17 Mar 2004 04:12:09 UTC - Search in distribution

WebService::GoogleHack - Perl package that ties together all GoogleHack modules (Webservice::GoogleHack::Search, Webservice::GoogleHack::Spelling, Webservice::GoogleHack::Rate, and Webservice::GoogleHack::Text) to implement Natural Language Processing techniques that use the World Wide Web as a source of information. Use this package to access all the functionality of GoogleHack. River stage zero No dependents

WebService::GoogleHack is a PERL package that interacts with the Google API, and implements basic functions that allow the user to interact with Google and retrieve results in an easy to use format. GoogleHack also implements and extends a number of ...

PRATH/WebService-GoogleHack-0.15 - 21 Apr 2005 14:38:30 UTC - Search in distribution

Apache::MiniWiki - Miniature Wiki for Apache River stage zero No dependents

Apache::MiniWiki is a simplistic Wiki for Apache. It doesn't have much uses besides very simple installations where hardly any features are needed. What is does support though is: - storage of Wiki pages in RCS - templates through HTML::Template - te...

KERW/Apache-MiniWiki-0.92 - 08 Nov 2003 20:36:42 UTC - Search in distribution

ASNMTAP::Asnmtap::Plugins::WebTransact River stage zero No dependents

This module implements a check of a Web Transaction. A Web transaction is a sequence of web pages, often fill out forms, that accomplishes an enquiry or an update. Common examples are database searches and registration activities. A Web transaction i...

ASNMTAP/ASNMTAP-3.002003 - 10 Feb 2011 08:31:59 UTC - Search in distribution

WWW::Crawler::Lite - A single-threaded crawler/spider for the web. River stage zero No dependents

"WWW::Crawler::Lite" is a single-threaded spider/crawler for the web. It can be used within a mod_perl, CGI or Catalyst-style environment because it does not fork or use threads. The callback-based interface is fast and simple, allowing you to focus ...

JOHND/WWW-Crawler-Lite-0.005 - 08 Jan 2012 06:48:53 UTC - Search in distribution

URI::Find::Simple - a simple interface to URI::Find River stage two • 7 direct dependents • 48 total dependents

URI::Find is all very well, but sometimes you just want a list of the links in a given piece of text, or you want to change all the urls in some text somehow, and don't want to mess with callback interfaces. This module uses URI::Find, but hides the ...

NEILB/URI-Find-Simple-1.07 - 14 Mar 2021 23:43:48 UTC - Search in distribution

AnyEvent::Net::Curl::Queued - Moo wrapper for queued downloads via Net::Curl & AnyEvent River stage zero No dependents

AnyEvent::Net::Curl::Queued (a.k.a. YADA, *Yet Another Download Accelerator*) is an efficient and flexible batch downloader with a straight-forward interface capable of: * create a queue; * append/prepend URLs; * wait for downloads to end (retry on e...

SYP/AnyEvent-Net-Curl-Queued-0.049 - 07 Apr 2021 07:02:55 UTC - Search in distribution
34 results (0.044 seconds)