Search results for "HTML::LinkExtor"
HTML::LinkExtor - Extract links from an HTML document
*HTML::LinkExtor* is an HTML parser that extracts links from an HTML document. The *HTML::LinkExtor* is a subclass of *HTML::Parser*. This means that the document should be given to the parser by calling the $p->parse() or $p->parse_file() methods....
OALDERS/HTML-Parser-3.82 - 13 Mar 2024 20:11:51 UTC - Search in distribution- HTML::Parser - HTML parser class
HTML::SimpleLinkExtor - Extract links from HTML
This is a simple HTML link extractor designed for the person who does not want to deal with the intricacies of "HTML::Parser" or the de-referencing needed to get links out of "HTML::LinkExtor". You can extract all the links or some of the links (base...
NHORNE/HTML-SimpleLinkExtor-1.273 - 13 Nov 2020 23:31:59 UTC - Search in distribution- linktractor - extract links from HTML
HTML::RelExtor - Extract "rel" and "rev" information from LINK and A tags.
HTML::RelExtor is a HTML parser module to extract relationship information from "A" and LINK HTML tags....
MIYAGAWA/HTML-RelExtor-0.03 - 12 Apr 2009 03:20:12 UTC - Search in distribution
lwptut - An LWP Tutorial
LWP (short for "Library for WWW in Perl") is a very popular group of Perl modules for accessing data on the Web. Like most Perl module-distributions, each of LWP's component modules comes with documentation that is a complete reference to its interfa...
OALDERS/libwww-perl-6.77 - 11 Mar 2024 00:58:25 UTC - Search in distribution
WWW::Comix - programmatically access comics on the web
This modules eases the programmatical access to comic publishing sites. It deals with the differences in any of them, providing you with an abstraction layer that hides all the weird bits. New sites can be added easily by means of its plugin system. ...
POLETTIX/WWW-Comix-0.1.1 - 04 Sep 2008 01:05:40 UTC - Search in distribution
perlfaq9 - Web, Email and Networking
This section deals with questions related to running web sites, sending and receiving email as well as general networking. Should I use a web framework? Yes. If you are building a web site with any level of interactivity (forms / users / databases), ...
ETHER/perlfaq-5.20240218 - 18 Feb 2024 17:37:36 UTC - Search in distribution
WWW::Robot - configurable web traversal engine (for web robots & agents)
This module implements a configurable web traversal engine, for a *robot* or other web agent. Given an initial web page (*URL*), the Robot will get the contents of that page, and extract all links on the page, adding them to a list of URLs to visit. ...
KVENTIN/WWW-Robot-0.026 - 07 Aug 2009 13:21:26 UTC - Search in distribution
HTML::AsHTML - Return The same HTML document as was put in.
The *HTML::AsHTML* an HTML parser that tries to return exactly what was parsed. In the process, it will do certain fixes to the HTML, (such as adding quotes to all values in start tags). As such, when it works on correct html, it's just a glorified w...
MIKEDLR/Link_Controller-0.037 - 09 Feb 2002 18:12:34 UTC - Search in distribution- HTML::Link_Miner - return links in a document along with information
- Bundle::Link_Controller - List of perl modules needed for LinkController.
WWW::Find - Web Resource Finder
WWW::Find simplifies the task of searching the web for specific types of information. The inspiration for this project came from the recursive website mirroring program, w3mir. WWW::Find is similar to w3mir, but with a more general feature set. In a ...
BROOM/WWW-Find-0.07 - 11 Mar 2004 18:01:04 UTC - Search in distribution
Bundle::OpenXPKI - A bundle to install modules required for OpenXPKI
This bundle helps with the installation of modules required for OpenXPKI. OpenXPKI is an open source trust center software which aims to create an enterprise-scale PKI solution. For more information see http://www.openxpki.org...
ALECH/Bundle-OpenXPKI-0.06 - 21 Nov 2008 16:12:39 UTC - Search in distribution
HTML::Tagset - data tables useful in parsing HTML
This module contains several data tables useful in various kinds of HTML parsing operations. Note that all tag names used are lowercase. In the following documentation, a "hashset" is a hash being used as a set -- the hash conveys that its keys are t...
PETDANCE/HTML-Tagset-3.24 - 11 Mar 2024 17:32:52 UTC - Search in distribution
WWW::LinkRot - check web page link rot
Scan HTML files for links, try to access the links, and make a report. The HTML files need to be in UTF-8 encoding. This module is intended for people who run web sites to run, for example, periodic checks over a large number of HTML files to find al...
BKB/WWW-LinkRot-0.02 - 22 Apr 2021 23:31:06 UTC - Search in distribution
Nagios::WebTransact - Class for generating Nagios service checks of Web transactions.
This module implements a check of a Web Transaction. A Web transaction is a sequence of web pages, often fill out forms, that accomplishes an enquiry or an update. Common examples are database searches and registration activities. A Web transaction i...
HOPCROFTS/Nagios-WebTransact-0.16 - 17 Mar 2004 04:12:09 UTC - Search in distribution
WebService::GoogleHack - Perl package that ties together all GoogleHack modules (Webservice::GoogleHack::Search, Webservice::GoogleHack::Spelling, Webservice::GoogleHack::Rate, and Webservice::GoogleHack::Text) to implement Natural Language Processing techniques that use the World Wide Web as a source of information. Use this package to access all the functionality of GoogleHack.
WebService::GoogleHack is a PERL package that interacts with the Google API, and implements basic functions that allow the user to interact with Google and retrieve results in an easy to use format. GoogleHack also implements and extends a number of ...
PRATH/WebService-GoogleHack-0.15 - 21 Apr 2005 14:38:30 UTC - Search in distribution
Apache::MiniWiki - Miniature Wiki for Apache
Apache::MiniWiki is a simplistic Wiki for Apache. It doesn't have much uses besides very simple installations where hardly any features are needed. What is does support though is: - storage of Wiki pages in RCS - templates through HTML::Template - te...
KERW/Apache-MiniWiki-0.92 - 08 Nov 2003 20:36:42 UTC - Search in distribution
ASNMTAP::Asnmtap::Plugins::WebTransact
This module implements a check of a Web Transaction. A Web transaction is a sequence of web pages, often fill out forms, that accomplishes an enquiry or an update. Common examples are database searches and registration activities. A Web transaction i...
ASNMTAP/ASNMTAP-3.002003 - 10 Feb 2011 08:31:59 UTC - Search in distribution
WWW::Crawler::Lite - A single-threaded crawler/spider for the web.
"WWW::Crawler::Lite" is a single-threaded spider/crawler for the web. It can be used within a mod_perl, CGI or Catalyst-style environment because it does not fork or use threads. The callback-based interface is fast and simple, allowing you to focus ...
JOHND/WWW-Crawler-Lite-0.005 - 08 Jan 2012 06:48:53 UTC - Search in distribution
URI::Find::Simple - a simple interface to URI::Find
URI::Find is all very well, but sometimes you just want a list of the links in a given piece of text, or you want to change all the urls in some text somehow, and don't want to mess with callback interfaces. This module uses URI::Find, but hides the ...
NEILB/URI-Find-Simple-1.07 - 14 Mar 2021 23:43:48 UTC - Search in distribution
AnyEvent::Net::Curl::Queued - Moo wrapper for queued downloads via Net::Curl & AnyEvent
AnyEvent::Net::Curl::Queued (a.k.a. YADA, *Yet Another Download Accelerator*) is an efficient and flexible batch downloader with a straight-forward interface capable of: * create a queue; * append/prepend URLs; * wait for downloads to end (retry on e...
SYP/AnyEvent-Net-Curl-Queued-0.049 - 07 Apr 2021 07:02:55 UTC - Search in distribution