The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.
#!/usr/bin/perl -s
use strict;
use vars qw( $b $f );

use HTML::SimpleLinkExtor;

my $extor = HTML::SimpleLinkExtor->new( defined $b ? $b : () );

if( defined $f ) {
	$extor->parse_url( $f );
	}
else {
	my $html = do { local $/; <STDIN> };
	$extor->parse( $html );
	}

{
local $, = "\n";

print $extor->links, '';
}

=pod

=encoding utf8

=head1 NAME

linktractor - extract links from HTML

=head1 SYNOPSIS

	% linktractor fileA.html fileB.html

	% linktractor -f=http://www.perl.com

	% lwp-request http://www.example.com | linktractor

	% lwp-request http://www.example.com | linktractor -b=http://www.example.com


=head1 DESCRIPTION

This is a small script that uses HTML::SimpleLinkExtractor to pull
all the HTML links out of the input HTML. It can take input from
files you specify on the command line (or standard input), or fetch
a URL.

=head2 Options

=over 4

=item -b=base

The C<-b> switch sets the base URL to resolve relative URLs in the
input.

=item -f=fetch_url

Instead of reading from files specified on the command line or
standard input, fetch this URL and use it as input.

=back

=head1 SOURCE AVAILABILITY

This module is in Github

	https://github.com/briandfoy/html-simplelinkextor

=head1 AUTHORS

brian d foy, C<< <bdfoy@cpan.org> >>

=head1 COPYRIGHT

Copyright © 2007-2018, brian d foy <bdfoy@cpan.org>. All rights reserved.

This program is free software; you can redistribute it and/or modify
it under the same terms as Perl itself.

=head1 LICENSE

You may use HTML::SimpleLinkExtor under the terms of the Artistic
License 2.0.

=cut

1;