Guido Socher > HTML-TagReader-1.10 > tr_tagcontentgrep

Download:
HTML-TagReader-1.10.tar.gz

Annotate this POD

CPAN RT

New  1
Open  0
View/Report Bugs
Source  

NAME ^

tr_tagcontentgrep -- grep for a xml/sgml/html tag

SYNOPSIS ^

 tr_tagcontentgrep [-hl] regexp-pattern [file ...]

DESCRIPTION ^

tr_tagcontentgrep opens all files provided on the command line and searches for the given pattern in the tags. The search is not case sensitive.

tr_tagcontentgrep is part of the HTML::TagReader package but is an example that you can also do 'reading by tag' without HTML::TagReader. tr_tagcontentgrep uses plain perl and sets the $/ variable. Working without HTML::TagReader causes however problems when working with faulty html code where single '<'-characters appear somewhere in the text.

All space in the tags is reduced to max. one space. You can search for "a href" even if the original tag had multiple spaces between "a" and "href".

OPTIONS ^

-h this help

-l list filename and line number

EXAMPLE ^

tr_tagcontentgrep -l img file.html

would e.g print something like:

index.html:53: <IMG src="../images/transpix.gif" alt=""> index.html:257: <IMG SRC="../Logo.gif" width=128 height=53>

AUTHOR ^

tr_tagcontentgrep is part of the HTML::TagReader package and was written by Guido Socher [guido(at)linuxfocus.org]

syntax highlighting: