Alberto Manuel Brandão Simões > Lingua-PT-PLN > divsent

Download:
Lingua/Lingua-PT-PLN-0.17.tar.gz

Annotate this POD

CPAN RT

Open  0
Report a bug
Source  

NAME ^

divsent - a perl script to mark sentences

SYNOPSIS ^

 divsent file+
 divsent -stag=phrase   file+
 divsent -ptag=par      file+
 divsent -ftag=f        file+

DESCRIPTION ^

Given a set of files it makes a (almost) XML file with the sentences marked.

Optionally you can also mark the paragraphs and the files (see options ptag and ftag)

Options ^

-p -- Paragraphs in the input text follow HTML notation (<p>)

-stag=tagname -- define the tag name for sentences (def. s)

-ptag=tagname -- define the tag name for paragraphs (def. none)

-ftag=tagname -- define the tag name for files (def. none). Filename is include as id attribute.

-n -- simple normalize the output

-l -- Each line is a different paragraph

AUTHOR ^

J.Joao Almeira, jj@di.uminho.pt

SEE ALSO ^

Lingua::PT::PLN(3pm)

perl(1).