
divsent - a perl script to mark sentences

divsent file+ divsent -stag=phrase file+ divsent -ptag=par file+ divsent -ftag=f file+

Given a set of files it makes a (almost) XML file with the sentences marked.
Optionally you can also mark the paragraphs and the files (see options ptag and ftag)

-p -- Paragraphs in the input text follow HTML notation (<p>)
-stag=tagname -- define the tag name for sentences (def. s)
-ptag=tagname -- define the tag name for paragraphs (def. none)
-ftag=tagname -- define the tag name for files (def. none). Filename is include as id attribute.
-n -- simple normalize the output
-l -- Each line is a different paragraph

J.Joao Almeira, jj@di.uminho.pt

Lingua::PT::PLN(3pm)
perl(1).