
NAME

divsent - a Perl script to mark sentences

SYNOPSIS

 divsent file+
 divsent -stag=phrase   file+
 divsent -ptag=par      file+
 divsent -ftag=f        file+

DESCRIPTION

Given a set of files, divsent produces an (almost) XML file in which each sentence is marked with a tag.

Optionally, paragraphs and files can also be marked (see the ptag and ftag options).

Options

-p -- Paragraphs in the input text follow HTML notation (<p>)

-stag=tagname -- define the tag name for sentences (default: s)

-ptag=tagname -- define the tag name for paragraphs (default: none)

-ftag=tagname -- define the tag name for files (default: none). The filename is included as the id attribute.

-n -- apply a simple normalization to the output

-l -- Each input line is treated as a separate paragraph
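
As a rough illustration of how these options might be combined, the sketch below builds a small sample file and runs divsent over it. It assumes that divsent is installed and on the PATH, that the options can be combined in a single call (the SYNOPSIS only shows them one at a time), and that the result is written to standard output. The file names sample.txt and sample.xml are hypothetical.

```shell
# Create a two-line sample input (sample.txt is a hypothetical file name).
printf '%s\n' 'Primeira frase. Segunda frase.' 'Outro parágrafo.' > sample.txt

# Assumption: options may be combined and the marked text goes to stdout.
# -l            treat each input line as a separate paragraph
# -stag=phrase  mark sentences with <phrase> instead of the default <s>
# -ptag=par     also mark paragraphs, using <par>
if command -v divsent >/dev/null 2>&1; then
    divsent -l -stag=phrase -ptag=par sample.txt > sample.xml
else
    echo "divsent not found in PATH" >&2
fi
```

The exact shape of the output depends on the installed version, so it is best to inspect sample.xml directly rather than rely on a particular layout.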

AUTHOR

J.Joao Almeida, jj@di.uminho.pt

SEE ALSO

Lingua::PT::PLN(3pm)

perl(1).