Module for splitting text into sentences.

Edit Package perl-Lingua-EN-Sentence

The 'Lingua::EN::Sentence' module contains the function get_sentences,
which splits text into its constituent sentences, based on a regular
expression and a list of abbreviations (built in and given).

Certain well know exceptions, such as abreviations, may cause incorrect
segmentations. But some of them are already integrated into this code and
are being taken care of. Still, if you see that there are words causing the
get_sentences() to fail, you can add those to the module, so it notices
them.

Refresh
Refresh
Source Files
Filename Size Changed
Lingua-EN-Sentence-0.25.tar.gz 0000006994 6.83 KB
perl-Lingua-EN-Sentence.changes 0000000447 447 Bytes
perl-Lingua-EN-Sentence.spec 0000002196 2.14 KB
Latest Revision
Adrian Schröter's avatar Adrian Schröter (adrianSuSE) committed (revision 1)
branched from openSUSE:Factory
Comments 0
openSUSE Build Service is sponsored by