LogoopenSUSE Build Service > Projects
Sign Up | Log In

A CRF-based word segmenter in Java. Supports Arabic and Chinese

Some languages require  extensive token pre-processing, which is usually called segmentation.
The Stanford Word Segmenter currently supports Arabic and Chinese. The provided segmentation schemes have been found to work well for a variety of applications. 

Source Files

Filename Size Changed Actions
stanford-segmenter-3.2.0.tar.gz 247 MB over 4 years ago
stanford-segmenter.spec 1.38 KB over 4 years ago Download File

Comments for home:inescid:robots (0)