home:inescid:robots / stanford-segmenter

A CRF-based word segmenter in Java. Supports Arabic and Chinese.

Some languages require extensive pre-processing before text can be split into tokens; this step is usually called segmentation.
The Stanford Word Segmenter currently supports Arabic and Chinese. The provided segmentation schemes have been found to work well for a variety of applications.
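
To make the idea of segmentation concrete, here is a minimal sketch of a much simpler technique, greedy forward maximum matching against a word list. It is not the CRF model this package ships and does not use its API; the function name `max_match` and the toy `LEXICON` are hypothetical and exist only for illustration.

```python
def max_match(text, lexicon):
    """Greedy forward maximum matching (illustrative only, not the CRF approach).

    At each position, take the longest substring found in the lexicon;
    fall back to a single character when nothing matches.
    """
    longest = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(longest, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical toy lexicon; a real segmenter learns from annotated corpora.
LEXICON = {"我", "喜欢", "北京", "北京大学", "大学"}

if __name__ == "__main__":
    print(" ".join(max_match("我喜欢北京大学", LEXICON)))
```

A dictionary lookup like this cannot handle unknown words or ambiguous boundaries, which is the kind of case a statistical model such as a CRF is meant to address.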

Checkout Package
osc -A https://api.opensuse.org checkout home:inescid:robots/stanford-segmenter && cd $_

Source Files

Filename	Size	Changed
stanford-segmenter-3.2.0.tar.gz	247 MB (258,877,056 bytes)	over 10 years ago
stanford-segmenter.spec	1.38 KB (1,414 bytes)	over 10 years ago

Latest Revision

Filipe Correia (filipecorreia) committed over 10 years ago (revision 5)
