Unsupervised text tokenizer for Neural Network-based text generation.
https://github.com/google/sentencepiece
SentencePiece is an unsupervised text tokenizer and detokenizer mainly for
Neural Network-based text generation systems where the vocabulary size is
predetermined prior to the neural model training.
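To illustrate what a predetermined vocabulary size means in practice, here is a minimal sketch of a byte-pair-encoding style trainer that stops merging once the vocabulary reaches a target size. It is a toy written with the standard library only, not SentencePiece's implementation or API; the function names `train_bpe`, `merge_pair`, `encode`, and `decode` are hypothetical. The one detail borrowed from SentencePiece is the "▁" (U+2581) marker that stands in for whitespace, which lets detokenization recover the spaces from the pieces themselves.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace each adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def train_bpe(corpus, vocab_size):
    """Learn merges until the vocabulary holds `vocab_size` symbols (toy sketch)."""
    # Each word becomes a tuple of characters, prefixed with the "▁" marker.
    words = Counter(tuple("▁" + w) for line in corpus for w in line.split())
    vocab = {s for word in words for s in word}
    merges = []
    while len(vocab) < vocab_size:
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for word, freq in words.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Pick the most frequent pair; ties are broken by the pair itself.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        vocab.add(best[0] + best[1])
        merged = Counter()
        for word, freq in words.items():
            merged[merge_pair(word, best)] += freq
        words = merged
    return vocab, merges


def encode(text, merges):
    """Split `text` into pieces by replaying the learned merges in order."""
    pieces = []
    for w in text.split():
        symbols = tuple("▁" + w)
        for pair in merges:
            symbols = merge_pair(symbols, pair)
        pieces.extend(symbols)
    return pieces


def decode(pieces):
    """Join pieces and turn the "▁" markers back into spaces."""
    return "".join(pieces).replace("▁", " ").strip()


vocab, merges = train_bpe(["low lower lowest", "new newer newest"], vocab_size=20)
pieces = encode("lowest newer", merges)
print(len(vocab), pieces, decode(pieces))
```

The vocabulary size is fixed before any model sees the data, so the downstream network can size its embedding table up front. The real library provides considerably more than this sketch does.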
- Links to home:bird...learning / python-sentencepiece
- Has a link diff
- Checkout Package
osc -A https://api.opensuse.org checkout home:illuusio:python:3.11/python-sentencepiece && cd $_
Source Files
Filename | Size | Changed |
---|---|---|
_link | 140 Bytes | |
python-sentencepiece.spec | 4.39 KB | |
sentencepiece-0.2.0.tar.gz | 2.51 MB | |
sentencepiece-fix-build.patch | 1.04 KB | |