Tesseract Open Source OCR Engine
http://code.google.com/p/tesseract-ocr
Tesseract is a free optical character recognition engine originally developed
at Hewlett-Packard and currently developed by Google. It is a raw OCR engine -
it has no document layout analysis, no output formatting, and no graphical user
interface. It only processes a TIFF or BMP image of a single column and creates
text from it. It can detect fixed pitch vs proportional text. The engine was in
the top 3 in terms of character accuracy in 1995. The source code will read a
binary, grey or color image and output text.
Tesseract can process English, French, Italian, German, Spanish, Brazilian
Portuguese and Dutch and can be trained to work in other languages as well.
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:11.4:Contrib/tesseract && cd $_ - Create Badge
Refresh
Source Files
| Filename | Size | Changed |
|---|---|---|
| bul.traineddata.gz | 0000848731 829 KB | |
| cat.traineddata.gz | 0000995008 972 KB | |
| ces.traineddata.gz | 0001059966 1.01 MB | |
| chi_sim.traineddata.gz | 0019732398 18.8 MB | |
| chi_tra.traineddata.gz | 0027512772 26.2 MB | |
| dan-frak.traineddata.gz | 0000683525 668 KB | |
| dan.traineddata.gz | 0000958449 936 KB | |
| deu.traineddata.gz | 0000965684 943 KB | |
| ell.traineddata.gz | 0000944284 922 KB | |
| eng.traineddata.gz | 0000742852 725 KB | |
| fin.traineddata.gz | 0000959833 937 KB | |
| fra.traineddata.gz | 0000933372 911 KB | |
| hun.traineddata.gz | 0001008061 984 KB | |
| ind.traineddata.gz | 0000836752 817 KB | |
| ita.traineddata.gz | 0000939956 918 KB | |
| jpn.traineddata.gz | 0014604738 13.9 MB | |
| kor.traineddata.gz | 0006032090 5.75 MB | |
| lav.traineddata.gz | 0001018176 994 KB | |
| lit.traineddata.gz | 0001012936 989 KB | |
| nld.traineddata.gz | 0000954151 932 KB | |
| nor.traineddata.gz | 0000951018 929 KB | |
| pol.traineddata.gz | 0001060352 1.01 MB | |
| por.traineddata.gz | 0000911645 890 KB | |
| ron.traineddata.gz | 0000929925 908 KB | |
| rus.traineddata.gz | 0000848490 829 KB | |
| slk.traineddata.gz | 0001091624 1.04 MB | |
| slv.traineddata.gz | 0000930221 908 KB | |
| spa.traineddata.gz | 0000910992 890 KB | |
| srp.traineddata.gz | 0000977674 955 KB | |
| swe.traineddata.gz | 0000959911 937 KB | |
| tesseract-3.00-nonvoid.patch | 0000000793 793 Bytes | |
| tesseract-3.00.tar.gz | 0003436992 3.28 MB | |
| tesseract-package-creator.py | 0000007308 7.14 KB | |
| tesseract.changes | 0000000936 936 Bytes | |
| tesseract.spec | 0000015575 15.2 KB | |
| tesseract.spec.in | 0000001610 1.57 KB | |
| tgl.traineddata.gz | 0000978138 955 KB | |
| tur.traineddata.gz | 0000933401 912 KB | |
| ukr.traineddata.gz | 0000927741 906 KB | |
| vie.traineddata.gz | 0001575539 1.5 MB |
Comments 0