A tool to make "sandwich" OCR pdf files
http://www.tobias-elze.de/pdfsandwich/
Pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only
images (no text) will be processed by optical character recognition (OCR) and the
text will be added to each page invisibly "behind" the images.
pdfsandwich is a command line tool which is supposed to be useful to OCR scanned
books or journals. It is able to recognize the page layout even for multicolumn
text.
- Sources inherited from project Publishing
-
3
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout home:redwil:15.4/pdfsandwich && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
pdfsandwich-0.1.7.tar.bz2 | 0000017461 17.1 KB | |
pdfsandwich.changes | 0000001695 1.66 KB | |
pdfsandwich.spec | 0000001964 1.92 KB |
Latest Revision
Martin Pluskal (pluskalm)
accepted
request 738638
from
Christophe Giboudeaux (cgiboudeaux)
(revision 3)
- Update to 0.1.7: * New option -omp_thread_limit that controls number of threads for tesseract and prevents tesseract >=4 from freezing when multiple threads are used * Correction of several typos * Introduction of a global temporary directory that is only user readable - Run spec-cleaner - Fix the license tag. pdfsandwich is GPL-2.0-or-later.
Comments 0