A tool to make "sandwich" OCR pdf files

Edit Package pdfsandwich
http://www.tobias-elze.de/pdfsandwich/

Pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only
images (no text) will be processed by optical character recognition (OCR) and the
text will be added to each page invisibly "behind" the images.

pdfsandwich is a command line tool which is supposed to be useful to OCR scanned
books or journals. It is able to recognize the page layout even for multicolumn
text.

Refresh
Refresh
Source Files
Filename Size Changed
pdfsandwich-0.1.7.tar.bz2 0000017461 17.1 KB
pdfsandwich.changes 0000001695 1.66 KB
pdfsandwich.spec 0000001964 1.92 KB
Latest Revision
Martin Pluskal's avatar Martin Pluskal (pluskalm) accepted request 738638 from Christophe Giboudeaux's avatar Christophe Giboudeaux (cgiboudeaux) (revision 3)
- Update to 0.1.7:
  * New option -omp_thread_limit that controls number of threads for tesseract
  and prevents tesseract >=4 from freezing when multiple threads are used
  * Correction of several typos
  * Introduction of a global temporary directory that is only user readable
- Run spec-cleaner
- Fix the license tag. pdfsandwich is GPL-2.0-or-later.
Comments 0
openSUSE Build Service is sponsored by