[sane-devel] scanimage / tesseract interoperability
simon.matter at invoca.ch
Sat May 10 13:41:50 UTC 2014
> Tesseract is an open source OCR program. It can already
> produce searchable PDF and will soon support streaming.
> It would be fun to support something like this:
> scanimage --batch | tesseract - - pdf > searchable.pdf
> To make this work nicely, scanimage would need to
> print the name of each file to stdout after it is written.
We had a different requirement for batch processing and added a
--batch-script option to our SANE build. Maybe it could be useful for you,
patch is attached.
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 4457 bytes
Desc: not available
More information about the sane-devel