[sane-devel] scanimage / tesseract interoperability

Simon Matter simon.matter at invoca.ch
Sat May 10 13:41:50 UTC 2014


> Tesseract is an open source OCR program. It can already
> produce searchable PDF and will soon support streaming.
> It would be fun to support something like this:
>
>    scanimage --batch | tesseract - - pdf > searchable.pdf
>
> To make this work nicely, scanimage would need to
> print the name of each file to stdout after it is written.
>
> Thoughts?

Hi,

We had a different requirement for batch processing and added a
--batch-script option to our SANE build. Maybe it could be useful for you,
patch is attached.

Regards,
Simon
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sane-backends-1.0.21-scanimage-batch-script.patch
Type: text/x-diff
Size: 4457 bytes
Desc: not available
URL: <http://lists.alioth.debian.org/pipermail/sane-devel/attachments/20140510/773e7eb8/attachment.patch>


More information about the sane-devel mailing list