[Python-apps-team] Bug#666994: ocrfeeder: Need to silence tesseract engine

Michael Hanke mih at debian.org
Tue Apr 3 08:08:25 UTC 2012


Package: ocrfeeder
Version: 0.7.7-2
Severity: normal

When I use ocrfeeder with the tesseract engine, the engine's version
output is merged into the actual OCR output, so the result is pretty
much useless. A fix would be to redirect the standard output of the
tesseract call to /dev/null and print only the text file that tesseract
writes. This engine config works for me:

% cat .ocrfeeder/engines/Tesseract.xml
<?xml version='1.0' encoding='UTF-8'?>
<engine><image_format>TIF</image_format><engine_path>/usr/bin/tesseract</engine_path><name>Tesseract</name><arguments>$IMAGE $FILE >/dev/null; cat $FILE.txt; rm $FILE $FILE.txt</arguments></engine>
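To illustrate why the redirect helps, here is a minimal Python 3 sketch. It assumes that ocrfeeder takes whatever the engine command prints to standard output as the OCR result, and that $IMAGE and $FILE are replaced with the input image and a temporary output base name. The function run_engine and its substitution logic are hypothetical and are not ocrfeeder's actual code.

```python
import os
import shlex
import subprocess
import tempfile
from string import Template


def run_engine(engine_path, arguments, image_path):
    """Run an engine command line and return everything it prints to stdout.

    Hypothetical stand-in for ocrfeeder's engine runner: the real
    placeholder substitution may differ.
    """
    # Temporary base name that the engine writes its result to ($FILE).
    fd, out_base = tempfile.mkstemp()
    os.close(fd)
    filled = Template(arguments).safe_substitute(
        IMAGE=shlex.quote(image_path),
        FILE=shlex.quote(out_base),
    )
    command = engine_path + " " + filled
    try:
        # The argument string contains ';' and '>', so it needs a shell.
        result = subprocess.run(
            command,
            shell=True,
            stdout=subprocess.PIPE,
            universal_newlines=True,
        )
        return result.stdout
    finally:
        # Clean up in case the argument string did not remove the files.
        for leftover in (out_base, out_base + ".txt"):
            if os.path.exists(leftover):
                os.remove(leftover)
```

With the argument string from the config above, anything the engine itself prints to standard output is discarded, and only the output of the cat command is returned.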


-- System Information:
Debian Release: wheezy/sid
Architecture: i386 (i686)

Kernel: Linux 3.1.0-1-686-pae (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ocrfeeder depends on:
ii  cuneiform            1.1.0+dfsg-2
ii  ghostscript          9.05~dfsg-3
ii  gocr                 0.49-1
ii  ocrad                0.21-3
ii  python               2.7.2-10
ii  python-enchant       1.6.5-2
ii  python-gtk2          2.24.0-3
ii  python-gtkspell      2.25.3-11
ii  python-imaging-sane  1.1.7-4
ii  python-pygoocanvas   0.14.1-1+b3
ii  python-reportlab     2.5-1.1
ii  python2.6            2.6.7-4
ii  python2.7            2.7.2-8
ii  tesseract-ocr        3.02.01-4

Versions of packages ocrfeeder recommends:
ii  unpaper  0.3-1
ii  yelp     2.30.1+webkit-1+b1

ocrfeeder suggests no packages.

-- no debconf information
