[Python-apps-team] Bug#666994: ocrfeeder: Need to silence tesseract engine
Michael Hanke
mih at debian.org
Tue Apr 3 08:08:25 UTC 2012
Package: ocrfeeder
Version: 0.7.7-2
Severity: normal
When I use ocrfeeder with the tesseract engine, the engine's version
output is merged into the actual OCR output, hence the result is pretty
much useless. A fix would be to send the output of the tesseract call to
/dev/null. This would be a working engine config:
% cat .ocrfeeder/engines/Tesseract.xml
<?xml version='1.0' encoding='UTF-8'?>
<engine><image_format>TIF</image_format><engine_path>/usr/bin/tesseract</engine_path><name>Tesseract</name><arguments>$IMAGE $FILE >/dev/null; cat $FILE.txt; rm $FILE $FILE.txt</arguments></engine>
-- System Information:
Debian Release: wheezy/sid
Architecture: i386 (i686)
Kernel: Linux 3.1.0-1-686-pae (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages ocrfeeder depends on:
ii cuneiform 1.1.0+dfsg-2
ii ghostscript 9.05~dfsg-3
ii gocr 0.49-1
ii ocrad 0.21-3
ii python 2.7.2-10
ii python-enchant 1.6.5-2
ii python-gtk2 2.24.0-3
ii python-gtkspell 2.25.3-11
ii python-imaging-sane 1.1.7-4
ii python-pygoocanvas 0.14.1-1+b3
ii python-reportlab 2.5-1.1
ii python2.6 2.6.7-4
ii python2.7 2.7.2-8
ii tesseract-ocr 3.02.01-4
Versions of packages ocrfeeder recommends:
ii unpaper 0.3-1
ii yelp 2.30.1+webkit-1+b1
ocrfeeder suggests no packages.
-- no debconf information
More information about the Python-apps-team
mailing list