Bug#499606: ITP: tika -- a Java library for extracting textual information from various documents
Jan-Pascal van Best
janpascal at vanbest.org
Sat Sep 20 11:59:23 UTC 2008
Package: wnpp
Severity: wishlist
Owner: "Jan-Pascal van Best" <janpascal at vanbest.org>
* Package name : tika
Version : 0.2-SNAPSHOT
Upstream Author : Jukka Zitting <jukka at apache.org> and others
* URL : http://incubator.apache.org/tika/
* License : Apache 2.0
Programming Lang: Java
Description : a Java library for extracting textual information from various documents
Apache Tika is a toolkit for detecting and extracting metadata and structured
text content from various documents using existing parser libraries.
-- System Information:
Debian Release: lenny/sid
APT prefers testing
APT policy: (990, 'testing'), (400, 'unstable')
Architecture: amd64 (x86_64)
More information about the pkg-java-maintainers
mailing list