Bug#499606: ITP: tika -- a Java library for extracting textual information from various documents

Jan-Pascal van Best janpascal at vanbest.org
Sat Sep 20 11:59:23 UTC 2008

Package: wnpp
Severity: wishlist
Owner: "Jan-Pascal van Best" <janpascal at vanbest.org>

* Package name    : tika
  Version         : 0.2-SNAPSHOT
  Upstream Author : Jukka Zitting <jukka at apache.org> and others
* URL             : http://incubator.apache.org/tika/
* License         : Apache 2.0
  Programming Lang: Java
  Description     : a Java library for extracting textual information from various documents

Apache Tika is a toolkit for detecting and extracting metadata and structured
text content from various documents using existing parser libraries.

-- System Information:
Debian Release: lenny/sid
  APT prefers testing
  APT policy: (990, 'testing'), (400, 'unstable')
Architecture: amd64 (x86_64)

More information about the pkg-java-maintainers mailing list