Bug#382995: ITP: nekohtml -- HTML parser for Java
marcus at better.se
marcus at better.se
Mon Aug 14 14:55:27 UTC 2006
Package: wnpp
Severity: wishlist
* Package name : nekohtml
Version : 0.9.5
Upstream Author : Andy Clark
* URL or Web page : http://people.apache.org/~andyc/neko/doc/html/
* License : CyberNeko Software License, Version 1.0
Description : HTML parser for Java
This is a simple HTML scanner and tag balancer that enables
application programmers to parse HTML documents and access the
information using standard XML interfaces. The parser can scan HTML
files and "fix up" many common mistakes that human (and computer)
authors make in writing HTML documents. NekoHTML adds missing parent
elements; automatically closes elements with optional end tags; and
can handle mismatched inline element tags.
More information about the pkg-java-maintainers
mailing list