[xml/sgml-pkgs] Bug#287371: xsltproc: Probable memory leak (when using document()?)

Vincent Lefevre <vincent@vinc17.org>, 287371@bugs.debian.org
Wed, 9 Feb 2005 17:38:54 +0100

On 2005-02-09 17:12:21 +0100, Mike Hommey wrote:
> How big is the document you load with document()? How many times does
> it get loaded? Could you provide me with the files?

The documents are small, but the DTD is very big (this is a DTD based
on DocBook + MathML). Currently, about 50 documents are included.

I wanted to post a followup, but haven't had the time yet. FYI, I had
a discussion with Daniel on the LibXSLT mailing list 10 days ago. In
short, for some reason, the DTD structures are not reused when a new
document is parsed. IMHO, this could be solved by some form of cache
(corresponding to the DTD + internal subset, if any).
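To make the cache idea concrete, here is a minimal sketch in Python. It
is not LibXSLT or libxml2 code: the DtdCache class, the fake_parse_dtd
stand-in and the "docbook-mathml.dtd" file name are all hypothetical and
only illustrate keying the cache on the DTD plus the internal subset.

```python
from typing import Callable, Dict, Optional, Tuple

# Hypothetical cache key: the DTD's system identifier plus the text of the
# document's internal subset (None when the document has none).
DtdKey = Tuple[str, Optional[str]]


class DtdCache:
    """Memoize parsed DTD structures so documents sharing a DTD reuse them."""

    def __init__(self, parse_dtd: Callable[[str, Optional[str]], object]) -> None:
        self._parse_dtd = parse_dtd  # expensive parser, supplied by the caller
        self._entries: Dict[DtdKey, object] = {}
        self.misses = 0  # number of times the DTD was actually parsed

    def get(self, system_id: str, internal_subset: Optional[str] = None) -> object:
        key = (system_id, internal_subset)
        if key not in self._entries:
            # First request for this DTD + internal subset: parse and store it.
            self.misses += 1
            self._entries[key] = self._parse_dtd(system_id, internal_subset)
        return self._entries[key]


def fake_parse_dtd(system_id: str, internal_subset: Optional[str]) -> object:
    # Stand-in for the real (costly) DTD parse; returns a new object per call.
    return {"system_id": system_id, "internal_subset": internal_subset}


cache = DtdCache(fake_parse_dtd)
# Assumed scenario: 50 included documents that all reference the same DTD.
shared = [cache.get("docbook-mathml.dtd") for _ in range(50)]
# A document with its own internal subset cannot share that entry.
custom = cache.get("docbook-mathml.dtd", "<!ENTITY foo 'bar'>")
print(cache.misses)  # prints 2
```

With such a scheme, the 50 documents would share one set of DTD
structures, and only a document with a different internal subset would
trigger a second parse.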

Technically, this bug could be regarded as a wishlist item. But using
so much memory should be regarded as a bug IMHO, unless the other XSLT
processors have the same problem.

The title of the bug should be changed to something like "DTD
structures should be shared/cached in case of multiple inclusions"
(when possible, of course).

Vincent Lefèvre <vincent@vinc17.org> - Web: <http://www.vinc17.org/>
100% accessible validated (X)HTML - Blog: <http://www.vinc17.org/blog/>
Work: CR INRIA - computer arithmetic / SPACES project at LORIA