[xml/sgml-pkgs] Bug#287371: xsltproc: Probable memory leak (when using document()?)

Mike Hommey <mh@glandium.org>, 287371@bugs.debian.org
Wed, 9 Feb 2005 17:52:31 +0100


retitle 287371 DTD should be cached when included several times
severity 287371 wishlist
tag 287371 upstream
thanks

On Wed, Feb 09, 2005 at 05:38:54PM +0100, Vincent Lefevre <vincent@vinc17.org> wrote:
> On 2005-02-09 17:12:21 +0100, Mike Hommey wrote:
> > How big is the document you load with document()? How many times
> > does it get loaded? Could you provide me with the files?
> 
> The documents are small, but the DTD is very big (this is a DTD based
> on DocBook + MathML). Currently, about 50 documents are included.
> 
> I wanted to post a followup, but hadn't had the time yet. FYI, I had
> a discussion with Daniel on the LibXSLT mailing-list 10 days ago. In
> short, for some reason, the DTD structures are not reused each time
> a new document is parsed. IMHO, this could be solved by some form of
> cache (corresponding to the DTD + internal subset if any).
> 
> Technically, this bug could be regarded as a wishlist. But using so
> much memory should be regarded as a bug IMHO, unless the other XSLT
> processors have the same problem.
> 
> The title of the bug should be changed to something like "DTD
> structures should be shared/cached in case of multiple inclusions"
> (when possible, of course).

Thanks for the feedback.
Note that such "optimization" bugs are not really *that* important, so I
downgraded this bug to wishlist, even though a huge amount of memory is
used. Also note that 138MB is not *that* much considering the number of
documents and the DTD size.
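
For illustration only, the kind of cache suggested above (keyed on the DTD
plus the internal subset, if any) could be sketched roughly as follows. This
is not libxml2 or libxslt code: DtdCache, fake_parse and the file name
docbook-mathml.dtd are hypothetical names, and the parser is a stand-in for
whatever actually builds the DTD structures.

```python
from typing import Callable, Dict, Optional, Tuple

# Hypothetical stand-in for the parsed DTD structures; not a libxml2 type.
ParsedDtd = dict


class DtdCache:
    """Memoize parsed DTDs, keyed on (system id, internal subset)."""

    def __init__(self, parse: Callable[[str, Optional[str]], ParsedDtd]) -> None:
        self._parse = parse
        self._entries: Dict[Tuple[str, Optional[str]], ParsedDtd] = {}
        self.parse_count = 0  # how many times the expensive parse really ran

    def get(self, system_id: str, internal_subset: Optional[str] = None) -> ParsedDtd:
        # Documents with a different internal subset must not share an entry,
        # so the subset text is part of the key.
        key = (system_id, internal_subset)
        if key not in self._entries:
            self.parse_count += 1
            self._entries[key] = self._parse(system_id, internal_subset)
        return self._entries[key]


def fake_parse(system_id: str, internal_subset: Optional[str]) -> ParsedDtd:
    # Assumption: pretend this is the costly DTD parse.
    return {"system_id": system_id, "internal_subset": internal_subset}


cache = DtdCache(fake_parse)
# About 50 documents, as in the report, all referring to the same DTD.
dtds = [cache.get("docbook-mathml.dtd") for _ in range(50)]
```

With such a cache, the 50 documents would share one parsed DTD instead of
each holding its own copy; a document that declares an internal subset
would get a separate entry.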

Mike