[xml/sgml-pkgs] Bug#287371: xsltproc: Probable memory leak (when using document()?)

Vincent Lefevre Vincent Lefevre <vincent@vinc17.org>, 287371@bugs.debian.org
Thu, 10 Feb 2005 00:44:20 +0100


On 2005-02-09 17:52:31 +0100, Mike Hommey wrote:
> retitle 287371 DTD should be cached when included several times

To be more accurate: this is the internal structure related to the
DTD (and internal subset) that should be cached (to be reused when
the DTD with internal subset is the same, thus not taking additional
memory when a second document is processed).

> Note that such "optimization" bugs are not really *that* important,

Well, it is important on machines that don't have enough memory.

> so i downgraded this bug to wishlist, even if a huge amount of
> memory is used. Also note that 138MB is not *that* much considering
> the number of documents and the DTD size.

By caching the DTD structures, one could gain something like a
factor 1000 on the asymptotic memory usage with small documents
(3 KB vs 3 MB for the DTD itself). This is quite significant.

-- 
Vincent Lefèvre <vincent@vinc17.org> - Web: <http://www.vinc17.org/>
100% accessible validated (X)HTML - Blog: <http://www.vinc17.org/blog/>
Work: CR INRIA - computer arithmetic / SPACES project at LORIA