Bug#420636: [xml/sgml-pkgs] Bug#420636: crashes on feeds that contain invalid utf-8 sequences

Joey Hess joeyh at debian.org
Mon Apr 23 18:34:25 UTC 2007


Mike Hommey wrote:
> On Mon, Apr 23, 2007 at 02:03:44PM -0400, Joey Hess <joeyh at debian.org> wrote:
> > Mike Hommey wrote:
> > > Such a lax xml parser is not an xml parser. This bug is therefore a
> > > wishlist bug.
> > 
> > Not being able to parse as many xml feeds out in the wild as other
> > language's parsers is not a bug?
> 
> The fact is they are *not* xml feeds.

The fact is that they are out there in the wild and tools are needed to
parse them. Do you read Planet Debian? I'll guarantee you that at least
one feed currently on there is not valid xml.

It seems that you are more interested in arguing semantics than actually
fixing the problem? (Which I've worked around in my code now anyway by
calling Encode::decode_utf8 on the feed if XML::Feed crashes.)

-- 
see shy jo
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
Url : http://lists.alioth.debian.org/pipermail/debian-xml-sgml-pkgs/attachments/20070423/f8136064/attachment.pgp


More information about the debian-xml-sgml-pkgs mailing list