Bug#702742: libpod-sax-perl: emits latin-1 entities incorrectly
brian m. carlson
sandals at crustytoothpaste.net
Sun Mar 10 22:54:19 UTC 2013
Package: libpod-sax-perl
Version: 0.14-5
Severity: normal
Forwarded: https://rt.cpan.org/Ticket/Display.html?id=83867
Using named HTML entities causes them to be output as a sequence of two
characters:
=item * E<216> Ø E<Oslash>
produces:
<listitem>Ø Ø Ã˜ </listitem>
The %HTML_Escapes hash in Pod::SAX::Parser maps Oslash to "\xC3\x98",
which is then passed directly to the characters method. This is wrong
because the Perl SAX documentation states:
All handler methods have a single argument; a hash reference. Hash
values are Unicode strings (scalars with UTF-8 flag on).
So the appropriate mapping here is:
Oslash => "\xd8"
This is the case for all entities outside of ASCII that Pod::SAX
understands.
-- System Information:
Debian Release: 7.0
APT prefers unstable
APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386
Kernel: Linux 3.8-trunk-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/mksh-static
Versions of packages libpod-sax-perl depends on:
ii libxml-sax-perl 0.99+dfsg-2
ii libxml-sax-writer-perl 0.53-1
ii perl 5.14.2-19
libpod-sax-perl recommends no packages.
libpod-sax-perl suggests no packages.
-- no debconf information
--
brian m. carlson / brian with sandals: Houston, Texas, US
+1 832 623 2791 | http://www.crustytoothpaste.net/~bmc | My opinion only
OpenPGP: RSA v4 4096b: 88AC E9B2 9196 305B A994 7552 F1BA 225C 0223 B187
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 836 bytes
Desc: Digital signature
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20130310/6e91c234/attachment.pgp>
More information about the pkg-perl-maintainers
mailing list