Bug#787821: libhtml-parser-perl: encode_entities() convert chars to à instead of their proper entity
gregor herrmann
gregoa at debian.org
Fri Jun 5 13:22:01 UTC 2015
On Fri, 05 Jun 2015 14:34:42 +0200, Mathieu ROY wrote:
> Ok, so after further testing, it turns out that if I change the coding of the
> string from UTF-8 to ISO-8859..., it encode to the proper entities.
Good.
> I obviously can adjust the script to pre convert UTF-8 to ISO-8859
Or just add "use utf8;" to your script if it contains utf8-encoded
strings.
> but it
> should be at least documented (but I dont see any reason why encode_entities
> should actually not be able to deal with UTF-8)
That's how encoding in perl works in general, and I'm sure it's
documented somewhere :)
(I just don't find the correct perldoc right now ...)
Cheers,
gregor
--
.''`. Homepage: http://info.comodo.priv.at/ - OpenPGP key 0xBB3A68018649AA06
: :' : Debian GNU/Linux user, admin, and developer - https://www.debian.org/
`. `' Member of VIBE!AT & SPI, fellow of the Free Software Foundation Europe
`- NP: Treibhaus: Garish
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 949 bytes
Desc: Digital Signature
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20150605/8ac80b78/attachment.sig>
More information about the pkg-perl-maintainers
mailing list