Bug#787821: libhtml-parser-perl: encode_entities() convert chars to à instead of their proper entity

gregor herrmann gregoa at debian.org
Fri Jun 5 13:22:01 UTC 2015


On Fri, 05 Jun 2015 14:34:42 +0200, Mathieu ROY wrote:

> Ok, so after further testing, it turns out that if I change the coding of the 
> string from UTF-8 to ISO-8859..., it encode to the proper entities.

Good.
 
> I obviously can adjust the script to pre convert UTF-8 to ISO-8859 

Or just add "use utf8;" to your script if it contains utf8-encoded
strings.

> but it 
> should be at least documented (but I dont see any reason why encode_entities 
> should actually not be able to deal with UTF-8)

That's how encoding in perl works in general, and I'm sure it's
documented somewhere :)
(I just don't find the correct perldoc right now ...)


Cheers,
gregor
-- 
 .''`.  Homepage: http://info.comodo.priv.at/ - OpenPGP key 0xBB3A68018649AA06
 : :' : Debian GNU/Linux user, admin, and developer -  https://www.debian.org/
 `. `'  Member of VIBE!AT & SPI, fellow of the Free Software Foundation Europe
   `-   NP: Treibhaus: Garish
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 949 bytes
Desc: Digital Signature
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20150605/8ac80b78/attachment.sig>


More information about the pkg-perl-maintainers mailing list