Bug#665274: perl-base: $utf8string ~= /regex/ fails

Dominic Hargreaves dom at earth.li
Sun Apr 8 16:05:03 UTC 2012


fixed 665274 5.14.2-9
thanks

On Thu, Mar 22, 2012 at 07:08:13PM +0100, Bodo Eggert wrote:
> Some versions of perl, including 5.10.1-17squeeze3, have problems matching utf-8 characters:
> 
> $ LANG=en_US.UTF-8 perl -e 'use utf8; "Herbert Grönemeyer" =~ /(.*?)\s*\(with (.*)\)$/i'
> Malformed UTF-8 character (unexpected continuation byte 0xb6, with no preceding start byte) in pattern match (m//) at -e line 1.
> $ _
> 
> More info can be found at e.g. http://www.perlmonks.org/?node_id=843208

This test succeeds with the perl 5.14 in sid/wheezy. As there have been
lots of Unicode-related changes between 5.10 and 5.14, I'm sure not sure
that finding and fixing this bug in 5.10 will be practical, especially
given the need to not introduced unwanted behavioural changes in squeeze.

Dominic.

-- 
Dominic Hargreaves | http://www.larted.org.uk/~dom/
PGP key 5178E2A5 from the.earth.li (keyserver,web,email)






More information about the Perl-maintainers mailing list