[pkg-lynx-maint] Bug#808949: Bug#808949: U+200F RIGHT-TO-LEFT MARK shows up on screen

Axel Beckert abe at debian.org
Mon Dec 28 13:27:55 UTC 2015


Control: tag -1 -moreinfo

Hi Jidanni,

積丹尼 Dan Jacobson wrote:
> AB> See the attached screenshot for how it looks here.
> No attachment found, but no big deal.

Ah, sorry, my fault. Attached now.

> >> STARTFILE:file:/home/jidanni/wwwoffle-chunks/index.html
> OK I got rid of that. Problem remains.

Ok, so wwwoflle is not the cause.

> AB> Another thing I can imagine are locale settings. Here are mine:
> 
> AB> → env | egrep 'LANG|LC_'
> AB> LANGUAGE=en_GB:en
> AB> LANG=C.UTF-8
> 
> $ env | egrep 'LANG|LC_'
> LC_MESSAGES=C
> LC_COLLATE=C
> LANG=zh_TW.UTF-8
> LC_CTYPE=zh_TW.UTF-8
> 
> Ah ha, no bug when using
> # su - nobody -c 'lynx "http://radioscanningtw.jidanni.org/index.php?title=特殊:分類&uselang=en"'
> 
> OK perhaps you can reproduce this:
> $ env - lynx "http://radioscanningtw.jidanni.org/index.php?title=特殊:分類&uselang=en"
> makes
>    * 123.4000 ~@~O    (1 member)
>    * 138.3125 ~@~O    (1 member)
> when answering "VT100" to "what terminal?"

I can reproduce this output (except the question as I didn't unset
$TERM) with

env -uLANG lynx "http://radioscanningtw.jidanni.org/index.php?title=特殊:分類&uselang=en"

But that looks a little bit expected: If no UTF-8 is available, Lynx
tries to find replacements and "~@" looks like a good replacement for
some only indirectly visible character.

I also get it with

env -uLANGUAGE LC_MESSAGES=C LC_COLLATE=C LANG=zh_TW.UTF-8 LC_CTYPE=zh_TW.UTF-8 lynx "http://radioscanningtw.jidanni.org/index.php?title=特殊:分類&uselang=en"

But I guess this is because I have no zh_TW.UTF-8 locales installed.
So I tried it with this:

env -uLANGUAGE LC_MESSAGES=C LC_COLLATE=C LANG=en_GB.UTF-8 LC_CTYPE=en_GB.UTF-8 lynx "http://radioscanningtw.jidanni.org/index.php?title=特殊:分類&uselang=en"

That way I get again what's on my screenshot -- no U+200F visible.

Basically I can switch between those two representations (screenshot
and the "~@~O" view) by switching between LC_CTYPE=C and
LC_CTYPE=C.UTF-8. Unfortunately I was never able to reproduce what's
on your screenshot -- and I'm currently out of ideas what else could
have caused this. I also found no differences between
TERM=xterm-256color (what I have) and TERM=xterm.

What $TERM do you have set?

		Regards, Axel
-- 
 ,''`.  |  Axel Beckert <abe at debian.org>, http://people.debian.org/~abe/
: :' :  |  Debian Developer, ftp.ch.debian.org Admin
`. `'   |  4096R: 2517 B724 C5F6 CA99 5329  6E61 2FF9 CD59 6126 16B5
  `-    |  1024D: F067 EA27 26B9 C3FC 1486  202E C09E 1D89 9593 0EDE
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 2015-12-28-122204_314x106_scrot.png
Type: image/png
Size: 4120 bytes
Desc: not available
URL: <http://lists.alioth.debian.org/pipermail/pkg-lynx-maint/attachments/20151228/71df99a0/attachment.png>


More information about the pkg-lynx-maint mailing list