[Reproducible-builds] [PATCH 2/2] DocBook: Use a fixed encoding for output
Jonathan Corbet
corbet at lwn.net
Fri Sep 18 16:30:24 UTC 2015
On Mon, 14 Sep 2015 01:32:50 +0100
Ben Hutchings <ben at decadent.org.uk> wrote:
> > I guess I have to ask, though: doesn't it seem that having the docs
> > produced according to the current locale is the Right Thing to do? Users
> > have their locale set as it is for a reason, it seems like the production
> > of textual documents should respect their choice.
> >
> > Am I missing something here?
>
> Yes - the locale's character encoding applies to plain text, but rich
> text formats can have a locale-independent encoding which the viewer
> will automatically to the current locale's encoding.
>
> For HTML, the document encoding can be explicit in the document header
> (and is, in this case).
>
> Manual pages were already consistently encoded in UTF-8, as this is the
> default behaviour of DocBook-XSL (and is what man-db prefers as input).
>
> PDF and Postscript documents have arbitrary and explicit mappings from
> character numbers (or names) to glyphs, and PDF documents normally have
> a mapping from glyphs back to Unicode code points to support searching
> and copying text.
OK, I guess you've talked me into it. Can I ask you for one last favor,
though: please resubmit this patch with a couple of tweaks:
- Based off current mainline, please (or docs-next, but that shouldn't
be necessary). The patch as sent doesn't apply.
- Could you add a comment to the check-lc_ctype proglet so that somebody
stumbling across it in the scripts directory knows why it's there?
Thanks,
jon
More information about the Reproducible-builds
mailing list