[xml/sgml-pkgs] Bug#273600: marked as done (After switch to native transcoder, KOI8-R support in xerces23 is broken)

Debian Bug Tracking System owner@bugs.debian.org
Mon, 28 Mar 2005 10:34:11 -0800

Your message dated Mon, 28 Mar 2005 13:17:16 -0500
with message-id <20050328131714.3270055946.qww314159@soup.acv.apexcovantage.com>
and subject line xerces23 is being removed
has caused the attached Bug report to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what I am
talking about this indicates a serious mail system misconfiguration
somewhere.  Please contact me immediately.)

Debian bug tracking system administrator
(administrator, Debian Bugs database)

Received: (at submit) by bugs.debian.org; 27 Sep 2004 09:04:08 +0000
>From nikita@lvk.cs.msu.su Mon Sep 27 02:04:08 2004
Return-path: <nikita@lvk.cs.msu.su>
Received: from zigzag.lvk.cs.msu.su [] 
	by spohr.debian.org with esmtp (Exim 3.35 1 (Debian))
	id 1CBrQV-00036R-00; Mon, 27 Sep 2004 02:04:08 -0700
Received: from nikita by zigzag.lvk.cs.msu.su with local (Exim 3.36 #1 (Debian))
	id 1CBrLM-00063z-00; Mon, 27 Sep 2004 12:58:48 +0400
Content-Type: multipart/mixed; boundary="===============1551151242=="
MIME-Version: 1.0
From: "Nikita V. Youshchenko" <yoush@cs.msu.su>
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: After switch to native transcoder, KOI8-R support in xerces23 is broken
X-Mailer: reportbug 2.63
Date: Mon, 27 Sep 2004 12:58:47 +0400
Message-Id: <E1CBrLM-00063z-00@zigzag.lvk.cs.msu.su>
X-Scanner: exiscan *1CBrLM-00063z-00*wPH8MG6ipp6*
Delivered-To: submit@bugs.debian.org
X-Spam-Checker-Version: SpamAssassin 2.60-bugs.debian.org_2004_03_25 
	(1.212-2003-09-23-exp) on spohr.debian.org
X-Spam-Status: No, hits=-8.0 required=4.0 tests=BAYES_00,HAS_PACKAGE,

This is a multi-part MIME message sent by reportbug.

MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Content-Type: text/plain; charset="KOI8-R"
Content-Disposition: inline

Package: libxerces23
Version: 2.3.0-2
Severity: normal


After I upgraded the xerces23 packages from 2.3.0-2 to 2.3.0-4 (as part of a
larger upgrade), my users reported that their code that uses
xerces no longer works. After some investigation, I found the
following changelog entry:

  * Switch from GNU transcoder to native transcoder

This appears to break KOI8-R support.

Minimal example code, provided by one of our users, is attached. A sample run:

nikita@zigzag:~/adm/xerces> cat test2.xml
<?xml version="1.0" encoding="KOI8-R"?>
        <Text>Некий текст</Text>
nikita@zigzag:~/adm/xerces> ./t test2.xml

File parsed successfully
nikita@zigzag:~/adm/xerces> LD_LIBRARY_PATH=2.3.0-4/usr/lib ./t test2.xml
Fatal: An exception occurred! Type:TranscodingException, Message:Could not create a converter for encoding: KOI8-R  line: 1
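
For comparison, the "Could not create a converter" condition can be probed
outside xerces. The following is a minimal sketch, assuming a POSIX iconv is
available (as in glibc). has_iconv_converter is a hypothetical helper name, not
part of xerces; it only shows whether the platform's iconv knows an encoding,
which is presumably what the GNU transcoder build relied on.

```cpp
#include <iconv.h>
#include <string>

// Hypothetical helper (not a xerces API): returns true if the platform's
// iconv can build a converter from `encoding` to UTF-8.
bool has_iconv_converter(const std::string& encoding) {
    iconv_t cd = iconv_open("UTF-8", encoding.c_str());
    if (cd == (iconv_t)-1) {
        // Roughly analogous to "Could not create a converter for encoding".
        return false;
    }
    iconv_close(cd);
    return true;
}
```

Calling has_iconv_converter("KOI8-R") on the affected machine would indicate
whether the encoding is missing at the libc level or only in the xerces build.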

-- System Information:
Debian Release: 3.0
  APT prefers testing
  APT policy: (620, 'testing'), (600, 'unstable'), (550, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.7-1-k7-smp
Locale: LANG=ru_RU.KOI8-R, LC_CTYPE=ru_RU.KOI8-R

Versions of packages libxerces23 depends on:
ii  libc6                  2.3.2.ds1-16      GNU C Library: Shared libraries an
ii  libgcc1                1:3.4.1-4sarge1   GCC support library
ii  libstdc++5             1:3.3.4-6sarge1.2 The GNU Standard C++ Library v3

-- no debconf information

Content-Type: text/x-c; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment; filename="t.cpp"

#include <xercesc/parsers/XercesDOMParser.hpp>
#include <xercesc/dom/DOM.hpp>
#include <xercesc/sax/HandlerBase.hpp>
#include <xercesc/util/XMLString.hpp>
#include <xercesc/util/PlatformUtils.hpp>

#include <iostream>

using std::cout;

XERCES_CPP_NAMESPACE_USE

class my_err_handler: public HandlerBase	//override base error handling class
{
public:
	void fatalError(const SAXParseException &exception) {
		char* message = XMLString::transcode(exception.getMessage());
		cout << "Fatal: " << message << "  line: " << exception.getLineNumber() << "\n";
		XMLString::release(&message);	//free the transcoded copy
	}
};

int main(int argc,char **argv)
{
	if (argc < 2) {
		cout << "\nYou must specify file to parse\n";
		return -1;
	}
	XMLPlatformUtils::Initialize();	//must precede any other xerces call
	XercesDOMParser *parser=new XercesDOMParser();
	ErrorHandler* errHandler = (ErrorHandler *) new my_err_handler();
	parser->setErrorHandler(errHandler);	//route parse errors to the handler above
	parser->parse(argv[1]);	 //parse the file
	cout << "\nFile parsed successfully\n";
	return 0;
}

MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Content-Type: text/xml; charset="KOI8-R"
Content-Disposition: attachment; filename="test2.xml"

<?xml version="1.0" encoding="KOI8-R"?>
	<Text>Некий текст</Text>

Received: (at 273600-done) by bugs.debian.org; 28 Mar 2005 18:17:25 +0000
>From ejb@apexcovantage.com Mon Mar 28 10:17:25 2005
Return-path: <ejb@apexcovantage.com>
Received: from through.apexcovantage.com [] 
	by spohr.debian.org with esmtp (Exim 3.35 1 (Debian))
	id 1DFynl-0005ZV-00; Mon, 28 Mar 2005 10:17:25 -0800
Received: from [] (helo=apex.acv.apexcovantage.com)
	by through.apexcovantage.com with esmtp (Exim 4.34)
	id 1DFynh-0002uW-6X
	for 273600-done@bugs.debian.org; Mon, 28 Mar 2005 13:17:21 -0500
Received: from soup (soup.acv.apexcovantage.com [])
	by apex.acv.apexcovantage.com (8.12.8/8.12.8/ads-apexcovantage-hub-1.4) with ESMTP id j2SIHGQQ028724;
	Mon, 28 Mar 2005 13:17:16 -0500
Received: from ejb by soup with local (Exim 4.50)
	id 1DFync-0000rF-Go; Mon, 28 Mar 2005 13:17:16 -0500
From: Jay Berkenbilt <qjb@debian.org>
To: 273600-done@bugs.debian.org
Subject: xerces23 is being removed
Message-ID: <20050328131714.3270055946.qww314159@soup.acv.apexcovantage.com>
Date: Mon, 28 Mar 2005 13:17:16 -0500
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-Apex-MailScanner: Found to be clean
X-MailScanner-From: ejb@apexcovantage.com
Delivered-To: 273600-done@bugs.debian.org
X-Spam-Checker-Version: SpamAssassin 2.60-bugs.debian.org_2005_01_02 
	(1.212-2003-09-23-exp) on spohr.debian.org
X-Spam-Status: No, hits=-3.0 required=4.0 tests=BAYES_00 autolearn=no 

xerces23 is being removed, so I'm closing this bug.  The bug mentions
that KOI8-R support was broken after switching from the GNU transcoder
to the native transcoder.  Post-sarge, we will most likely provide
xerces only with the ICU transcoder or, alternatively, continue to
provide native and ICU versions as we do now.  The ICU version should
be used when non-builtin encodings are required.

Jay Berkenbilt <qjb@debian.org>
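
Where only a native-transcoder build is available, one possible stopgap is to
convert the input to UTF-8, a built-in encoding, before handing it to the
parser. The following is a minimal sketch under stated assumptions: POSIX iconv
with the glibc-style signature is available, and to_utf8 is a hypothetical
helper, not part of xerces. The encoding declaration in the XML prolog would
also have to be changed to UTF-8 (or removed), which this sketch does not do.

```cpp
#include <iconv.h>
#include <cerrno>
#include <stdexcept>
#include <string>

// Hypothetical helper (not a xerces API): converts `input`, encoded in
// `from_encoding`, to UTF-8 using the platform's iconv.
std::string to_utf8(const std::string& input, const char* from_encoding) {
    iconv_t cd = iconv_open("UTF-8", from_encoding);
    if (cd == (iconv_t)-1) {
        throw std::runtime_error("iconv cannot create a converter");
    }

    std::string out;
    char buf[256];
    // glibc's iconv takes char**; some platforms declare const char** instead.
    char* in_ptr = const_cast<char*>(input.data());
    size_t in_left = input.size();

    while (in_left > 0) {
        char* out_ptr = buf;
        size_t out_left = sizeof buf;
        size_t rc = iconv(cd, &in_ptr, &in_left, &out_ptr, &out_left);
        int err = errno;  // save before any call that might change it
        out.append(buf, sizeof buf - out_left);
        // E2BIG only means the output chunk is full; loop and continue.
        if (rc == (size_t)-1 && err != E2BIG) {
            iconv_close(cd);
            throw std::runtime_error("invalid or truncated input");
        }
    }

    iconv_close(cd);
    return out;
}
```

The result could then be parsed from memory rather than from the original
file. For long-term use the ICU build remains the more direct route, since it
needs no pre-conversion step.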