[Python-modules-team] Bug#654680: python-html2text: 3.200.1-1 breaks parsing of feeds within rss2email

Stefano Rivera stefanor at debian.org
Fri Jan 6 13:02:33 UTC 2012


tag 654680 + patch
thanks

Hi Joerg (2012.01.05_10:39:33_+0200)
> after python-html2text was upgraded to 3.200.1-1 the feeds read by
> rss2email can't be parsed anymore

Looks like the html2text upstream didn't consider unescape() to be part
of the public API, and moved it into a class-level function.
https://github.com/aaronsw/html2text/commit/1a25828d556d30cc689c1bc2c11f52838c57b7ac

I see it's also been marked with a "# @@nobody calls this function?"
comment. Aaron: Are you intending to remove it?

Joerg / Lindsey: The attached patch for rss2email should do the trick
for supporting 3.200.

The alternatives are:
* re-adding a top level unescape() function to
  html2text, but that would have to create an HTML2Text object on each
  invocation...
* Moving unscape() (and the functions it calls) back out of the class,
  but then HTML2Text.unicode_snob won't be very useful.

SR

-- 
Stefano Rivera
  http://tumbleweed.org.za/
  H: +27 21 465 6908 C: +27 72 419 8559  UCT: x3127
-------------- next part --------------
A non-text attachment was scrubbed...
Name: html2text-3.200.patch
Type: text/x-diff
Size: 1099 bytes
Desc: not available
URL: <http://lists.alioth.debian.org/pipermail/python-modules-team/attachments/20120106/e8755251/attachment.patch>


More information about the Python-modules-team mailing list