[Pkg-haskell-maintainers] Bug#739448: pandoc: endless loop converting from html

Jonas Smedegaard dr at jones.dk
Wed Feb 19 20:41:33 UTC 2014


reopen 739448
thanks

Quoting Thomas Viehweger (2014-02-19 20:49:51)
>>>> pandoc -f html -t epub -o 131212.epub \
>>>> http://lwn.net/Articles/575838/bigpage?format=printable
>>>
>>> I can reproduce the error.
>>>
>>> Seems it is not tied to epub (trying with markdown does the same) 
>>> but instead with impurity of html (it works after cleaning up with 
>>> tidy).
>>
>> Investigating closer - using weblint (Debian package weblint-perl) 
>> reveals that the html page is broken: contains several non-closed tr 
>> and td tags.
>
> thanks for the quick response. tidy-ing the html file really helps.
>
> But.. the previous version of pandoc in testing (1.9.?) also behaves 
> as expected (no endless loop).
>
> In my opinion pandoc should not go into an endless loop - no matter 
> what I feed it with.
> It might produce garbage or abort. But an endless loop is not 
> acceptable (for me).

Good point.  Reopening the bugreport.

Btw: A closed bugreport can still be posted to (I guess that's why you 
contacted me discretely, and I took the liberty to reply in public).  
Only when _archived_ can you no longer post to it.


 - Jonas

-- 
 * Jonas Smedegaard - idealist & Internet-arkitekt
 * Tlf.: +45 40843136  Website: http://dr.jones.dk/

 [x] quote me freely  [ ] ask before reusing  [ ] keep private
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 966 bytes
Desc: signature
URL: <http://lists.alioth.debian.org/pipermail/pkg-haskell-maintainers/attachments/20140219/c1395e6e/attachment-0001.sig>


More information about the Pkg-haskell-maintainers mailing list