Bug#1033406: licensecheck: scan-copyrights fails to create copyright file for texlive-extra

Vignesh Raman vignesh.raman at collabora.com
Fri Mar 24 15:52:33 GMT 2023


Hi Jonas,

On 24/03/23 8:09 pm, Jonas Smedegaard wrote:
> Please try isolate the exact licensecheck command that scan-copyrights
> executes (e.g. by hacking that script to print to stderr before
> executing).
>
> When that exact command - including options and arguments - is known, we
> can move on to examine why the failure isn't deterministic.

This is the exact command.

licensecheck --encoding utf8 --copyright --machine --lines=300 
--shortname-scheme=debian,spdx --recursive 
'--ignore=^(\./)?(?s:(?>[^/]*?/po/)[^/]*)\Z|^(\./)?(?s:.*\~)\Z|^(\./)?(?s:CVS)\Z|^(\./)?(?s:DEADJOE)\Z|^(\./)?(?s:RCS)\Z|^(\./)?(?s:\..*\.swp)\Z|^(\./)?(?s:\.\#.*)\Z|^(\./)?(?s:\.arch\-ids)\Z|^(\./)?(?s:\.arch\-inventory)\Z|^(\./)?(?s:\.bzr)\Z|^(\./)?(?s:\.bzr\.backup)\Z|^(\./)?(?s:\.bzrignore)\Z|^(\./)?(?s:\.bzrtags)\Z|^(\./)?(?s:\.cvsignore)\Z|^(\./)?(?s:\.deps)\Z|^(\./)?(?s:\.git)\Z|^(\./)?(?s:\.gitignore)\Z|^(\./)?(?s:\.hg)\Z|^(\./)?(?s:\.hgtags)\Z|^(\./)?(?s:\.pc)\Z|^(\./)?(?s:\.shelf)\Z|^(\./)?(?s:\.svn)\Z|^(\./)?(?s:\{arch\})\Z|^(\./)?(?s:_MTN)\Z|^(\./)?(?s:_darcs)\Z|^(\./)?(?s:config\.guess)\Z|^(\./)?(?s:config\.sub)\Z|^(\./)?(?s:debian/(?>[^/]*?/)[^/]*\.asc)\Z|^(\./)?(?s:debian/(?>[^/]*?/)[^/]*\.gpg)\Z|^(\./)?(?s:debian/(?>[^/]*?/)[^/]*\.pgp)\Z|^(\./)?(?s:debian/apertis/[^/]*)\Z|~$|(^|/)\.\.swp$|(^|/),,.*(?:$|/.*$)|(^|/)(DEADJOE|\.cvsignore|\.arch-inventory|\.bzrignore|\.gitignore)$|(^|/)(CVS|RCS|\.pc|\.deps|\{arch\}|\.arch-ids|\.svn|\.hg|_darcs|\.git|\.shelf|_MTN|\.bzr(?:\.backup|tags)?)(?:$|/.*$)|(?:^|/)debian/((fix.scanned.)?copyright|changelog|NEWS|compat|.*yml|docs|source|patches/series)|(png|jpg|pdf|ico|bmp|jpe?g)$' 
.

> Your mentioning above that licensecheck failing is "not consistently
> reproducible" is another bug that is more serious, however: Licensecheck
> should - with same version also of its libraries - behave the same at
> each execution.
>
> So it would be helpful to me, independent of the issue of not parsing
> binary data - if you can help isolate the command which is unreliable.
>
When I run the above command the results for the problematic files are,

./texmf-dist/tex/latex/exp-testopt/exp-testopt.sty    UNKNOWN 2009 Paul 
Ebermann. | / 2008 Paul Ebermann
./texmf-dist/fonts/type1/public/baskervillef/BaskervilleF-Bold.pfb 
UNKNOWN    2016 / 2012 Rodrigo Fuenzalida (www.rfuenzalida.com|hello 
:rfuenzalida.com) / 2012 Pablo Impallari 
(www.impallari.com|impallari at gmail.com),
./texmf-dist/fonts/type1/public/baskervillef/BaskervilleF-BoldItalic.pfb 
UNKNOWN    2016 / 2012 Rodrigo Fuenzalida (www.rfuenzalida.com|hello 
:rfuenzalida.com) / 2012 Pablo Impallari 
(www.impallari.com|impallari at gmail.com),

Going with the above information, scan-copyrights should not crash. If I 
remove all other files and keep only the above in same directory 
structure, it does not crash.

Only when we run scan-copyrights with all the source files, it crashes.

> It is unclear what you are referring to as "this issue": What you
> summarise in the subject of this bugreport strictly speaking is an issue
> with libconfig-model-dpkg-perl, and above I have mentioned 2 potential
> underlying issues that might be your main concern here (as I assume you
> deliberately chose to file this bugreport against licensecheck).

Yes, I created the bug report against licensecheck because I suspected 
that parsing the binary files was causing the issue.

> If you are not really reporting a new issue here, but instead are asking
> for best practice of using licensecheck with sources that include binary
> data, then my most up-to-date notes on that is here:
> https://wiki.debian.org/CopyrightReviewTools#licensecheck

Thanks for the link.

If there are no issues with licensecheck, will report the issue in 
libconfig-model-dpkg-perl with all this information.

Regards,
Vignesh



More information about the pkg-perl-maintainers mailing list