[Python-modules-commits] [scancode-toolkit] annotated tag v2.0.0.rc2 created (now 12b90e8)

Maximiliano Curia maxy at moszumanska.debian.org
Thu Aug 10 18:57:11 UTC 2017


This is an automated email from the git hooks/post-receive script.

maxy pushed a change to annotated tag v2.0.0.rc2
in repository scancode-toolkit.

        at  12b90e8   (tag)
   tagging  eeb6f2734b8ec90956b675a13b23d90be9a8e312 (commit)
  replaces  v2.0.0.rc1
 tagged by  Philippe Ombredanne
        on  Mon Jan 16 15:38:36 2017 +0100

- Log -----------------------------------------------------------------
Release candidate v2.0.0.rc2

Chin Yeung Li (1):
      #408 Change license category from Attribution to Permissive

Jillian (1):
      #41: adding base script to get license keys using DejaCode API

Jono Yang (37):
      Rename unicode_2.RULE and unicode_2.yml to unicode_mappings.RULE and unicode_mappings.yml, respectively. Change licenses in unicode_mappings.yml from 'unicode' to 'unicode-mappings'.
      Rename unicode_2.RULE and unicode_2.yml to unicode_mappings.RULE and unicode_mappings.yml, respectively. Change licenses in unicode_mappings.yml from 'unicode' to 'unicode-mappings'.
      Revert unicode_mappings.RULE and unicode_mappings.yml back to unicode_2.RULE and unicode_2.yml, respectively. Add 'unicode-mappings' under license in unicode_2.yml
      Merge branch 'develop' into 143-incorrect-license-detection
      #92: Add 'not-a-license' rule to remove GPL false positive
      #143 Modify 'unicode_2.yml' to detect 'unicode-mappings' rather than 'unicode'
      Revert "#143 Modify 'unicode_2.yml' to detect 'unicode-mappings' rather than 'unicode'"
      #143 Modify 'unicode_2.yml' to detect 'unicode-mappings' rather than 'unicode'. Remove Unicode copyright line from 'unicode_2.RULE'.
      #143 Add test for unicode-mappings license
      #92 Add flex readme as a test for flex license
      Remove category from intbitset.ABOUT
      Ensure that binary strings have at least two unique characters.
      #339 Create new function  to correctly parse newest ScanCode JSON output
      #339 Create leading slashes for paths and trailing slashes for directories
      #339 Create new function  to correctly parse newest ScanCode JSON output
      #339 Create leading slashes for paths and trailing slashes for directories
      #339 Refactor  based on original
      #339 Refactor `json2csv.py` functionality by using the implementation from the original `json2csv.py` script
      #339 Add support for license option.
      #339 Flatten the 'matched_rule' field under 'licenses' and append to 'licenses' when processing the 'licenses' field
      #339 Add support for copyright option. Rewrite license processing functionality to deal with matched rules seperatly.
      #339 Add placeholder for packages and scan error fields
      #339 Refactor json2csv.py.
      #339 Refactor copyright function
      #339 Consolidate information from columns whose values are lists into a single string for those column entries
      #339 Add support for emails, urls, and scan errors output
      #339 Replace old tests for json2csv.py
      #339 Add new test files and expectations for modified tests. Create new test for verifying csv output from ScanCode result that contain all scan options (full).
      #386 Add new rule and test for OpenSSL license from OpenSSL-1.1.0c/crypto/aes/aes_cbc.c
      #339 Remove OrderedSet class and fold class functionality into collect_header_keys()
      #409 Added LICENSE file and test for bpmn-io license
      #411 Modify list comprehension for package license names to return an empty string for license name values which are null
      #411 Remove the default return value of \'\' when getting license name from package licenses
      #416 Create new production rule to capture copyrights in the order of copyright, year range, dash, and noun.
      #406 Create new rule to capture 'Copyright (c) 2012-2016, Project contributors'
      \#405 Add new rule to capture \`Copyright 2007-2010 the original author or authors.\`
      #380 Create test to ensure 'Byten ( c ) Exactly' is not detected as a copyright statement

Li (13):
      #395 Add the parser for PHP composer, and add test cases
      #395, refine the code according to latest pull review comments, change the language, version mapping
      #395 comment message refinement
      #395 keep the original license format
      #395 update mimetype and add comments
      #395 Make generic code and avoid dupication
      #395 Add the vendor support
      #395 update the mimetype of composer.json to pure json type
      #395 update vcs type and make sure it covers all possible types
      #253 Pypi: Fix the test to fit for the setup.py name
      #253: Pypi: add author object mapper and update the test to make sure the build passes
      #429: urls: Add test cases and update the code and commit  to make more clear
      #429 Utils: Add 'gist github' support and update test case

Philippe Ombredanne (372):
      Move CI build badges to the top of the README for #330
      New rules and test for #331
      New test for #331
      #337 Update README installation doc for Linux
      #337 Update README installation doc for Debian 8
      #330 Simple IRC build notification script for Appveyor
      Merge remote-tracking branch 'upstream/develop' into develop
      Fix indentation
      Add missing license header and simplify script
      Use official FSF-formatted GPL 2.0 license text
      #330 Update appveyor build badge URLs
      #327 New and improved license detection rules
      #327 Externalize data driven test utils in their own module
      #327 Improve handling of literal line endings in texts
      Add todo and pointer to improve wide charactes extraction in binaries
      #327 refine handling of tags in markup such as HTML
      Add ability to force clear the license match cache
      Add new frequent tokens such as "license"
      #327 overhaul the license match merging and filtering logic
      Improve license match tracing. Update logic for qyery run matching
      #327 Update match type for exact license match.
      #327 Simplify texts tokenization, remove unused code
      Work in progress twoards a separate module for license texts extraction
      Update match type
      #327 do not merge sequence matches
      #327 break query runs at "40" empty lines
      Cosmetics and comments updates
      Document why we have an unused import
      Remove unused Span.merge method
      Update and improve license detection tests
      Update license Rule model
      Remove unused ngram references, add new thresholds for small matches
      #327 New and improved licenses and detection rules
      #327 Remove two extra bsd-new from sleepycat SPDX
      #327 New and improved licenses and detection rules
      Comment on cache usage
      Make MAX_DIST used for merging license matches much bigger
      Add new frequent tokens for license detection
      Use intbitsets for LicenseMatch spans
      #327 Refin proprietary license detection rules
      #327 New and improved licenses and detection rules
      Improve and cleanup license detection query processing
      Move print_matched_texts to shared license test utils
      Add deeper tracing to exact matching
      Remove references to unused exact matching in sets.
      Fix how license match lines are obtained
      New license detection perf test
      Micro optimization to test set containment last.
      Use same call to load licenses, rules and query files for symmetry.
      Refine sequence matching arguments
      #327 Rework match merging and filtering
      Improved license matching procedure
      New and improved license detection tests
      Do not regen test fixtures by default.
      Fixed incorrect and duplicated GPL license texts and rules
      The GPL 2.0.0
      The GPL 2.0.1
      The GPL 2.0.2
      The GPL 2.0.3
      The GPL 2.0.4
      Update the GPL license with notes about which text version it is.
      Cleanup and refine GPL detection rules.
      Fixed maven test expectation that fails with a weird heisenbug
      Fix scan integration test to account for new license detection context
      Add some verbosity to release smoke tests
      New rules for a conflicted GPL/LGPL licensing from ozone-db
      Cleanup and refine GPL detection rules.
      Test for #336
      Test for #336
      Renaming Curl to cURL license
      Removing copyright statement from cURL license and rules
      New license detection rule for #336
      New license bds-new-derivative and rules and tests for #299
      New and improved BSD detection rules
      New gpl+classpath detection rule
      Merge pull request #334 from nexB/327-license-detection-accuracy-improvements
      #341 make the index lock timeout more explicit with a constant
      Bumping develop to 2.0.0rc2 to avoid confusion with other RC's
      Merge pull request #343 from nexB/143-incorrect-license-detection
      Add rationale for flex non-gpl negative rule
      Merge pull request #344 from nexB/92-incorrect-license-gpl-detected
      Merge pull request #345 from sschuberth/ignore-tcl
      Merge pull request #348 from nexB/143-incorrect-license-detection
      #92 Convert negative rule to regular for flex detection
      #92 Simplify netbsd license
      #92 Simplify flex-2.5 text
      #92 Correct failing integration test
      Make incoherent test data files failures explanatory
      #88 Fixed with a test
      Merge pull request #349 from nexB/92-incorrect-license-gpl-detected
      Refine and improve duplicated Apache 2.0 license detection rules
      Fix incorrect apache license text
      Support matching rule fragments with automaton
      New and improved licenses, rules and tests
      Improve test for query tokens
      New frequent tokens
      Refine ngrams selection
      Add support for ngrams detection with an automaton
      Refine license match merging
      Add note to test for iPhone legal notice
      Update failing test because from new tokens context
      Correct some requirements version damaged by bumpversion
      Merge branch 'license-fragments' into develop
      Merge pull request #350 from sschuberth/result-file
      #351 Support collecting BV company suffix in copyrights
      Merge pull request #352 from nexB/351-copyright-bv
      Add strings module from Fireeye's flare-floss
      #99 Remove duplicated rules and license for stlport/hans boehm
      Format license text
      #99 new classpath rules
      #99 new unicode rule
      #99 test using only a subset of the strings found in test elf
      #99 permission should is a frequent token
      Formatting
      #99 license queries: make the line_threshold to 1000 for binaries
      Ensure full, verbatim text of Apache license is used
      #99 refine fireeye floss strin extraction for our use case
      #99 re-implementation of strings extraction from binaries
      Remove "expected_contains" feature from license detection tests
      Add new license match merge condition when ispans are the same
      Improved rules for proprietary license detection
      Improved X11 and MIT rules, license and texts
      Improved public domain detection rules
      Removedd incorrect rule: this is a duple of the afmparse text
      Improved artitics license detection
      Simplify adobe-scl license
      Improve license detection of some tests expecting for fail for now
      Update test expectation
      Correct failure in license detection test
      Improve copyright detection accuracy
      Merge pull request #353 from nexB/99-license-in-binary
      #267 Add simplejson for its iterable streaming encoding capabilities
      #267 Ensure that cli test always raise exceptions
      #267 Initial implementation of scan streaming
      #267 Catch exceptions in CLI tests
      #267 Bump diskcache to latest version 2.0.2
      #267 Organize imports
      #267 Add test highlighting a Python pickle bug
      #267 Add test highlighting a diskcache bug
      #355 add test for false positive license detection
      #267 Rename test module to avoid name conflicts
      #267 workaround Python pickle bug
      #267 Do not raise exceptions in CLI calls
      Merge pull request #354 from nexB/cache-and-stream-scan-results
      #356 Do not create dupe entries for NPM metafile_locations
      Make public domain rules solid
      New license detection rule and test for #360
      Improve license detection rules and tests
      Use indented JSON for improved test readability
      Add SCANCODE_LICENSE_DEBUG env var to enable license detection tracing
      Add derive(s|d) and public as license frequent tokens
      Ensure that licenses are tokenized on +
      Ensure that license test JSON is idented for readability
      Add derive(s|d) and public as license frequent tokens
      Improve licenses and rules
      Remove mojibake from license texts and rules
      New and improved license rules and tests
      Ensure that licenses are tokenized on +
      New rules and test for #361
      Remove unused code and docs
      Merge pull request #363 from nexB/360-361-improved-license-detection
      Fix failing test following rules updates in #360 and #361
      Fialing test for #362
      Always put infos in cache when caching scan
      #364 Store scan errors in a per-file scan_errors key.
      #364 Ensure html-app works with missing scans and errors
      Merge pull request #365 from nexB/364-file-level-errors
      #362 Make html-app copyright table creation more error proof
      Do not use diskache's Fanout cache.
      Improved Apache 2.0 license detection rule.
      #267 Do not run scancode CLI tests on multiple processes
      Quote commands launched in CI loops
      Correct Travis matrix environment variable that were wrongly defined
      Add doc to get_scans_cache for #267
      Use slots for content type registry to use less memory
      Initial support for an interruptible function call for #267
      Improve getting results from cache and other refinements for #267
      Simplify clik progress reporting for #267
      Improve scanning for #267
      #267 Update extractcode command with latest CLI utils changes
      #267 Update get_relative_path tests to match new semantics
      #267 Improve tests for scancode CLI
      #267 Fix logic in timeout computation
      #267 Ensure that any scanning error is reported in the scan stats
      #267 experimental support for max memory quota
      #267 Ensure psutil is added as a dependency
      #267 Improve documentation
      Fix typo in comment
      #267 Add SCANCODE_EXPERIMENTAL_MAX_MEMORY feature switch
      #267 Make interruptible functions work
      #267 Make memory quota always enabled in scans
      Refine code and comment typos
      #267 Refine tests for scan timeouts and max_memory
      #267 Make timeout optional if not provided, with default
      #267 Correct typo in docstring
      #267 Use environment variables to test timeout and max_memory
      #267 Lower expectation for timeout and memory tests
      #267 Lower expectation for timeout and memory tests
      #267 Ensure that process and thread pools are closed properly.
      #267 improve expectations for interrupt tests
      #267 Do not run a memory guard forever, instead combine with a timeout
      #267 Improve timeout test with some randomness in test files
      Merge pull request #369 from nexB/long-scan-timeout
      #267 Add command options for --timeout and --max-memory
      New license detection rules.
      New license detection rule: do not detect fake GPL.
      Merge pull request #371 from nexB/267-cli-options-for-timeout-and-max_memory
      New license detection rule
      #267 Improve interruptible function error message
      #267 remove line_by_pos references from cache
      #267 Improve incorrect license test expectation
      Remove useless solid and min_score references in license tests for #267
      #267 Remove unused set-based Span implementation
      #267 promote intbitset-based Span implementation as the one
      #267 Remove memory leak in license detection match merging
      #267 Improve scan results caching speed
      Merge pull request #376 from nexB/267-memory-leak
      Cosmetic formatting and import reorg
      New and improved license detection rules
      Remove unused import
      New and improved license detection rules
      New and improved license detection rules and tests
      Cosmetics
      #342 Implement new scoring for license match
      Cleanup imports
      Cosmetics
      #342 Refine doc and clean dead code. Add digits-only token tracking
      #342 Improve computing of license rule relevance
      Improve filtering of license matches of single spurious token
      #342 Add tests for License match score and rule relevance computation
      Merge pull request #321 from nexB/320-links-open-in-new-tab
      #385 Temptative correction for Python 2.7.3 issue
      Merge pull request #383 from nexB/342-license-match-confidence
      #388 Ensure that --verbose scan goes to stdout and is not hidden
      Organize imports. Ensure cache is cleared on exit.
      #388 Minor cleanups related to verbose output in extracode
      #385 Add new workaround for bug https://bugs.python.org/issue14881
      #385 Improve workaround for bug https://bugs.python.org/issue14881
      #390 Ensure that unicode or/and byte paths are handled correctly
      Merge pull request #391 from nexB/390-unicode-paths
      New and improved MIT license variations
      #390 Rename test data path to correct path.
      #390 Bundle test data in an archive. Use OS-specific tests.
      Run first scancode CLI tests on Appveyor and faster on 2 processes
      #390 Use extractcode for proper extraction of test data on windows
      #390 Test that extractcode cli extracts archives with unicode names
      Reformat code
      #390 Improve handling for archives with unicode names.
      #390 Skip failing test on windows for now.
      #387 Initial support to extract XAR archives and Apple pkg or mpkg
      Merge pull request #392 from nexB/387-apple-pkg-xar-archive-support
      #259 Use shorter paths for test files on Windows
      #253 Document code, add several FIXME and reformat
      Cosmetics to work also in Python 3.
      Ensure license index cache is not invalidated by pyc compilation
      #397 Fix incorrect mit or gpl detection with new license rule
      Merge pull request #402 from nexB/386-apache-2.0-incorrectly-detected
      Merge pull request #399 from nexB/339-upgrade-json2csv.py
      Only issue warnings and not errors for short license rules
      New frequent license words
      Add minimum coverage at License level whne a license is used as Rule
      New legacy PHP license
      Format license text
      New license test
      New and improved license detection rules
      #374 #396 Do not scan archives and compressed files
      Merge pull request #404 from nexB/374-396-do-not-scan-archives-and-compressed-files
      Format license text
      Add new and improved GPL and ISC license rules
      #390 and #132 Failing test for scancode html output with unicode
      #390 and #132 Correct handling of output file
      Merge pull request #407 from nexB/390-html-output-with-license-unicode-metadata
      Merge pull request #410 from nexB/409-ScanCode-bpmn-io
      Merge pull request #412 from nexB/411-json2csv-crash-null-package-license-name
      New and improved CDDL license detection rules.
      New and improved license detection rules.
      New and improved license detection rule for Apache
      New and improved license detection rule for NIST-PD
      New and improved license detection rules
      New and improved license detection rules, several with mojibake
      #388 and #417 Ensure that a --quiet run is really quiet
      Remove cache on disk during license detection.
      #388 and #417 Send progress bar outputs to stderr and improve CLI help.
      #388 and #417 Split output tests in two
      Merge pull request #418 from nexB/388-417-consistent-stderr-stdout-outputs
      #260 Add attrs as a dependency
      #260 New tokenizer and get_full_matched_text function to collect texts
      Fix format for ABOUT file.
      #260 New "--license-text" cli option to return matched license texts
      #422 new and improved license detection rules
      #422 new SBIA license found in Jansson cmake scripts
      #260 Fixed failing test following minor refactoring.
      Merge pull request #427 from nexB/416-dash-dotted-name-truncated
      Merge pull request #428 from nexB/406-truncated-copyright-statement
      Merge pull request #403 from nexB/395-php-composer
      Update test expectations for latest beautifulsoup4
      Merge pull request #430 from sschuberth/spdx-tv
      Merge pull request #423 from nexB/260-license-text-capture
      Merge pull request #421 from nexB/253-python-packages
      Merge pull request #432 from nexB/405-truncation-of-copyright
      Merge pull request #433 from nexB/380-incorrect-copyright-detected
      Merge pull request #435 from sschuberth/clarify-default-options
      Add write up on packagedcode module in README.
      Cosmetic formatting and documentation.
      Add upcoming Python 3.6 to detected versions.
      Add private function to return an empty mapping of file info.
      #413 file_info may be empty. get to avoid KeyError
      #413 Always catch low level exceptions with traceback
      #413 Use native paths and not POSIX paths for absolute paths.
      #413 Use native paths and not POSIX paths for file names.
      #413 test using nodemon weird file names
      #413 new tests to extract zip and tar with weird file names
      #413 Work in progress to detect POSIX paths and filename accordingly
      Cosmetics
      #413 Improve handling of POSIX path detection
      #413 update toascii to optionally transliterate unicode to ascii
      #413 Improve paths resolution and safe paths creation
      #413 Move portable_new_name to commoncode.paths.portable_name
      #413 Cosmetics
      #413 Use improved "paths.safe_path" transform when extracting.
      #413 New library for Unicode transliteration
      #413 new and improved tests cases for extracting weird file names
      #413 Fix failing test using outdated helper syntax
      #413 Ensure extractcode can extract from archives with weird names
      #413 Fix failing test with os-specific expectations.
      #413 Run scancode CLI tests first on Travis
      #413 Ensure that weird names tests have OS-specific expectations
      #413 Split Ci tests in a matrix for each main module
      #413 Correct appveyor.yml syntax: matrix must be a mapping.
      #413 Do not test Mac as a special case for unicode names in tar.
      #413 Cosmetics
      #413 Split weird names extraction tests per OS, extractor and warning
      #413 Use shorter paths for extract tests to pass on Windows
      #413 do not use an archive for test scans of files with unicode paths
      #413 force test path to be unicode for Windows to handle unicode OK.
      #413 Skip tests with illegal file names on Windows.
      #413 more traceable really unit tests
      #414 Abstract dates from scan comparisons in tests
      #413 Cosmetics
      #413 Add TODO comment as a reminder to handle exception properly
      #413 Mark as excepted failures certain OS/extractor test combos.
      #413 Update test expectations for Mac
      #413 Improve check function to test archives with weird file names
      #413 Update pytar test expections for Mac
      #413 Improve expectations for Windowss and make it easy to debug
      #413 Upgrade py.test and dependencies to the latest version
      Ensure py.test does not ignore etc/scripts tests.
      #413 Make expectations empty for easier debug on Windows
      Improve setup.cfg to remove py.test warning
      Merge pull request #434 from nexB/425-json-output-change
      #413 debug py.test failure on Travis
      #413 Correct typo in path for licensedcode CI runs
      #413 Correct more typo in path for licensedcode CI runs
      #413 Correct expectation for Windows extraction tests
      #413 Improve parent directory tests expectations for Windows
      Merge pull request #414 from nexB/413-no-crash-on-weird-file-names
      Merge pull request #441 from sschuberth/spdx-tools-unicode
      Improve path splits (follow up from #413)
      #408 Temporarily update CI tests verbosity and order to debug failures.
      Signed-off-by: Philippe Ombredanne <pombredanne at nexb.com>
      #408 make smaller tests run verbose
      #401 Ensure copyrights are not truncated in HTML output
      Remove test file committed by mistake
      Improve json2csv
      Merge pull request #445 from nexB/401-truncated-copyright-in-html
      Merge pull request #440 from nexB/408-update-lic-category
      Merge remote-tracking branch 'upstream/429-testcase-utils-packagedcode' into develop
      Merge remote-tracking branch 'upstream/262-shallow-extraction' into develop
      Merge remote-tracking branch 'sschuberth/spdx-path-issues' into develop
      Ensure json2csv writes CSV from scan that contain unicode without error
      Merge pull request #451 from sschuberth/spdx-none
      Bump spdx_tools for #453

Sebastian Schuberth (18):
      If the platform is unsupported, actually disclose its name
      Ignore the generated top-level "tcl" directory
      Be case-insensitive on result format names
      Create any missing intermediate directories for the result file
      cli: Only create JSON acknowledgment if needed
      Update existing thirdparty packages
      Add the spdx-tools package and dependencies as thirdparty
      Add ABOUT and LICENSE files for the spdx-tools package and dependencies
      Add basic SPDX tag/value output
      Add basic SPDX RDF output
      cli: Clarify how default command line options are enabled
      Work around spdx-tools assuming a "str" file instead of "unicode"
      NOTICE: Add blank lines around the address
      NOTICE: Update the Copyright year
      Load notice text from NOTICE file
      cli: Fix some minor typos in a comment
      cli: Read a file's SHA1 from the cache instead of recalculating it
      cli: Write SPDXNone if no licenses are found

Steven Esser (9):
      #320: links now open in new tab, where applicable; Jinja2 updated to v 2.8 to use the 'target' parameter
      #320: removed unneeded AboutCode dependency
      #425: JSON outputs no longer pretty-printed by default; added format option 'json-pp' for pretty-printed JSON outputs.
      #425: additional JSON size reduction by removing unnecessary space characters
      #425: tests added and passing
      #425: added json-pp test entry in etc/release/release.sh
      #262: shallow extraction flag working.
      #262: added help text for --shallow option
      #262: initial test cases created and passing

pombredanne (1):
      #300 and #413 Handle extraction of file names not legal on windows

rakesh balusa (5):
      #253 Recognize Python packages using setup.py files
      #253 Recognize python wheels as packages
      #253 Recognize Python .egg files as packages
      #253 Ensure that opened files are closed
      #253 Improve functions documentation

-----------------------------------------------------------------------

No new revisions were added by this update.

-- 
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/python-modules/packages/scancode-toolkit.git



More information about the Python-modules-commits mailing list