Project Ideas Improve package license detection

Philippe Ombredanne edited this page Feb 21, 2022 · 1 revision

Improve ScanCode package license detection

The goal of this project is to improve package license detection across the board. ScanCode toolkit package license detection works primarily by:

  1. extracting license statements found in package manifests
  2. applying license detection and normalization on these extracted statements

In some cases things are simple: for instance, the npm package.json license field "license": "MIT" is straightforward and unambiguous. The difficulty is in the details, and in getting these right at a large scale for most or all packages.
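The two steps above can be sketched for the npm case as follows. This is a minimal illustration and not ScanCode's actual implementation: the function names and the small `NORMALIZED` table are hypothetical, and a real detector relies on a full license index rather than a lookup table. The sketch also handles the older package.json forms where the license is an object with a "type" key or a "licenses" list, which is one example of the details that make this harder at scale.

```python
import json

# Hypothetical, deliberately tiny normalization table (an assumption for
# illustration only; real detection uses a full license index).
NORMALIZED = {
    "mit": "MIT",
    "mit license": "MIT",
    "apache 2.0": "Apache-2.0",
    "apache-2.0": "Apache-2.0",
    "apache license 2.0": "Apache-2.0",
}


def extract_license_statements(manifest_text):
    """Step 1: collect raw license statements from a package.json document."""
    data = json.loads(manifest_text)
    statements = []
    for key in ("license", "licenses"):
        value = data.get(key)
        if value is None:
            continue
        # Accept a single value or a list of values.
        items = value if isinstance(value, list) else [value]
        for item in items:
            # Older manifests use objects such as {"type": "MIT", "url": "..."}.
            if isinstance(item, dict):
                item = item.get("type")
            if isinstance(item, str) and item.strip():
                statements.append(item.strip())
    return statements


def normalize(statement):
    """Step 2: map a raw statement to a normalized identifier, or None if unknown."""
    return NORMALIZED.get(statement.lower())


if __name__ == "__main__":
    simple = '{"name": "a", "license": "MIT"}'
    legacy = '{"name": "b", "licenses": [{"type": "Apache 2.0"}]}'
    for manifest in (simple, legacy):
        for raw in extract_license_statements(manifest):
            print(raw, "->", normalize(raw))
```

Returning `None` for an unknown statement keeps undetected licenses visible instead of silently guessing, which matters when measuring detection quality across many packages.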

This project idea can take multiple shapes and should focus on one or more package types. The end result should be a significant improvement in license detection quality for the selected package types.

These are details on sub-projects, each focused on a different package type. Some may be completed or quite advanced. The long-term goal is to work closely with each package ecosystem to contribute the improved license data upstream.
