menu scancode-results-analyzer documentation

Welcome to scancode-results-analyzer Documentation!

What is Scancode-Results-Analyzer

ScanCode detects licenses, copyrights, package manifests and direct dependencies and more both in source code and binary files.

ScanCode license detection is using multiple techniques to accurately detect licenses based on automatons, inverted indexes and multiple sequence alignments. The detection is not always accurate enough. The goal of this project is to improve the accuracy of license detection leveraging the ClearlyDefined and other datasets, where ScanCode is used to massively scan millions of packages. It would also be available as a ScanCode post-scan plugin to use it in scans directly, or in scancode.io pipelines.

This project aims to:

  • Write tools and create models to massively analyze the accuracy of license detection

  • Detect areas where the accuracy could be improved.

  • Add this as a scancode post-scan plugin

  • Add to pipelines in scancode.io

  • Write reusable tools and models to assist in the semi-automated reviews of scan results.

  • It will also create new license detection rules semi-automatically to fix the detected anomalies

Getting Started with scancode-results-analyzer

Indices and tables