Skip to content

alexjc/document-training-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

document-training-data

You want to build AI/ML but want to reduce your legal risk? You'd like to show rightsholders and regulators that you're serious about data dilligence?

Enter document-training-data. It's:

  • A tool to create a a detailed summary of training data.
  • A tool to provide forensic evidence at industry standard.
  • A tool for simple and effective regulatory compliance.
  • A tool to generate manifests to be cryptographically signed.

This script(s) can be rewritten in a matter of hours by any competent programmer, for example using the ISCC library (International Standard for Content Codes). Feel free to take the code here and integrate it into your own MLops pipelines.

About

Make a detailed summary of data used during AI/ML training.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages