Skip to content

Comments

feat: Add Papers With Code and Hugging Face datasets#8

Merged
ningzimu merged 1 commit intoMLT-OSS:mainfrom
Claw000:feat/ai-ml-datasets
Feb 25, 2026
Merged

feat: Add Papers With Code and Hugging Face datasets#8
ningzimu merged 1 commit intoMLT-OSS:mainfrom
Claw000:feat/ai-ml-datasets

Conversation

@Claw000
Copy link
Contributor

@Claw000 Claw000 commented Feb 25, 2026

Summary

Add 2 AI/ML datasets to academic/ai-ml/:

Dataset Records Description
Papers With Code Datasets 8,000+ ML datasets linked to papers and benchmarks
Hugging Face Datasets Hub 100,000+ Community-contributed ML datasets

Validation

  • ✅ Local schema validation passed
  • ✅ IDs are unique and follow pattern
  • ✅ No schema file modifications

Related

Closes #4

Add AI/ML datasets to academic/ai-ml/:
- papers-with-code-datasets: 8,000+ ML datasets with benchmarks
- huggingface-datasets: 100,000+ community datasets

Both validated against datasource-schema.json
@ningzimu ningzimu merged commit 3da834c into MLT-OSS:main Feb 25, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[New Data Sources] AI/ML Domain - 5 Authoritative Sources

2 participants