DATASETS.md

Websites with a huge amount of data to use in your projects

Highlight

  • Information is Beautiful - A really good website with beautiful graphs and data.
  • Colaboradados - Repository with a lot of good datasets from Brazil in Portuguese.
  • Awesome Public Datasets - A repository that contains a large number of datasets and is well worth a look.
  • Google Data Search - Google's tool for searching for datasets.
  • Chatito - Helps you generate datasets for natural language understanding models using a simple DSL.
  • Datahub - Collections - High-quality data and datasets organized by topic.
  • Common Voice - An open source, multi-language dataset of voices that anyone can use to train speech-enabled applications. Each entry in the dataset consists of a unique MP3 and corresponding text file.

Data from Brazil in Portuguese

Top resources

Other resources