Open Datasets
steviep42 edited this page Oct 14, 2019 · 16 revisions
Here are some tried-and-true sources of public datasets that might be useful to teachers, students, and autodidacts interested in analyzing "real" data in an ML/data science context (image detection, recommendation systems, and so on) or just in illustrating some basic statistics concepts. There are many more, but these are solid go-to destinations for such material.
| Source | Description |
|---|---|
| Awesome Public Datasets | A topic-centric list of high-quality open datasets |
| UCI Machine Learning Repository | One of the oldest data resources on the Net |
| Kaggle | Datasets and open-source Jupyter notebooks |
| r/Reddit | Reddit for open datasets and discussions |
| MNIST | Database of handwritten digits |
| ImageNet | Image database |
| Deep LearningNet | Data for benchmarking deep learning algorithms |
| Zillow Real Estate Research | Home prices and rents |
| MovieLens | Rating datasets from the MovieLens website |
| FiveThirtyEight | Data and code behind the articles and graphics at FiveThirtyEight |
| US Census | United States Census Data |
| FBI Crime Data | FBI Crime Data |
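
Many of these sources distribute data as CSV files. As a rough illustration of the "basic statistics" use case mentioned above, here is a minimal sketch that summarizes one numeric column using only the Python standard library. The inline sample data, the column name `median_rent`, and the helper `summarize_column` are all hypothetical stand-ins, not taken from any of the listed datasets; in practice you would open a file downloaded from one of the sources.

```python
import csv
import io
import statistics

# Hypothetical sample standing in for a CSV file downloaded from one of
# the sources above; the column names and values are made up.
SAMPLE_CSV = """city,median_rent
A,1200
B,1500
C,900
D,1800
"""


def summarize_column(csv_file, column):
    """Return count, mean, median, and sample standard deviation of a numeric column."""
    reader = csv.DictReader(csv_file)
    # Skip rows where the column is empty or missing.
    values = [float(row[column]) for row in reader if row[column]]
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),
    }


# io.StringIO lets the sample string behave like an open file; with a real
# download you would use open("your_file.csv", newline="") instead.
summary = summarize_column(io.StringIO(SAMPLE_CSV), "median_rent")
print(summary)
```

For larger files or richer analysis, a dataframe library would likely be more convenient; the standard-library version is used here only to keep the sketch dependency-free.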