Skip to content

Open Datasets

steviep42 edited this page Oct 14, 2019 · 16 revisions

Here are some tried and true sources of public datasets that might be useful to teachers, students, and autodidacts interested in analyzing "real" data in a ML/Data Science, image detection, recommendation systems, context or just to illustrate some basic statistics concepts. There are many more but these seem to be solid "go to destinations" for such material.

Source Second Description
Awesome Public Datasets A topic-centric list of HQ open datasets
UCI Machine Learning Repository One of the oldest data resources in the Net
Kaggle Datasets and Open source Jupyter Notebooks
r/Reddit Reddit for open datasets and discussions
MNIST Database of handwritten digits
ImageNET Image Database
Deep LearningNet Data for benchmarking deep learning algorithms:
Zillow Real Estate Research Home prices and rents
Movie Lens Rating Data sets from the MovieLens web site
Five Thirty Eight Data and code behind the articles and graphics at FiveThirtyEight
US Census United States Census Data
FBI Crime Data FBI Crime Data

Clone this wiki locally