Example datasets, used for testing Wukong -- and in many cases useful beyond that
This branch is 2 commits behind infochimps-data:master.

Adding submodule for geonames full datasets

latest commit a6b95fa16d
Philip (flip) Kromer authored
demographic organizing data
eventlogs organizing word lists
geo Adding submodule for geonames full datasets
graph
sports/baseball organizing data
stats/numbers
text organizing data
wikipedia adding wikipedia pagecounts (1.8GB) as submodule
.gitignore
.gitmodules Adding submodule for geonames full datasets
CREDITS.md
README-contents.md Rename README-contents to README-contents.md
README-getting_large_datasets.md
README.md Update README.md

README.md

Infochimps Data: Useful datasets from all over

Additional datasets

To keep the git repo from bloating, some datasets are included as git submodules rather than versioned directly in this repo. To fetch them as well, clone this repo with the --recursive flag, e.g.:

git clone --recursive https://github.com/infochimps-data/infochimps-data

Note: this is many gigabytes of data.
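
If you have already cloned the repo without --recursive, the submodules can still be fetched afterwards. The following is a minimal sketch, not part of the original README: the function name `fetch_submodules` and its optional directory argument are hypothetical, and it assumes git 1.8.5 or later (for `git -C`).

```shell
# Fetch submodule contents for an existing clone.
# Assumption: the first argument is the path to a checkout of this repo
# (defaults to the current directory). The function name is illustrative.
fetch_submodules() {
    repo_dir="${1:-.}"
    # --init registers the submodules listed in .gitmodules;
    # --recursive also fetches any submodules nested inside them.
    git -C "$repo_dir" submodule update --init --recursive
}
```

For example, `fetch_submodules infochimps-data` run from the parent directory of the clone. As with the recursive clone, expect this to download many gigabytes.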
