A collection of the best open data sets and open-source tools for data science
Ruby Python PHP JavaScript CSS Shell
Latest commit c3ff139 Sep 20, 2014 @petewarden Merge pull request #50 from webfanatic/bug_0049
Bug 0049
Failed to load latest commit information.
dataconversion Use linear interpolation on survival percentages to smooth out freque… May 27, 2013
docs Updated vagrant startup command documentation, added cross-site permi… Jan 7, 2014
dstk.xcodeproj Debugging locale problems Sep 30, 2013
php Added more duplicates to blacklist, recognize district of columbia co… Feb 10, 2012
public Updated API version references to 0.50 May 19, 2013
python Fix bug where json with wrong encoding causes exception Sep 20, 2014
rubygem added Geocode method and changed URL encoding method Jul 2, 2013
sql Added missing field to baby name loading May 26, 2013
test_suite Switched over to Data Science Toolkit naming and URLs. Added command … Mar 20, 2011
tests Fix bug where json with wrong encoding causes exception Sep 20, 2014
views Update developerdocs.haml May 20, 2014
Gemfile Switched to Ubuntu 12.04 setup instructions, added instructions for c… Jan 27, 2013
Gemfile.lock Pull in regions along with postal codes, and handle postal codes in t… Jan 29, 2013
config.ru More work on the switch to Data Science Toolkit naming and URLs. Upda… Mar 20, 2011
coordinates2politics.rb Implemented foundation of geographic statistics querying May 8, 2013
coordinates2statistics.rb Work on programatically listing statistics in the documentation May 14, 2013
cruftstripper.rb Working on text2sentences, text2html and boilerpipe Mar 18, 2011
dstk_config.rb Updated version and AMI information Oct 1, 2013
dstk_server.rb Added documentation for coordinates2statistics May 14, 2013
emulategoogle.rb Added debug logging Aug 26, 2014
genderfromname.rb Calculate the probability of a person's ethnic group based on their n… May 7, 2013
geodict_cli.rb Initial commit Mar 6, 2011
geodict_daemon.rb More work on the switch to Data Science Toolkit naming and URLs. Upda… Mar 20, 2011
geodict_lib.rb Improve logging on database connections May 13, 2013
gpl.txt Initial commit Mar 7, 2011
largetestinput.txt Initial commit Mar 7, 2011
mit.txt Updated docs with license information Jul 2, 2013
populate_database.rb Implemented foundation of geographic statistics querying May 9, 2013
readme.asciidoc Updated docs with license information Jul 2, 2013
street2coordinates.rb Removed country name suffix from addresses we recognize are in the US Jul 29, 2013
testinput.txt Implementing Python interface Mar 16, 2011
text2people.rb Added male percentage to text2people output Jun 6, 2013
text2sentiment.rb Updated documentation to cover sentiment analysis, and added API endp… Apr 30, 2013
text2times.rb Fixed bugs with text2times Apr 3, 2011
twofishes.conf Switched users for twofishes daemon Oct 2, 2013
twofishesd.sh Increased the maximum heap space for twofishes' JVM to avoid an OOM e… Feb 22, 2014

readme.asciidoc