Below are some useful references that will help you not only for this course but also for data analysis in general.
##Books
Book | Author | Availability |
---|---|---|
Python for Data Analysis | Wes McKinney | Penn Libraries (ebook) |
R Graphics Cookbook | Winston Chang | |
The R Book, 2nd Edition | Michael J. Crawley | Penn Libraries (ebook) |
Introductions to Mathematics for Life Scientists | Edward Batschelet | |
R Cookbook | Paul Teetor |
##Websites
Site | Category | Description |
---|---|---|
cookbook-r | R | Graphics, programming, analysis |
Andrew Gelman | Statistics | How to do statistics right and what often goes wrong |
Interesting IPython notebooks | Jupyter/IPython | IPython/IJulia notebooks on a variety of topics and tutorials |
List of stats blogs | Statistics | List of blogs (some links dead) that cover topics in statistics |
Google Python style guide | Python | Google's Python style guide - relative to Google internal, but recommended! |
Google R style guide | R | Google's R style guide - follow it! |
Princeton WW509 R examples | R | Germán Rodriguez's site with R instructionals and tutorials |
RStudio cheatsheets | R | RStudio cheatsheets for visualization, data munging, markdown syntax and others |
UCLA statistics | R | UCLA statistics site for learning to use R |
Beautiful charts in ggplot2 |
R | Tutorial for creating ggplot2 figures in a variety of styles |
FlowingData: beyond base charts in R | R | Creating aesthetically pleasing plots in base R |
R packages for data wrangling and visualization | R | R packages for retrieving and manipulating data |
Austin Clemens: 538-style plots in ggplot2 |
R | Create 538-style graphs in ggplot2 |
Data Origami: 538-style plots in Python | Python | Create 538-style graphs in Matplotlib |
Matplotlib style sheets |
Python | How to create custom styles in Matplotlib |
Google Python class | Python | Google's Python class with useful exercises |
Python cheat sheet by Dave Child | Python | Python cheat sheet with common commands and methods |
R reference card | R | Refernce card for R commands and packages |
R reference card for data mining | R | R reference card for data mining from RDataMining.com |
Sweave and knitr intro and examples |
R | Sweave and knitr introduction and template examples from Vanderbilit biostats |
GitHub tutorial by Karl Broman | Git/GitHub | Minimal tutorial on using Git/GitHub |
Text formatting with LaTex | LaTeX | Tutorial on LaTeX formatting commands |
Quick-R | R | Intro to R functions and capabilities with examples |
##Package documentation
##Datasets
Dataset | Site | Description |
---|---|---|
2010 US Census | US Census data | Population demographics and API for census data |
Federal Election Commission | FEC presidential | Data on donations and expenditures for Presidential races |
Federal Election Commission | FEC all | Data on donations and expenditures for all federal races |
MovieLens | Movie ratings | Database of movie ratings |
Social Security Administration | SSA baby names | SSA Baby Names database |
NY MTA | NY MTA measures | Performance and other information for NY MTA systems |
USDA nutrient database | Food nutrients | USDA nutrient database in a logically organized JSON format |
General data sources | Data Incubator suggested sources, part 1 | The Data Incubator's list of data for projects, Part 1 |
General data sources | Data Incubator suggested sources, part 2 | The Data Incubator's list of data for projects, Part 2 |
General data sources | List of data sources from MRAN | MRAN list of data sources of different types and formats |