Skip to content

Latest commit

 

History

History
75 lines (71 loc) · 6.93 KB

REFERENCES.md

File metadata and controls

75 lines (71 loc) · 6.93 KB

Useful references

Below are some useful references that will help you not only for this course but also for data analysis in general.

##Books

Book Author Availability
Python for Data Analysis Wes McKinney Penn Libraries (ebook)
R Graphics Cookbook Winston Chang
The R Book, 2nd Edition Michael J. Crawley Penn Libraries (ebook)
Introductions to Mathematics for Life Scientists Edward Batschelet
R Cookbook Paul Teetor

##Websites

Site Category Description
cookbook-r R Graphics, programming, analysis
Andrew Gelman Statistics How to do statistics right and what often goes wrong
Interesting IPython notebooks Jupyter/IPython IPython/IJulia notebooks on a variety of topics and tutorials
List of stats blogs Statistics List of blogs (some links dead) that cover topics in statistics
Google Python style guide Python Google's Python style guide - relative to Google internal, but recommended!
Google R style guide R Google's R style guide - follow it!
Princeton WW509 R examples R Germán Rodriguez's site with R instructionals and tutorials
RStudio cheatsheets R RStudio cheatsheets for visualization, data munging, markdown syntax and others
UCLA statistics R UCLA statistics site for learning to use R
Beautiful charts in ggplot2 R Tutorial for creating ggplot2 figures in a variety of styles
FlowingData: beyond base charts in R R Creating aesthetically pleasing plots in base R
R packages for data wrangling and visualization R R packages for retrieving and manipulating data
Austin Clemens: 538-style plots in ggplot2 R Create 538-style graphs in ggplot2
Data Origami: 538-style plots in Python Python Create 538-style graphs in Matplotlib
Matplotlib style sheets Python How to create custom styles in Matplotlib
Google Python class Python Google's Python class with useful exercises
Python cheat sheet by Dave Child Python Python cheat sheet with common commands and methods
R reference card R Refernce card for R commands and packages
R reference card for data mining R R reference card for data mining from RDataMining.com
Sweave and knitr intro and examples R Sweave and knitr introduction and template examples from Vanderbilit biostats
GitHub tutorial by Karl Broman Git/GitHub Minimal tutorial on using Git/GitHub
Text formatting with LaTex LaTeX Tutorial on LaTeX formatting commands
Quick-R R Intro to R functions and capabilities with examples

##Package documentation

Package Site Description
Jupyter/IPython http://ipython.org/ipython-doc/stable/index.html IPython documentation
SciPy http://docs.scipy.org/doc/scipy/reference/ SciPy documentation
NumPy http://docs.scipy.org/doc/numpy/reference/ NumPy documentation
pandas http://pandas.pydata.org/pandas-docs/stable/ pandas documentation
scikit-learn http://scikit-learn.org/stable/documentation.html scikit-learn documentation
Matplotlib http://matplotlib.org/contents.html Matplotlib documentation
SymPy http://docs.sympy.org/latest/index.html SymPy documentation
Seaborn http://stanford.edu/~mwaskom/software/seaborn/index.html Seaborn documentation
Statsmodels http://statsmodels.sourceforge.net/devel/index.html Statsmodels documentation
dplyr http://cran.r-project.org/web/packages/dplyr/index.html CRAN dplyr page
lme4 http://cran.r-project.org/web/packages/lme4/index.html CRAN lme4 page (see also GitHub)
ggplot2 http://cran.r-project.org/web/packages/ggplot2/index.html CRAN ggplot2 page
knitr http://cran.r-project.org/web/packages/knitr/index.html CRAN knitr page
multcomp http://cran.r-project.org/web/packages/multcomp/index.html CRAN multcomp page
languageR http://cran.r-project.org/web/packages/languageR/index.html CRAN languageR page
Sweave https://www.statistik.lmu.de/~leisch/Sweave/ Sweave documentation
RMarkdown http://rmarkdown.rstudio.com/ RMarkdown documentation
'Hadleyverse' https://github.com/hadley?tab=repositories Hadley Wickham's GitHub repos
Markdown http://daringfireball.net/projects/markdown/syntax Markdown syntax documentation

##Datasets

Dataset Site Description
2010 US Census US Census data Population demographics and API for census data
Federal Election Commission FEC presidential Data on donations and expenditures for Presidential races
Federal Election Commission FEC all Data on donations and expenditures for all federal races
MovieLens Movie ratings Database of movie ratings
Social Security Administration SSA baby names SSA Baby Names database
NY MTA NY MTA measures Performance and other information for NY MTA systems
USDA nutrient database Food nutrients USDA nutrient database in a logically organized JSON format
General data sources Data Incubator suggested sources, part 1 The Data Incubator's list of data for projects, Part 1
General data sources Data Incubator suggested sources, part 2 The Data Incubator's list of data for projects, Part 2
General data sources List of data sources from MRAN MRAN list of data sources of different types and formats