Ask the right questions, manipulate data sets, and create visualizations to communicate results - Coursera
- Welcome
- Introduction to basic tools
- Installing the Toolbox
- R, Git, Github
- Conceptual Issues
- Steps in a data analysis, Putting the science in data science
- Course Project Submission & Evaluation
- Background, Getting Started, and Nuts & Bolts
- Programming with R
- Derek Franks has written a very nice tutorial to help you get up to speed
- Loop Functions and Debugging
- lapply, apply, tapply, split, mapply
- Simulation & Profiling
- Simulate a random normal variable with an arbitrary mean and standard deviation
- Data collection & Data formats
- Raw files (.csv,.xlsx), Databases (mySQL), APIs, Flat files (.csv,.txt), XML, JSON
- Making data tidy
- Distributing data
- Scripting for data cleaning
- Making exploratory graphs
- Principles of analytic graphics
- Clustering methods
- Dimension reduction techniques
- Concepts, Ideas, & Structure
- Markdown & knitr
- Reproducible Research Checklist & Evidence-based Data Analysis
- Case Studies & Commentaries
- Probability & Expected Values
- Variability, Distribution, & Asymptotics
- Intervals, Testing, & Pvalues
- Power, Bootstrapping, & Permutation Tests
- Least Squares and Linear Regression
- Linear Regression & Multivariable Regression
- Multivariable Regression, Residuals, & Diagnostics
- Logistic Regression and Poisson Regression
- Prediction, Errors, and Cross Validation
- The Caret Package
- Predicting with trees, Random Forests, & Model Based Predictions
- Regularized Regression and Combining Predictors
- Shiny, GoogleVis, and Plotly
- R Markdown and Leaflet
- R Packages
- Swirl and Course Project
The most up to date information on the course lecture notes will always be in the course Github repository