For the Spring 2014 Data Science course at General Assembly, NYC
Monday, 3/31/14
Wednesday, 4/7/14
#####Learning how to use the file pager, less
Handy to have this in your bookmarks!
Wednesday 4/9/2014
- Watch the 5 minute "Ipython Notebook Tour"
- Review "What is NumPy"
- Watch Wes McKinney's 10 minute Whirlwind Tour of Pandas (even once is ok ;-) )
- Another great resource: Review Chapters 1 to 5 of Julia Evans Cookbook
Monday 4/14/2014
Wednesday 4/16/2014
Lecture Notes: Data Visualization
Python Notebook: Plotting with Matplotlib
- Complete and submit previous assignments
Resource | About |
---|---|
Basic Plotting in Pandas | |
Matplotlib userguide | |
Matplotlib Gallery | Examples with Code |
Rougier and Prace EuroSciPy Matplotlib Tutorial | Short Overview |
Monday 4/21/2014 We'll be reviewing a number of datasets and going through the Data Exploration Process
The ACES model for Data Exploration:
Letter | Step | Notes |
---|---|---|
A | Assemble the data frame | Find data, import into Pandas |
C | Clean the data frame | Identify and limit columns, rows, indices, dates, etc. |
E | Explore global properties | Visualize! Basic plots and stats appropriate to the data set |
S | Subset comparisons | Look at (visualize!) initial emergenet variable relationships and subsets |
- EDA with SAT Scores
- Grouping with Pandas
- Data Wrangling Movies
- EDA Questions
- Volinksy EDA Presentation
N/A - Please review all prior materials and work on Project 1.
Wednesday 4/23/2014
[Project 1: Scraping, APIs, and Data Visualization](Project 1https://github.com/datadave/GADS9-NYC-Spring2014-Lectures/blob/master/projects/project01.md)
- Selected Presentations of Student Projects
- Discussion of Data Science Careers
- Introduction to Machine Learning