
DeepthiMo/DAT_SF_12

 
 


DAT SF 12

Instructor: Alessandro Gagliardi
EiRs: Ramesh Sampath, Otto Stegmaier, Alex Chao
Classes: 6:30pm-9:30pm, Tuesdays and Thursdays, January 15 – March 31
Office Hours:
  • Alex Chao, 5:30 - 6:30 before class at GA
  • Otto Stegmaier, 9:30 - 10:00 after class at GA
  • Ramesh Sampath, 4:00 - 6:00 Saturdays, remote
  • Also available by appointment

Homework is to be submitted by posting it to your own GitHub repo. Then post the URL and folder where the homework lives here.


Tentative Course Outline

  1. Intro to Data Science, Relational Databases & SQL
  2. Getting started with IPython & Git
  3. APIs and semi-structured data
  4. IPython.parallel & StarCluster
  5. Hadoop Distributed File System and Spark
  6. Intro to ML: k-Nearest Neighbor Classification
  7. Clustering: Hierarchical and K-Means

  8. Probability, A/B Tests & Statistical Significance
  9. Multiple Linear Regression and ANOVA
  10. Logistic Regression and Generalized Linear Models
  11. Project Elevator Pitches
    • See Student Project Repos below
  12. Naïve Bayes, Cross Validation, ROC, AUC & Midterm Review - Part I
  13. Naïve Bayes, Cross Validation, ROC, AUC - Part II
  14. Principal Components Analysis
  15. Nonlinear Models
  16. Grid Search and Parameter Selection
  17. Bringing it Together
  18. Final Project Working Session
  19. Final Project Working Session
  20. Final Project Presentations (12 min. each)
  21. Final Project Presentations (12 min. each)
  22. Future Directions

Project Schedule

| Date | Due | Returned |
| --- | --- | --- |
| 1/22 | Preliminary Project Proposals Due (3-4 sentences) | |
| 1/27 | Homework 1 | |
| 1/29 | | EiR Feedback on Project Proposals |
| 2/3 | | EiR Feedback on Homework 1 |
| 2/5 | Formal Proposals (including data and methods chosen) | |
| 2/10 | Homework 2 Assigned | |
| 2/12 | | EiR Feedback on Formal Proposals |
| 2/17 | Homework 2 Due | |
| 2/19 | Project Elevator Pitch in class (4 minutes each) | Project Live on Github |
| 2/24 | Homework 3 Assigned | |
| 2/26 | Peer Feedback of Projects | Peer Feedback on Project |
| 3/3 | Midterm Assessment Posted | |
| 3/10 | Midterm Assessment Due | |
| 3/17 | At least one working model | |
| 3/24-26 | Final Presentations (12 minutes each) | Midterm Graded |

Student Project Repos:

| Student | Repo |
| --- | --- |
| Ajay Anand | sryballin/GeneralAssembly-DS |
| Zachary Cousens | zfcousens/DAT_SF_12/tree/gh-pages/Project |
| Carmen Diaz Echauri | cde/? |
| Deepthi Duddempudi | DeepthiGA/Project |
| Vijay Duraipalam | coolcalguy/DAT_SF_12/tree/gh-pages/Project |
| Cheong-tseng Eng | ctteng/GA-Proj-GPSAnomalyDetection.git |
| David Feng | selwyth/neighborhood |
| Isabel Friedman | isabitz/whales |
| Dave Halvorson | git-halvorson/DAT_SF_12/tree/gh-pages/FinalProject |
| Alison Harmon | alharmon13/DAT_SF_12/tree/gh-pages/project |
| Markus Huber | mbhuber/USconsumers |
| Ryan Hughes | cryhughes/AVS-Kaggle |
| Tania Ibanez | positiveepsilon/GA_Project |
| Roxana Ordonez | rockyroxana/bike-share-forecast.git |
| Justin Peterson | justinrpeterson/? |
| April Song | khsong92/ga_ds |
| India Swearingen | iswearingen/DAT_SF_12/blob/gh-pages/Homework/Project-IS-load-data.ipynb |
| Bing Wang | bingbingboo/DAT_SF_12/blob/gh-pages/Homework/2014flightdatalab.ipynb |
| Jaime Williams | jawilliams3000/OaklandCrime |
| David Yerrington | dyerrington/Rapstats |
| Matt Jones | jonesmatt415/NCAA-Prediction-Project- |
