Skip to content

Teaching materials for the part-time Data Science course at General Assembly London

License

Notifications You must be signed in to change notification settings

themarkjensen/ga-data-science

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Part-time Data Science course

By Gianluca Campanella (g.campanella@estimand.com)

Creative Commons License

This repository contains teaching materials (slides and Jupyter Notebooks) for the part-time Data Science course at General Assembly London.

Aims

By the end of the course, you should be able to:

  • Apply the Data Science workflow to solve problems with data
  • Load, manipulate, and summarise data using pandas
  • Produce basic data visualisations using matplotlib and seaborn
  • Perform hypothesis testing, and interpret the output of generalised linear models
  • Build and validate predictive models using sklearn

Schedule

Session Topic
1 Introduction to Data Science
2 Managing data and analyses
3 Causality and study design
4 EDA and data visualisation
Lightning talks
5 Statistical thinking
6 Generalised linear models
7 Time series
Recap
8 Regression
9 Classification
10 Clustering
11 Dimensionality reduction
Recap
12 Decision trees and random forests
13 Natural language processing
14 Ensemble methods
15 SVMs and neural networks
Final presentations

About

Teaching materials for the part-time Data Science course at General Assembly London

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 96.9%
  • TeX 3.0%
  • Other 0.1%