Skip to content

Repo for the Spring 2021 iteration of IS507: Data, Stats & Info at the iSchool, UIUC

License

Notifications You must be signed in to change notification settings

jnaiman/is507_spring2021

Repository files navigation

IS507, Spring 2021 (taught online, synchronously; 3 hrs/class)

NOTE: check each week to see if its IN-DEV or READY for each week

These are the course materials for IS507 (previously IS542) at the iSchool, University of Illinois, Urbana-Champaign.

If you see any bugs or errors please issue a PR -- always looking to make things better!

How to use this repo

Each week consists of lecture slides and prep-notebooks in the programming language R along with suggested resources and readings. The readings consist of what was "required" for the course as well as optional extra readings so feel free to take/leave what you'd like.

Below is the outline of the course, with links to the individual folders for each week. Each folder contains:

  1. an "README.md" file which lists the suggested readings, datasets and any extra resources for the week
  2. The lecture slides (as a pdf)
  3. Prep coding notebooks used for the "live coding" portions of class which generally happened after the lecture portion (.ipynb files) BUT sometimes we swtich between the lecture slides & R. This will be denoted in the slides as well as R.
  4. Any datasets used in the coding portion

Course Outline

Example syllabus here.

Week Link Topic
Week 01 Course Intro & Motivation, Intro to R
Week 02 Intro to Numerical Data, Intro to R
Week 03 Intro to Categorical Data, Table Proportions, and Probability Theory
Week 04 Random Variables, Continous Probability Distributions
Week 05 The Normal Distribution
Week 06 The Normal & Binomial Distributions
Week 07 Foundations for Inference; Hypothesis testing: Normal, T-distribution, and single proportions;
differences of 2 means/proportions, paired data
Week 08 Hypothesis testing: ANOVA and models
Week 09 Fake Break! We'll do some fun stats stuff in Python.
Week 10 Linear Regression & Multiple Linear Regression
Week 11 Intro to classification & Logistic Regression
Week 12 Classification with KNN & Beginning Model Selection with CV & Bootstrapping
Week 13 Model Selection & Shrinkage Methods for Linear Regression
Week 14 Lasso Regression & CV; Intro to PCA; Course wrap-up

Reading

This course is based off of the following textbooks:

Required

Optional

Installation of R, RStudio and Jupyter notebooks

  1. Download R from the R-project webpage
  2. Courses were taught using RStudio which you can download right here

Totally optional: To run the Jupyter notebooks with R locally, install Anaconda and then please follow the instructions for installing R using the Anaconda Navigator. The easiest way to install packages is through the Anaconda GUI, or you can conda install -- either way you need to append an r- to all packages!

NOTE: There are pay-versions of this software but we assume you are using the free versions.

TODO

  • Add in a "who this course is for" section -- describe your typical iSchool student
  • Add in photos of RStudio & label panels
  • Link to Data ag install list?
  • Add in pedagogy links and general references
  • Add in "working on" as far as online teaching strategies -- what is currently working and is not
  • add in course pre-reqs
  • add in slides with lecture notes
  • collapse answers to practice problems
  • Add in instructor notes for all weeks pages

Weeks

  • Week 02
    • A-void notebook
  • Week 03
    • Add in extra GLM stuff, corrplot, links to datacamp
  • Week 10
    • ERROR: the tabplot library won't load in jupyter notebooks
  • Week 15
    • add in-person "real life" example using the motion data and KNN
  • Week 16
    • prep notebook for PCA isn't included as of now

Stretch Goals

  • include bayesian stuffs as a bonus class

Disclaimer

My background is in astrophysics (hydro simulations) so there will be an abundance of astronomy examples and space jokes. Also my spelling is atrocious. You have been warned.

About

Repo for the Spring 2021 iteration of IS507: Data, Stats & Info at the iSchool, UIUC

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages