- A tabular data manipulation library.
- Content in the
pandas/
directoryexamples.py
contains some things you can do in pandasexercise.py
contains some practice problems exercising techniques fromexamples.py
solutions.py
contains solutions fromexercise.py
- A numerical array library.
- General Reference
- Reference for generating statistical distributions
- Content in the
numpy/
directoryexamples.py
contains some things you can do numpyexercise.py
contains some practice problems exercising techniques fromexamples.py
solutions.py
contains solutions fromexercise.py
- A plotting library.
- Pyplot reference.
- Content in the
matplotlib/
directoryexamples.py
shows some plots you can make with matplotlibexercise.py
contains a difficult matplotlib plotsolutions.py
contains an example implementation for the exercise
- A machine learning library.
- We will learn about clustering algorithms. In particular, the K-means algorithm and the Gaussian Mixed Model algorithm.
- Content in the
scikit_learn/
directoryexamples.py
contains some unsupervised clustering algorithm examples
- Another resource containing the same examples.
- An example exercise for data cleanup and normaliztion. This is a very common problem.