Skip to content

ephes/data_science_tutorial

Repository files navigation

Data Science Tutorial

Cover image for a Data Science / Machine Learning Course or Meetup

This tutorial covers topics that are relevant or interesting for me for some kind of reason 😄. Have fun!

🚧 At the moment this tutorial is under construction 🚧. It will stay in that state for quite some time, but I will add new tutorials one by one. So it might be a good idea to check back periodically for updates.

Installation

You can run the chapter notebooks on your local machine or on Google Colab.

On your local machine you need to clone the repo, create a virtualenv and start your preferred notebook server.

$ git clone git@github.com:ephes/data_science_tutorial.git
$ python -m venv venv
$ venv/bin/python -m pip install jupyterlab
$ venv/bin/jupyter lab

Usually, the dependencies needed for a notebook should be installed from within the notebook itself using the %pip cell magic. Packages that you install using %pip will be installed into the virtualenv that you run the notebook server from. It should make no difference whether you use classic or conda virtual environments.

Text Classification

Chapter 01: Getting started with Text Classification docs

Foundations

🏗 I added the notebooks below already because they should work, but they are also still very much work in progress. 👷

Numpy

Numpy is the basis of a lot of stuff in related to data science in Python.

Chapter 08: Numpy Overview Covering the Basic Features docs

Pandas

Pandas is very useful for all kinds of pre-processing and data cleaning.

Chapter 09: Using Pandas docs

About

Python data science and machine learning tutorial

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages