
PyData Carolinas 2016 Tutorial: Datascience on the web

Welcome to Datascience on the web, with Don and Francois.

You have been given a paper with a URL in the form:

Point your browser to it and type the accompanying password.

Problems? Raise your hand and somebody will help you, perhaps even your neighbor. Also, feel free to tweet about this session. I am @f_dion and this is #datascience and #flask at #pydatacarolinas.

After the fact

The unrefactored notebook is here while the refactored one is here.

Once you have run through the whole refactored notebook, you will have training and test sets saved in data/ and a trained model in trained_models/. To make these available in the tutorial directory, you will have to run the script. On a Unix-like environment (macOS, Linux, etc.):

chmod a+x
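
Before running the script, it can help to confirm that the notebook actually wrote its outputs. The following is a minimal sketch, not part of the tutorial code: only the directory names data/ and trained_models/ come from the text above, while the helper name `list_artifacts` and the idea of listing whatever files are present are assumptions made for illustration.

```python
from pathlib import Path

# Directory names taken from the README text; everything else is illustrative.
EXPECTED_DIRS = ("data", "trained_models")


def list_artifacts(base_dir="."):
    """Return {directory name: sorted file names} for each expected directory.

    A directory that does not exist yet maps to an empty list.
    (Hypothetical helper, not part of the tutorial repository.)
    """
    base = Path(base_dir)
    found = {}
    for name in EXPECTED_DIRS:
        folder = base / name
        if folder.is_dir():
            # Only count regular files, not nested directories.
            found[name] = sorted(p.name for p in folder.iterdir() if p.is_file())
        else:
            found[name] = []
    return found


if __name__ == "__main__":
    for name, files in list_artifacts().items():
        status = ", ".join(files) if files else "nothing found yet"
        print(f"{name}/: {status}")
```

Run it from the directory that contains the refactored notebook. An empty listing for either directory suggests the notebook has not been run all the way through.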


The whole session is now on YouTube: Francois Dion & Don Jennings Datascience on the web


This repository will get a few more files after the tutorial, including some PDFs. Make sure you watch the repo if you want the latest information.

The basics

The Machine Learning

The visualization

Further reading

Check out these awesome lists:

Automating the basics