Skip to content

Latest commit

 

History

History
59 lines (39 loc) · 1.81 KB

README.rst

File metadata and controls

59 lines (39 loc) · 1.81 KB

Doing Data Science in Python

Python code samples and solutions based on examples and exercises in Doing Data Science by Rachel Schutt and Cathy O'Neil

These code samples and solutions are splitted into different IPython/Jupyter notebooks.

Clone this repo and launch ipython notebook at the root directory of this project. You can also launch the Bash script launch.sh which carries out the same command with some specific ipython options.

You can also use the NBViewer website to read a notebook.

Note: This repo aims to practice myself in Data Science and to show that's easy to do Data Science in Python.

Data Samples

Data samples can be downloaded at the official Doing Data Science Github project at https://github.com/oreillymedia/doing_data_science

For some exercises, I provide a Python script which downloads needed files.

Notebooks

  • Chapter 2 - Exploratory Data Analysis

    Play with some (simulated) data about ads shown and clicks recorded on the NY Time home page.

    The file down_nyt.py downloads the 31 CSV files into the nyt-data directory. Just do:

    python down_nyt.py
    

    and wait... it can be a quite long.

Requirements