Interactive notebooks you can use to learn how to analyze lots of data
datasets Added worksheet 220
images Rehauled content
scripts Rehauled content
slides Polished notebooks and slides
.gitignore Updated notebook 200
000 Set goals for cross-disciplinary analytics hands-on workshop.ipynb Revived original purpose
010 Count the number of links on a webpage.ipynb Added any() note
011 Explore interactively with IPython.ipynb Added YouTube video
050 Find the fastest path from Brooklyn Bridge to 96th and Broadway.ipynb Rehauled content
051 Analyze network relationships with NetworkX.ipynb Rehauled content
090 Explore the NYC MTA subway system.ipynb Polished notebooks and slides
100 Animate a heatmap.ipynb Polished notebooks and slides
111 Work with matrices in NumPy.ipynb Rehauled content
121 Apply optimized algorithms from SciPy.ipynb Polished notebooks and slides
131 Visualize numerical data with Matplotlib.ipynb Polished notebooks and slides
200 Find regions with recent earthquakes.ipynb Polished notebooks and slides
210 Explore NYC 311 service requests.ipynb Polished notebooks and slides
211 Model time series with pandas.ipynb Saved update
220 Analyze business data.ipynb Added worksheet 220
250 Compare yearly prices of organic vs conventional spinach.ipynb Rehauled content
251 Compute statistics with statsmodels.ipynb Polished notebooks and slides
300 Count graffiti sightings within 100 feet of a subway entrance.ipynb Fixed proj4NY
321 Analyze spatial relationships with GeometryIO, Shapely, PySAL.ipynb Polished notebooks and slides
400 Discover informative features when classifying handwritten digits.ipynb Rehauled content
401 Harness machine learning with Scikit-Learn.ipynb Polished notebooks and slides
421 Compare supervised learning techniques.ipynb Polished notebooks and slides
430 Identify key user conversion metrics after split testing.ipynb Polished notebooks and slides
431 Select informative features.ipynb Rehauled content
450 Segment users to find market opportunities.ipynb Polished notebooks and slides
451 Compare unsupervised learning techniques.ipynb Rehauled content
460 Listen for unusual activity.ipynb Polished notebooks and slides
481 Parallelize symbolic mathematics on multi-dimensional arrays with Theano.ipynb Rehauled content
491 Prototype GPU computation techniques with PyCUDA.ipynb Rehauled content
501 Discuss practical considerations in real-world applications.ipynb Polished notebooks and slides
551 Explore cloud computing concepts with pika.ipynb Rehauled content
600 Rank the influence of alcohol and chocolate on marriage age.ipynb Polished notebooks and slides
620 Estimate Big Mac prices from World Bank contracts.ipynb Polished notebooks and slides
700.ipynb Polished notebooks and slides
999 Contribute to open source projects.ipynb Rehauled content Updated README
TODO.txt Updated TODO

Cross-disciplinary computational analysis

Here is a growing collection of interactive IPython Notebook tutorials on large-scale data analysis.

Install packages

cd ~/Documents
git clone
cd crosscompute-scripts

Run notebooks

cd ~/Documents
# Download and unpack the notebooks into a folder
git clone
# Activate virtual environment and start IPython Notebook in the folder
cd crosscompute-tutorials

Ask questions

Stay updated
