Interactive notebooks you can use to learn how to analyze lots of data
datasets
images
scripts
slides
.gitignore
000 Set goals for cross-disciplinary analytics hands-on workshop.ipynb
010 Count the number of links on a webpage.ipynb
011 Explore interactively with IPython.ipynb
050 Find the fastest path from Brooklyn Bridge to 96th and Broadway.ipynb
051 Analyze network relationships with NetworkX.ipynb
090 Explore the NYC MTA subway system.ipynb
100 Animate a heatmap.ipynb
111 Work with matrices in NumPy.ipynb
121 Apply optimized algorithms from SciPy.ipynb
131 Visualize numerical data with Matplotlib.ipynb
200 Find regions with recent earthquakes.ipynb
210 Explore NYC 311 service requests.ipynb
211 Model time series with pandas.ipynb
220 Analyze business data.ipynb
250 Compare yearly prices of organic vs conventional spinach.ipynb
251 Compute statistics with statsmodels.ipynb
300 Count graffiti sightings within 100 feet of a subway entrance.ipynb
321 Analyze spatial relationships with GeometryIO, Shapely, PySAL.ipynb
400 Discover informative features when classifying handwritten digits.ipynb
401 Harness machine learning with Scikit-Learn.ipynb
421 Compare supervised learning techniques.ipynb
430 Identify key user conversion metrics after split testing.ipynb
431 Select informative features.ipynb
450 Segment users to find market opportunities.ipynb
451 Compare unsupervised learning techniques.ipynb
460 Listen for unusual activity.ipynb
481 Parallelize symbolic mathematics on multi-dimensional arrays with Theano.ipynb
491 Prototype GPU computation techniques with PyCUDA.ipynb
501 Discuss practical considerations in real-world applications.ipynb
551 Explore cloud computing concepts with pika.ipynb
600 Rank the influence of alcohol and chocolate on marriage age.ipynb
620 Estimate Big Mac prices from World Bank contracts.ipynb
700.ipynb
999 Contribute to open source projects.ipynb
TODO.txt

Cross-disciplinary computational analysis

This repository is a growing collection of interactive IPython Notebook tutorials on large-scale data analysis.

Install packages

cd ~/Documents
git clone
cd crosscompute-scripts

Run notebooks

cd ~/Documents
# Download and unpack the notebooks into a folder
git clone
# Activate virtual environment and start IPython Notebook in the folder
cd crosscompute-tutorials
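
The commands that activate the virtual environment and start the notebook server are not shown above. As a rough sketch only, assuming the environment is already active and that either `jupyter` (newer installations) or `ipython` (older installations) is on the PATH, the start-up step might look like the following. The helper name `notebook_launcher` is hypothetical and not part of this repository.

```shell
# Hypothetical helper (not part of the repository): print the command that
# would start the notebook server, based on which launcher is installed.
notebook_launcher() {
    if command -v jupyter >/dev/null 2>&1; then
        # Newer installations provide the notebook server through Jupyter.
        # Assumption: the notebook package is installed alongside jupyter.
        echo "jupyter notebook"
    elif command -v ipython >/dev/null 2>&1; then
        # Older IPython releases start the server with "ipython notebook".
        echo "ipython notebook"
    else
        echo "no notebook launcher found on PATH" >&2
        return 1
    fi
}
```

From inside the crosscompute-tutorials folder, running `launcher=$(notebook_launcher) && $launcher` would then start the server in that folder, provided one of the two launchers is installed.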

