Introduction to probabilistic programming using PyMC3
Installation guide for Apache Spark + Hadoop on Mac/Linux
DSI Self Study Resources
Live coding repo to demo at an info session
The statistics short course is both a resource and a survey of the areas of probability and statistics that are foundational for the data science immersive at Galvanize.
A presentation of commonly observed beginner mistakes.
A set of self-paced resources for anyone looking to get into data science. The materials assume an absolute beginner and are intended to prepare students for the Galvanize Data Science interview process: http://www.galvanize.com/courses/data-science/
An in-depth tutorial on sklearn's Pipeline and FeatureUnion classes.
Scripts to clean up and update apps on the lab Macs.
Repository for Programming Assignment 2 for R Programming on Coursera
Find faces in images and get facial info with this easy REST API.
Generate pairs for pair programming that are guaranteed not to repeat
A repo for all of the web-scrapers I've ever built
An awesome list of (large-scale) public datasets on the Internet. (Ongoing collection)
The Jekyll theme for your personal landing page.
Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.
A little utility that lets students post the performance of their models to Slack.
Examples of code using our API
Best practices for using Spark, aimed at practicing data scientists, in the context of a data scientist’s standard workflow.
Event management app using Swift
An example repository containing an exercise and lecture used as a reference for instructor mock lectures.
Bash Crash Course
Aspect-based opinion mining on Yelp reviews
Flight rules for git
Feature-Based Sentiment Analysis in Python