A suite of visual analysis and diagnostic tools to facilitate feature selection, model selection, and parameter tuning for machine learning.
Notebooks and materials for DDL/CEB training.
Code & Data for Introduction to Machine Learning with Scikit-Learn
Resources and materials related to PyCon 2017.
Multidimensional data explorer and visualization tool.
Private repo for PPM Data team.
A web application that identifies party affiliation in political discourse, and an example of operationalized machine learning.
Code and Notebooks for the Natural Language Processing with Python course.
Data and code for "Fast Data Applications with Spark and Python"
Tribe extracts a network from an email mbox and writes it to a GraphML file for visualization and analysis.
Public code files for the DDL blog
Code bases, tutorials, posters, and other content for PyCon 2016.
Minimum Entropy is a DDL-hosted question/answer site for beginners who need answers to data science questions.
A simple web application for activity tracking and event aggregation.
Fast topic survey with associated word cloud visualization on completion.
Code and slides for supervised machine learning in R
An automated ingestion service for blogs to construct a corpus for NLP research.
Examples for using the dedupe library
Building a simple Python application - Calendar Application Tutorial
Tutorial on how to ingest data from Twitter into MongoDB
Generating the next read for our book club, with data science!
My first repository on GitHub
Solution to the NBA analysis workshop
An example data product using Django
Private repo for Team 7.