Visual analysis and diagnostic tools to facilitate machine learning model selection.
An automated ingestion service for blogs to construct a corpus for NLP research.
resources for pycon 2018
Notebooks and code for "Visual Pipelines for Text Analysis" at the Data Intelligence Conference: June 23, 2017.
Multidimensional data explorer and visualization tool.
Resources and materials related to PyCon 2017.
Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.
Teaching materials for web scraping class
Finding how common the strangers in your life are (reword)
Notebooks and materials for DDL/CEB training.
Code & Data for Introduction to Machine Learning with Scikit-Learn
A machine learning approach to recording and analyzing the 2016 election.
Private repo for PPM Data team.
A web application that identifies party in political discourse and an example of operationalized machine learning.
Code and Notebooks for the Natural Language Processing with Python course.
Data and code for "Fast Data Applications with Spark and Python"
Graph extraction and NLP analysis for Baleen Corpora
Public code files for the DDL blog
Code bases, tutorials, posters, and other content for PyCon2016.
Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.
Repository for Incubator 4 Team 3
Repository for Incubator 4 Team 5
Repository for Incubator 4 Team 1
A simple web application for activity tracking and event aggregation.
Repository for Incubator 4 Team 4
Fast topic survey with associated word cloud visualization on completion.
Code and slides for supervised machine learning in R
Repository for Incubator 4 Team 2
Examples for using the dedupe library