@DistrictDataLabs

District Data Labs

Data Science Research, Open Source Projects, and Content.

Pinned repositories

  1. yellowbrick

    A suite of visual analysis and diagnostic tools to facilitate feature selection, model selection, and parameter tuning for machine learning.

    Python 149 45

  2. baleen

    Forked from bbengfort/baleen

    An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 5

  3. partisan-discourse

    A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 1

  • A suite of visual analysis and diagnostic tools to facilitate feature selection, model selection, and parameter tuning for machine learning.

    Python 149 45 Updated Jan 16, 2017
  • Notebooks and materials for DDL/CEB training.

    Jupyter Notebook 1 5 Updated Jan 13, 2017
  • Code & Data for Introduction to Machine Learning with Scikit-Learn

    Jupyter Notebook 49 58 Updated Jan 10, 2017
  • Resources and materials related to PyCon 2017.

    Updated Dec 21, 2016
  • Multidimensional data explorer and visualization tool.

    HTML 24 20 Updated Dec 14, 2016
  • Private repo for PPM Data team.

    Jupyter Notebook 1 Updated Oct 12, 2016
  • A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 1 Updated Sep 14, 2016
  • Code and Notebooks for the Natural Language Processing with Python course.

    Jupyter Notebook 28 27 Updated Sep 11, 2016
  • Data and code for "Fast Data Applications with Spark and Python"

    Python 19 22 Updated Sep 11, 2016
  • Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.

    Jupyter Notebook 28 36 Updated Sep 9, 2016
  • Public code files for the DDL blog

    Python 41 28 Updated Jul 26, 2016
  • Code bases, tutorials, posters, and other content for PyCon2016.

    JavaScript 19 12 Updated Jul 22, 2016
  • Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 1 1 Updated Jul 11, 2016
  • A simple web application for activity tracking and event aggregation.

    Python Updated Jun 23, 2016
  • Fast topic survey with associated word cloud visualization on completion.

    HTML 7 Updated May 13, 2016
  • Code and slides for supervised machine learning in R

    HTML 2 Updated May 4, 2016
  • An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 5 19 Updated Mar 8, 2016
  • Jupyter Notebook 1 1 Updated Mar 1, 2016
  • Examples for using the dedupe library

    Python 2 40 Updated Feb 22, 2016
  • Building a simple Python application - Calendar Application Tutorial

    Python 7 Updated Feb 12, 2016
  • Tutorial on how to ingest data with Twitter into MongoDB

    Python 17 Updated Feb 12, 2016
  • Jupyter Notebook 1 9 Updated Feb 9, 2016
  • Generating the next read for our book club- with Data Science!

    Python 35 60 Updated Feb 6, 2016
  • My first repository on Github

    Jupyter Notebook 3 Updated Jan 15, 2016
  • Solution to the NBA analysis workshop

    Jupyter Notebook 5 Updated Dec 11, 2015
  • HTML 1 Updated Nov 12, 2015
  • Python Updated Nov 5, 2015
  • An example data product using Django

    Python 7 5 Updated Nov 3, 2015
  • Private repo for Team 7.

    JavaScript 1 Updated Oct 26, 2015
  • DATA BANDITS

    JavaScript 2 Updated Oct 24, 2015