@DistrictDataLabs

District Data Labs

Data Science Research, Open Source Projects, and Content.

Pinned repositories

  1. yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 399 92

  2. partisan-discourse

    A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 12 5

  3. cultivar

    Multidimensional data explorer and visualization tool.

    HTML 34 22

  4. minimum-entropy

    Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 5 3

  5. PyCon2017

    Resources and materials related to PyCon 2017.

    HTML 3 2

  • Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 399 92 Updated Jul 20, 2017
  • An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 28 26 Updated Jul 5, 2017
  • Notebooks and code for "Visual Pipelines for Text Analysis" at the Data Intelligence Conference: June 23, 2017.

    Jupyter Notebook 2 Updated Jun 25, 2017
  • Multidimensional data explorer and visualization tool.

    HTML 34 22 Updated May 23, 2017
  • Resources and materials related to PyCon 2017.

    HTML 3 2 Updated May 23, 2017
  • Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.

    Jupyter Notebook 39 47 Updated May 6, 2017
  • Teaching materials for a web scraping class.

    Jupyter Notebook 4 3 Updated Mar 31, 2017
  • Finding how common the strangers in your life are.

    Python 2 Updated Feb 14, 2017
  • Notebooks and materials for DDL/CEB training.

    Jupyter Notebook 5 5 Updated Feb 9, 2017
  • Code & Data for Introduction to Machine Learning with Scikit-Learn

    Jupyter Notebook 59 65 Updated Jan 9, 2017
  • A machine learning approach to recording and analyzing the 2016 election.

    Jupyter Notebook 1 Updated Dec 16, 2016
  • Private repo for PPM Data team.

    Jupyter Notebook 1 1 Updated Oct 12, 2016
  • A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 12 5 Updated Sep 14, 2016
  • Code and Notebooks for the Natural Language Processing with Python course.

    Jupyter Notebook 33 35 Updated Sep 11, 2016
  • Data and code for "Fast Data Applications with Spark and Python"

    Python 25 23 Updated Sep 11, 2016
  • Graph extraction and NLP analysis for Baleen Corpora

    Python 7 5 Updated Sep 8, 2016
  • Public code files for the DDL blog

    Python 46 34 Updated Jul 26, 2016
  • Code bases, tutorials, posters, and other content for PyCon2016.

    JavaScript 21 20 Updated Jul 22, 2016
  • Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 5 3 Updated Jul 11, 2016
  • Repository for Incubator 4 Team 3

    Jupyter Notebook Updated Jul 7, 2016
  • Repository for Incubator 4 Team 5

    Jupyter Notebook 1 Updated Jun 24, 2016
  • Repository for Incubator 4 Team 1

    CSS Updated Jun 24, 2016
  • A simple web application for activity tracking and event aggregation.

    Python Updated Jun 23, 2016
  • Repository for Incubator 4 Team 4

    CoffeeScript 1 Updated Jun 1, 2016
  • Fast topic survey with associated word cloud visualization on completion.

    HTML 7 Updated May 13, 2016
  • Code and slides for supervised machine learning in R

    HTML 2 Updated May 4, 2016
  • Repository for Incubator 4 Team 2

    Python Updated Apr 3, 2016
  • Jupyter Notebook 2 1 Updated Mar 1, 2016
  • Examples for using the dedupe library

    Python 3 60 Updated Feb 21, 2016
  • Building a simple Python application - Calendar Application Tutorial

    Python 6 Updated Feb 12, 2016