@DistrictDataLabs

District Data Labs

Data Science Research, Open Source Projects, and Content.

Pinned repositories

  1. yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 490 100

  2. partisan-discourse

    A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 13 5

  3. cultivar

    Multidimensional data explorer and visualization tool.

    HTML 35 22

  4. minimum-entropy

    Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 5 3

  5. PyCon2017

    Resources and materials related to PyCon 2017.

    HTML 4 4

  • Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 490 100 Apache-2.0 Updated Nov 13, 2017
  • An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 33 26 MIT Updated Oct 12, 2017
  • Resources for PyCon 2018.

    Updated Sep 22, 2017
  • Notebooks and code for "Visual Pipelines for Text Analysis" at the Data Intelligence Conference: June 23, 2017.

    Jupyter Notebook 2 1 Updated Jun 25, 2017
  • Multidimensional data explorer and visualization tool.

    HTML 35 22 Apache-2.0 Updated May 23, 2017
  • Resources and materials related to PyCon 2017.

    HTML 4 4 Updated May 23, 2017
  • Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.

    Jupyter Notebook 43 47 MIT Updated May 6, 2017
  • Teaching materials for web scraping class

    Jupyter Notebook 4 5 Updated Mar 31, 2017
  • Finding how common the strangers in your life are (reword)

    Python 3 1 Apache-2.0 Updated Feb 15, 2017
  • Notebooks and materials for DDL/CEB training.

    Jupyter Notebook 5 6 MIT Updated Feb 9, 2017
  • Code & Data for Introduction to Machine Learning with Scikit-Learn

    Jupyter Notebook 61 77 MIT Updated Jan 10, 2017
  • A machine learning approach to recording and analyzing the 2016 election.

    Jupyter Notebook 1 Updated Dec 17, 2016
  • Private repo for PPM Data team.

    Jupyter Notebook 1 1 Apache-2.0 Updated Oct 12, 2016
  • A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 13 5 Apache-2.0 Updated Sep 14, 2016
  • Code and Notebooks for the Natural Language Processing with Python course.

    Jupyter Notebook 39 40 MIT Updated Sep 11, 2016
  • Data and code for "Fast Data Applications with Spark and Python"

    Python 25 24 MIT Updated Sep 11, 2016
  • Graph extraction and NLP analysis for Baleen Corpora

    Python 7 5 MIT Updated Sep 8, 2016
  • Public code files for the DDL blog

    Python 44 35 Updated Jul 26, 2016
  • Code bases, tutorials, posters, and other content for PyCon2016.

    JavaScript 26 25 MIT Updated Jul 22, 2016
  • Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 5 3 Apache-2.0 Updated Jul 11, 2016
  • Repository for Incubator 4 Team 3

    Jupyter Notebook Apache-2.0 Updated Jul 7, 2016
  • Repository for Incubator 4 Team 5

    Jupyter Notebook 1 Apache-2.0 Updated Jun 24, 2016
  • Repository for Incubator 4 Team 1

    CSS Apache-2.0 Updated Jun 24, 2016
  • A simple web application for activity tracking and event aggregation.

    Python Apache-2.0 Updated Jun 23, 2016
  • Repository for Incubator 4 Team 4

    CoffeeScript 1 Apache-2.0 Updated Jun 1, 2016
  • Fast topic survey with associated word cloud visualization on completion.

    HTML 7 1 MIT Updated May 13, 2016
  • Code and slides for supervised machine learning in R

    HTML 2 Updated May 4, 2016
  • Repository for Incubator 4 Team 2

    Python Apache-2.0 Updated Apr 3, 2016
  • Jupyter Notebook 2 1 Updated Mar 1, 2016
  • Examples for using the dedupe library

    Python 3 65 Updated Feb 22, 2016