@DistrictDataLabs

District Data Labs

Data Science Research, Open Source Projects, and Content.

Pinned repositories

  1. yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 1.4k 212

  2. partisan-discourse

    A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 19 5

  3. cultivar

    Multidimensional data explorer and visualization tool.

    HTML 39 21

  4. minimum-entropy

    Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 7 3

  5. PyCon2017

    Resources and materials related to PyCon 2017.

    HTML 5 5

  • Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 1,434 212 Apache-2.0 Updated Jul 21, 2018
  • The user interface for the sage application.

    Python GPL-3.0 Updated Jul 19, 2018
  • Data Science and Big Data Overview Training

    MIT Updated Jul 19, 2018
  • Teaching materials for the text analytics course

    Jupyter Notebook 6 2 MIT Updated Jul 16, 2018
  • Teaching materials for the SQL course

    MIT Updated Jul 12, 2018
  • DDL Intro to Python iPython Notebook

    Jupyter Notebook 8 18 Apache-2.0 Updated Jul 9, 2018
  • An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 48 29 MIT Updated Jun 23, 2018
  • Public code files for the DDL blog

    Python 46 36 Updated Jun 6, 2018
  • Turkish translation of Yellowbrick documentation

    Python Apache-2.0 Updated May 7, 2018
  • Chinese translation of Yellowbrick documentation

    Python 4 3 Apache-2.0 Updated Mar 31, 2018
  • Code and Notebooks for the Natural Language Processing with Python course.

    Jupyter Notebook 48 47 MIT Updated Dec 3, 2017
  • resources for pycon 2018

    Updated Sep 21, 2017
  • Notebooks and code for "Visual Pipelines for Text Analysis" at the Data Intelligence Conference: June 23, 2017.

    Jupyter Notebook 4 Updated Jun 25, 2017
  • Multidimensional data explorer and visualization tool.

    HTML 39 21 Apache-2.0 Updated May 23, 2017
  • Resources and materials related to PyCon 2017.

    HTML 5 4 Updated May 23, 2017
  • Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.

    Jupyter Notebook 54 45 MIT Updated May 6, 2017
  • Teaching materials for web scraping class

    Jupyter Notebook 5 3 Updated Mar 31, 2017
  • Finding how common the strangers in your life are (reword)

    Python 5 1 Apache-2.0 Updated Feb 15, 2017
  • Notebooks and materials for DDL/CEB training.

    Jupyter Notebook 6 5 MIT Updated Feb 9, 2017
  • Code & Data for Introduction to Machine Learning with Scikit-Learn

    Jupyter Notebook 70 87 MIT Updated Jan 10, 2017
  • A machine learning approach to recording and analyzing the 2016 election.

    Jupyter Notebook 1 Updated Dec 16, 2016
  • Private repo for PPM Data team.

    Jupyter Notebook 1 1 Apache-2.0 Updated Oct 12, 2016
  • A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 19 5 Apache-2.0 Updated Sep 14, 2016
  • Data and code for "Fast Data Applications with Spark and Python"

    Python 27 20 MIT Updated Sep 11, 2016
  • Graph extraction and NLP analysis for Baleen Corpora

    Python 9 6 MIT Updated Sep 8, 2016
  • Code bases, tutorials, posters, and other content for PyCon2016.

    JavaScript 35 35 MIT Updated Jul 22, 2016
  • Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.

    Python 7 3 Apache-2.0 Updated Jul 11, 2016
  • Repository for Incubator 4 Team 3

    Jupyter Notebook Apache-2.0 Updated Jul 7, 2016
  • Repository for Incubator 4 Team 5

    Jupyter Notebook 1 Apache-2.0 Updated Jun 24, 2016
  • Repository for Incubator 4 Team 1

    CSS Apache-2.0 Updated Jun 24, 2016