@DistrictDataLabs

District Data Labs

Data Science Research, Open Source Projects, and Content.

Pinned repositories

  1. yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 232 69

  2. baleen

    Forked from bbengfort/baleen

    An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 8 1

  3. partisan-discourse

    A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 4 1

  • Visual analysis and diagnostic tools to facilitate machine learning model selection.

    Python 232 69 Updated Apr 25, 2017
  • Multidimensional data explorer and visualization tool.

    HTML 28 21 Updated Apr 6, 2017
  • Teaching materials for web scraping class

    Jupyter Notebook 3 1 Updated Mar 31, 2017
  • Finding how common the strangers in your life are

    Python Updated Feb 14, 2017
  • Notebooks and materials for DDL/CEB training.

    Jupyter Notebook 4 5 Updated Feb 9, 2017
  • Code & Data for Introduction to Machine Learning with Scikit-Learn

    Jupyter Notebook 53 60 Updated Jan 10, 2017
  • Resources and materials related to PyCon 2017.

    1 Updated Dec 20, 2016
  • A machine learning approach to recording and analyzing the 2016 election.

    Jupyter Notebook Updated Dec 16, 2016
  • Private repo for PPM Data team.

    Jupyter Notebook 1 Updated Oct 12, 2016
  • A web application that identifies party in political discourse and an example of operationalized machine learning.

    Python 4 1 Updated Sep 14, 2016
  • Code and Notebooks for the Natural Language Processing with Python course.

    Jupyter Notebook 31 32 Updated Sep 11, 2016
  • Data and code for "Fast Data Applications with Spark and Python"

    Python 21 24 Updated Sep 11, 2016
  • Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.

    Jupyter Notebook 34 41 Updated Sep 9, 2016
  • Public code files for the DDL blog

    Python 42 31 Updated Jul 26, 2016
  • Code bases, tutorials, posters, and other content for PyCon 2016.

    JavaScript 19 16 Updated Jul 22, 2016
  • Minimum Entropy is a DDL-hosted question-and-answer site for beginners who need answers to data science questions.

    Python 3 1 Updated Jul 11, 2016
  • Repository for Incubator 4 Team 3

    Jupyter Notebook Updated Jul 7, 2016
  • Repository for Incubator 4 Team 5

    Jupyter Notebook 1 Updated Jun 24, 2016
  • Repository for Incubator 4 Team 1

    CSS Updated Jun 24, 2016
  • A simple web application for activity tracking and event aggregation.

    Python Updated Jun 23, 2016
  • Repository for Incubator 4 Team 4

    CoffeeScript 1 Updated Jun 1, 2016
  • Fast topic survey with associated word cloud visualization on completion.

    HTML 7 Updated May 13, 2016
  • Code and slides for supervised machine learning in R

    HTML 2 Updated May 4, 2016
  • Repository for Incubator 4 Team 2

    Python Updated Apr 3, 2016
  • An automated ingestion service for blogs to construct a corpus for NLP research.

    Python 8 21 Updated Mar 8, 2016
  • Jupyter Notebook 1 1 Updated Mar 1, 2016
  • Examples for using the dedupe library

    Python 3 51 Updated Feb 22, 2016
  • Building a simple Python application - Calendar Application Tutorial

    Python 6 Updated Feb 12, 2016
  • Tutorial on how to ingest data from Twitter into MongoDB

    Python 17 Updated Feb 12, 2016
  • Jupyter Notebook 1 9 Updated Feb 9, 2016