@swcarpentry @EpistasisLab

Pinned repositories

  1. Data-Analysis-and-Machine-Learning-Projects

    Repository of teaching materials, code, and data for my data analysis and machine learning projects.

    Jupyter Notebook, 2.8k stars, 662 forks

  2. tpot

    A Python tool that automatically creates and optimizes machine learning pipelines using genetic programming.

    Python, 1.8k stars, 257 forks

  3. datacleaner

    A Python tool that automatically cleans data sets and readies them for analysis.

    Python, 480 stars, 60 forks

  4. reddit-analysis

    A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.

    Python, 199 stars, 46 forks

  5. optimal-roadtrip-usa

    Contains maps for the article "Computing the optimal road trip across the U.S." and similar articles.

    HTML, 166 stars, 56 forks

  6. sklearn-benchmarks

    A centralized repository to report scikit-learn model performance across a variety of parameter settings and data sets.

    Jupyter Notebook, 66 stars, 17 forks

1,124 contributions in the last year

Contribution activity

January 2017

Created a pull request in EpistasisLab/scikit-mdr that received 1 comment

Add MDR utilities module and improve MDR model code

Created an issue in scikit-learn/scikit-learn that received 4 comments

What does cross_val_score do for nested parallelization?

I have been developing a new sklearn-compatible estimator that uses joblib to parallelize the algorithm. However, I run into issues when I set n_jobs
