



oh hey! 👋

I'm James, a data scientist / engineer from Chicago. My time on GitHub is mostly spent writing Python, R, and disgusting shell scripts on projects for data scientists and data engineers. My time off GitHub is spent at hip hop shows and watching reality TV.

:shipit: open source stuff I'm maintaining

  • doppel-cli: a command-line tool for checking whether an R library and a Python library have the same interface
  • LightGBM: a lightweight gradient boosting machine
  • pkgnet: R package for analyzing an R package's dependencies
  • prefect-saturn: Python client for running Prefect flows on a Saturn Cloud Dask cluster
  • uptasticsearch: an R data frame client for Elasticsearch

open source stuff I've been making little contributions on

  • prefect: a workflow management thing in Python that plays nicely with Dask
  • xgboost: another gradient boosting machine

💰 things I do for money

  • software engineer at Saturn Cloud, where we're building "Databricks for Dask"
  • adjunct instructor at Marquette University, where I teach "Intro to R Programming"

💻 conference talks

I've given talks on Dask, LightGBM, R, and other random stuff. For a full list and links to videos, see

🎤 talk to me!

My DMs are open if you want to talk about open source, data science careers, Jersey Shore, or anything else.


  1. A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

    C++ 12.2k 3.2k

  2. Test framework for comparing the consistency of library APIs

    Python 8 10

  3. An Elasticsearch client tailored to data science workflows.

    R 47 43

  4. R package for analyzing other R packages via graph representations of their dependencies

    R 106 37

  5. Make a report on one or more users' open source contributions

    JavaScript 3 2

  6. A repo to make some common tools and algorithms from data science pipelines available as EViews add-ins.

    xBase 1

1,944 contributions in the last year


Contribution activity

March 2021

Created 2 commits in 1 repository

Created a pull request in sqlalchemy/alembic that received 3 comments

[doc] clarify ALTER language in batch docs

Description Clarifies language about ALTER in the batch migrations docs. I think the double use of "upon" in this phrase is not quite correct, and …

+1 −2 3 comments
