@NVIDIA @conda-forge @dask @rapidsai
oh hey! 👋

I'm James, an engineer / data scientist from Chicago. My time on GitHub is mostly spent writing Python, R, and shell scripts on projects for data scientists and data engineers. My time off GitHub is spent with family, at hip hop shows, and watching reality TV.

:shipit: open source stuff I'm maintaining

  • LightGBM: a lightweight gradient boosting machine
  • lightgbm-dask-testing: containerized setup for testing LightGBM's Dask interface locally and on Amazon ECS
  • pkgnet: R package for analyzing an R package's dependencies
  • pydistcheck: linter that finds portability issues in Python package distributions (wheels and sdists)
  • uptasticsearch: an R data frame client for Elasticsearch

✋ some other open source stuff I've contributed to in the past

  • hamilton: a "micro-framework" for feature engineering in Python
  • prefect: a workflow management thing in Python that plays nicely with Dask
  • xgboost: another gradient boosting machine

😊 open source contributions I'm proud of


The pull requests and non-code contributions below were chosen to showcase the types of software work I've done. This list is not exhaustive.



Bug Fixes

Infrastructure / CI

💰 things I do for money

  • Sr. Software Engineer at NVIDIA, working on RAPIDS (
  • adjunct instructor at Marquette University, where I teach "Intro to R Programming"

💻 conference talks

I've given talks on Dask, LightGBM, R, Python packaging, and other random stuff. For a full list and links to videos, see

🎤 talk to me!

My DMs are open if you want to talk about open source, data science careers, Bravo shows, or anything else.


  1. microsoft/LightGBM (Public)

    A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

    C++, 16k stars, 3.8k forks

  2. talks (Public)

    Conference talks, meetup talks, and misc. writing

    CSS, 22 stars, 2 forks

  3. pydistcheck (Public)

    Linter that finds portability issues in Python package distributions (wheels, sdists, conda packages).

    Python, 27 stars, 1 fork

  4. lightgbm-dask-testing (Public)

    Test LightGBM's Dask integration on different cluster types

    Jupyter Notebook, 11 stars, 5 forks

  5. uptake/pkgnet (Public)

    R package for analyzing other R packages via graph representations of their dependencies

    R, 147 stars, 38 forks

  6. uptake/uptasticsearch (Public)

    An Elasticsearch client tailored to data science workflows.

    R, 48 stars, 37 forks