Hello, friends 👋

I'm Peter. Nice to meet ya.

📈 Data science + machine learning 📊

I largely help social sector organizations get their data into a shape where machine learning can be valuable. Much of this work ends up on DrivenData, where you can join a competition to help these organizations, learn from interesting data, try new methods, and make friends who care about impact. Here are some cool recent ones:

Competitions are great, but not every problem is a good fit, so our team of data scientists and software engineers also works with organizations directly to analyze data, build data systems, set up pipelines, train machine learning models, and design and deploy solutions. Check out DrivenData Labs to learn more. There I write case studies, publish on our blog, and maintain our open source work.

Open source 📦

You can find me working on open source Python tools for data scientists and engineers. I particularly care about reproducible data science and about machine learning and AI ethics.

See below for the projects I regularly contribute to!


  1. A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

    Python, 5k stars, 1.6k forks

  2. A command line tool to easily add an ethics checklist to your data science projects.

    Python, 212 stars, 40 forks

  3. Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.

    Python, 76 stars, 8 forks

  4. Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.

    Python, 37 stars, 6 forks

  5. A Python package for identifying 23 kinds of animals in camera trap videos.

    Python, 34 stars, 10 forks

  6. Use pathlib syntax to easily work with Pandas series containing file paths.

    Python, 27 stars, 3 forks

1,039 contributions in the last year


Contribution activity

September 2021

Created a pull request in drivendataorg/cloudpathlib that received 14 comments

Update _s3_file_query to check if an object exists explicitly.

In our current setup, the _is_file_or_dir check relies on _s3_file_query, which requires list permissions. When we pass no_sign_request, we want to…

+64 −42 lines changed, 14 comments

Opened 1 other pull request in 1 repository: drivendataorg/cloudpathlib (1 merged)

Opened 1 issue in 1 repository: drivendataorg/cloudpathlib (1 closed)

46 contributions in private repositories, Sep 1 – Sep 23