Skip to content

A simple and efficient tool to parallelize Pandas operations on all available CPUs

License

Notifications You must be signed in to change notification settings

jaysparkx/pandarallel

 
 

Repository files navigation

Pandaral·lel

PyPI version fury.io PyPI license PyPI download month

Without parallelization Without Pandarallel
With parallelization With Pandarallel

Pandaral.lel provides a simple way to parallelize your pandas operations on all your CPUs by changing only one line of code. It also displays progress bars.

Installation

pip install pandarallel [--upgrade] [--user]`

Quickstart

from pandarallel import pandarallel

pandarallel.initialize(progress_bar=True)

# df.apply(func)
df.parallel_apply(func)

Usage

Be sure to check out the documentation.

Examples

An example of each available pandas API is available:

About

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 82.3%
  • Jupyter Notebook 17.2%
  • Shell 0.5%