There are a couple of ways to start using Pandas on Ray. Most users will want to install with pip
, but some users may want to build from the master branch on the GitHub repo.
Modin can be installed with pip.
pip install modin
Currently, Modin depends on pandas version 0.22. The API of pandas has a tendency to change some with each release, so we pin our current version to the most recent version to take advantage of the newest additions. This also typically means better performance and more correct code.
Modin also depends on Ray. Ray is a task-parallel execution framework for parallelizing new and existing applications with minor code changes. Currently, we depend on the most recent Ray release: 0.5.0.
To build from source, you first must clone the repo:
git clone https://github.com/modin-project/modin.git
Once cloned, cd
into the modin
directory and use pip
to install:
cd modin
pip install -e .