Currently we provide tutorial notebooks for the following execution backends:
To get required dependencies for PandasOnRay
, PandasOnDask
and PandasOnUnidist
Jupyter Notebooks
you should create a development environment with pip
using requirements.txt
file located in the respective directory:
pip install -r execution/pandas_on_ray/requirements.txt
to install dependencies needed to run notebooks with Modin on PandasOnRay
execution or
pip install -r execution/pandas_on_dask/requirements.txt
to install dependencies needed to run notebooks with Modin on PandasOnDask
execution or
pip install -r execution/pandas_on_unidist/requirements.txt
to install dependencies needed to run notebooks with Modin on PandasOnUnidist
execution.
Note: Sometimes pip is installing every version of a package. If you encounter that issue,
please install every package listed in requirements.txt
file individually with pip install <package>
.
To get required dependencies for HdkOnNative
Jupyter Notebooks
you should create a development environment with conda
using jupyter_hdk_env.yml
file located in the respective directory:
conda config --set channel_priority strict
conda env create -f execution/hdk_on_native/jupyter_hdk_env.yml
After the environment is created it needs to be activated:
conda activate jupyter_modin_on_hdk
Note: HDK
engine is available on Linux only for now.
A Jupyter Notebook server can be run from the current directory as follows:
jupyter notebook
Navigate to a concrete notebook (for example, to the execution/pandas_on_ray/local/exercise_1.ipynb
).
Note: Since there are some specifics regarding the run of jupyter notebooks with the Unidist
engine,
refer to PandasOnUnidist document
to get more information on the matter.