Skip to content

Latest commit

 

History

History
66 lines (45 loc) · 2.42 KB

README.md

File metadata and controls

66 lines (45 loc) · 2.42 KB

Jupyter notebook examples to run with Modin

Currently we provide tutorial notebooks for the following execution backends:

Creating a development environment

To get required dependencies for PandasOnRay, PandasOnDask and PandasOnUnidist Jupyter Notebooks you should create a development environment with pip using requirements.txt file located in the respective directory:

pip install -r execution/pandas_on_ray/requirements.txt

to install dependencies needed to run notebooks with Modin on PandasOnRay execution or

pip install -r execution/pandas_on_dask/requirements.txt

to install dependencies needed to run notebooks with Modin on PandasOnDask execution or

pip install -r execution/pandas_on_unidist/requirements.txt

to install dependencies needed to run notebooks with Modin on PandasOnUnidist execution.

Note: Sometimes pip is installing every version of a package. If you encounter that issue, please install every package listed in requirements.txt file individually with pip install <package>.

To get required dependencies for HdkOnNative Jupyter Notebooks you should create a development environment with conda using jupyter_hdk_env.yml file located in the respective directory:

conda config --set channel_priority strict
conda env create -f execution/hdk_on_native/jupyter_hdk_env.yml

After the environment is created it needs to be activated:

conda activate jupyter_modin_on_hdk

Note: HDK engine is available on Linux only for now.

Run Jupyter Notebooks

A Jupyter Notebook server can be run from the current directory as follows:

jupyter notebook

Navigate to a concrete notebook (for example, to the execution/pandas_on_ray/local/exercise_1.ipynb).

Note: Since there are some specifics regarding the run of jupyter notebooks with the Unidist engine, refer to PandasOnUnidist document to get more information on the matter.