Signed-off-by: Alexey Prutskov <alexey.prutskov@intel.com>
Showing 1 changed file with 19 additions and 4 deletions.
@@ -1,8 +1,23 @@
pandas on Dask
==============

The Dask engine and documentation could use your help! Consider opening a
`pull request`_ or an issue_ to contribute or ask clarifying questions.

This section contains usage documentation for the pandas on Dask component of Modin.

.. _pull request: https://github.com/modin-project/modin/pulls
.. _issue: https://github.com/modin-project/modin/issues

Modin uses pandas as the primary in-memory format of the underlying partitions and optimizes queries
ingested from the API layer specifically for this format. There is therefore no need to choose it yourself,
but you can specify it explicitly as shown below.

Dask is one of the execution engines that Modin supports. To enable pandas on Dask execution, set the following environment variables:

.. code-block:: bash

   export MODIN_ENGINE=dask
   export MODIN_STORAGE_FORMAT=pandas

or set them in source code:

.. code-block:: python

   import modin.config as cfg

   # Select Dask as the execution engine and pandas as the storage format.
   cfg.Engine.put('dask')
   cfg.StorageFormat.put('pandas')
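The same two settings can also be supplied from Python through ``os.environ``, provided this happens before Modin is imported. The sketch below is not part of the commit. It assumes Modin and Dask are installed (for example via ``pip install "modin[dask]"``), and ``column_total`` is a hypothetical helper introduced only for illustration.

```python
import os

# Python equivalent of the two `export` lines; set before Modin is imported.
os.environ["MODIN_ENGINE"] = "dask"
os.environ["MODIN_STORAGE_FORMAT"] = "pandas"


def column_total():
    """Build a tiny frame on the configured execution and sum one column."""
    # Imported lazily so the environment variables above are already in place.
    # Assumption: Modin with Dask support is installed in this environment.
    import modin.pandas as pd

    df = pd.DataFrame({"x": [1, 2, 3]})
    return int(df["x"].sum())
```

Calling ``column_total()`` triggers the first real Modin operation, which is when the Dask engine is initialized. On platforms that spawn worker processes, it may be necessary to make that call under an ``if __name__ == "__main__":`` guard.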