This section shows Modin usage examples in different scenarios like Modin on a local/remote cluster, the use of Modin spreadsheet.
The following tutorials cover the basic usage of Modin. Here is a one hour video tutorial that walks through these basic exercises.
- Exercise 1: Introduction to Modin [Source PandasOnRay, Source PandasOnDask, Source HdkOnNative]
- Exercise 2: Speed Improvements with Modin [Source PandasOnRay, Source PandasOnDask, Source HdkOnNative]
- Exercise 3: Defaulting to pandas with Modin [Source PandasOnRay, Source PandasOnDask, Source HdkOnNative]
The following tutorials covers more advanced features in Modin:
- Exercise 4: Experimental Features in Modin (Spreadsheet, Progress Bar) [Source PandasOnRay, Source PandasOnDask]
- Exercise 5: Setting up Modin in a Cluster Environment [Source PandasOnRay]
- Exercise 6: Running Modin in a Cluster Environment [Source PandasOnRay]
How to get required dependencies for the tutorial notebooks and to run them please refer to the respective README.md file.
- Using Modin with the NYC Taxi Dataset [Source]
- Using Modin with the Census Dataset (coming soon...)
- Using Modin with the Plasticc Dataset (coming soon...)
- Using Modin along with the Spreadsheets API [Source]
- Modin for Machine Learning with scikit-learn [Source]