Sparky is composed of helper functions, classes, and methods that are useful for common routines done with pyspark.
Sparky provides:
- Functions for performing frequent data summarization and transformation tasks in pyspark data pipelines
- Classes and methods for common machine learning workflows using MLlib
- Classes and methods for pyspark system configuration
All Sparky wheels distributed on PyPI are BSD 3-clause licensed.