Spooq is your PySpark based helper library for ETL data ingestion pipeline in Data Lakes.
Extractors, Transformers, and Loaders are independent components which can be plugged-in into a pipeline instance or used separately.
changelog installation setup_development_testing examples extractor/overview transformer/overview loader/overview pipeline/overview base_classes/overview architecture
modindex
search