delta-convertor is a Rust library for converting parquet files from Data Lake in AWS S3 into Delta Lake.
delta-convertor reads parquet files from provided source, converts them to delta format and writes into delta table. Delta tables will be available in AWS Glue catalog and can be queried with AWS Athena.
Use the package manager cargo or docker to install delta-convertor.
- run delta-convertor with mode="init" for initialization empty delta table in AWS S3 (you will see only delta_log folder)
- run AWS Crawler with delta source option for creating delta table in AWS Glue catalog (need to do it only once)
- run delta-convertor with mode="append" for writing parquet files from Data Lake to Delta Lake