Skip to content
#

amazon-s3

Here are 115 public repositories matching this topic...

document-processing-pipeline-for-regulated-industries

Multi-cloud infrastructure inventory and management tool, supporting AWS, Google Cloud, Azure, Oracle Cloud, Rackspace Cloud, Hetzner Cloud, Alibaba Cloud, e24cloud.com, Linode, Cloudflare, GoDaddy and Backblaze B2.

  • Updated Mar 22, 2024
  • Python

ACK is an E(T)L tool specialized in API data ingestion. It is accessible through a Command-Line Interface. The application allows you to easily extract, stream and load data (with minimum transformations), from the API source to the destination of your choice.

  • Updated Oct 3, 2023
  • Python

An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS application using Apache Airflow as an orchestration tool and various data warehouse technologies and finally using Apache Superset to connect to DWH for generating BI dashboards for weekly reports

  • Updated Dec 7, 2022
  • Python

Improve this page

Add a description, image, and links to the amazon-s3 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the amazon-s3 topic, visit your repo's landing page and select "manage topics."

Learn more