The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
Nov 17, 2024 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
Official s3cmd repo -- Command line tool for managing S3 compatible storage services (including Amazon S3 and CloudFront).
Continuous Archiving for Postgres
Utils for streaming large files (S3, HDFS, gzip, bz2...)
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Example end to end data engineering project.
Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.
🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵
Packaged version of ultralytics/yolov5 + many extra features
🗂️ a pleasant file explorer in your terminal supporting all filesystems
Jupyter Notebooks in S3 - Jupyter Contents Manager implementation
Add a description, image, and links to the s3 topic page so that developers can more easily learn about it.
To associate your repository with the s3 topic, visit your repo's landing page and select "manage topics."