ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
-
Updated
Jun 3, 2024 - Python
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰
Squirrel dataset hub
Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate
Enables custom tracing of Python applications in Dynatrace
Product scraping from Walmart Canada website, with further cleaning and integration of data from a different store.
End-to-end data engineering processes for the NIGERIA Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow
Infer SQL DDL statements from tabular data.
Python script to extract all .csv/.txt files from a specific AWS S3 bucket & generate the .sql scripts to ingest the files into a AWS Redshift database.
Built real-time data streaming system using the Hadoop ecosystem, which will perform data extraction, data ingestion, data storage data retrieval, data transformation and data analysis in real time.
Data ingestion from Google Sheet to BigQuery
This Repository contains the contents related to Data Engineering Using AWS
Data Integration via Confluent Kafka
A highly flexible and versatile service integration framework.
Data pipeline using S3, Glue, Athena, Lambda and Quicksight to analyze dataset of YouTube
HHA507 / Data Science / Assignment 1
System to predict the outcome from soccer matches.
Add a description, image, and links to the data-ingestion topic page so that developers can more easily learn about it.
To associate your repository with the data-ingestion topic, visit your repo's landing page and select "manage topics."