ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
-
Updated
Jun 3, 2024 - Python
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Product scraping from Walmart Canada website, with further cleaning and integration of data from a different store.
Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate
Enables custom tracing of Python applications in Dynatrace
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰
Squirrel dataset hub
Python script to extract all .csv/.txt files from a specific AWS S3 bucket & generate the .sql scripts to ingest the files into a AWS Redshift database.
Built real-time data streaming system using the Hadoop ecosystem, which will perform data extraction, data ingestion, data storage data retrieval, data transformation and data analysis in real time.
HHA507 / Data Science / Assignment 1
End-to-end data engineering processes for the NIGERIA Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow
Data Integration via Confluent Kafka
System to predict the outcome from soccer matches.
Infer SQL DDL statements from tabular data.
Project to generate fake chess data and perform ingest in AWS S3
a simple search, extractor and ingestion system for get the best sellers products of tech on the Amazon
Data ingestion from Google Sheet to BigQuery
Data pipeline using S3, Glue, Athena, Lambda and Quicksight to analyze dataset of YouTube
Data ingestion through SQL and API requests and exploratory analysis on data collected about Malaysia airports.
Add a description, image, and links to the data-ingestion topic page so that developers can more easily learn about it.
To associate your repository with the data-ingestion topic, visit your repo's landing page and select "manage topics."