-
Updated
Aug 9, 2017 - Jupyter Notebook
dataingestion
Here are 15 public repositories matching this topic...
An application of my Centipede framework to watch 4Chan for any potentially threatening behavior.
-
Updated
Dec 8, 2019 - Python
Proof of concept using localstack as a mock AWS (cloud) to build a basic data ingestion infra using Terraform
-
Updated
Jan 27, 2021 - HCL
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering life…
-
Updated
Oct 6, 2021
-
Updated
Dec 12, 2021 - Python
White and Red Wine classification using logistic regression
-
Updated
Dec 24, 2021 - HTML
course website for data science tools 1
-
Updated
Dec 8, 2022 - Jupyter Notebook
This repository for a project detailing the step by step approach of scraping data, integrating data from various sources, performing analysis on data from various sources for the purpose of analaysis. It also shows how APIs can be harnessed for data engr operations. In this project, the four square API was utilized for the location data.
-
Updated
Feb 21, 2023 - Jupyter Notebook
The main purpose of this repository is to build the pipeline for training of regression models and predict the compressive strength of concrete to reduce the risk and cost involved in discarding the concrete structures when the concrete cube test fails.
-
Updated
Feb 27, 2023 - Python
O projeto consiste em desenvolver uma solução para a migração de dados de uma fonte com muitos arquivos para uma base de dados hospedada em ambiente Cloud.
-
Updated
Jul 25, 2023 - TSQL
This repo hosts an end-to-end machine learning project designed to cover the full lifecycle of a data science initiative. The project encompasses a comprehensive approach including data Ingestion, preprocessing, exploratory data analysis (EDA), feature engineering, model training and evaluation, hyperparameter tuning, and cloud deployment.
-
Updated
Feb 28, 2024 - Jupyter Notebook
-
Updated
Feb 28, 2024 - Jupyter Notebook
Resource for ETL & Data Ingestion program using Apache Airflow
-
Updated
Mar 7, 2024 - Python
In this project we are going to create an end-to-end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.
-
Updated
May 30, 2024 - Jupyter Notebook
Export sales data from Google Sheet to a relational DBSM
-
Updated
Jun 17, 2024 - Python
Improve this page
Add a description, image, and links to the dataingestion topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dataingestion topic, visit your repo's landing page and select "manage topics."