Repositório para armazenar códigos do projeto.
-
Updated
Dec 2, 2021 - Python
Repositório para armazenar códigos do projeto.
How to combine smart store and ingest action for datalake use case
This project is about building a data lake and creating an ETL pipeline in Spark that loads data from Amazon S3, processes the data into analytics tables, and loads them back into S3
Datalake on AW
An insanely customizable framework for key-value storage 💾
Collection of data on Formula One Racing
Application to ingest data into DB from API
Use of Spark to get data from S3 then wrangle it to make available back in S3 with a better schema
Coleta, armazenamento e análise de dados históricos das distribuições de bolsas de estudos do CAPES.
Sample data store project to be hosted on a remote server or cluster. CICD using GitHub actions for SSH Deploy to remote server for docker compose.
Load data from S3, process the data into analytics tables using Spark and load them back into S3. Deployed this Spark process on a cluster using AWS EMR
Add a description, image, and links to the datalake topic page so that developers can more easily learn about it.
To associate your repository with the datalake topic, visit your repo's landing page and select "manage topics."