A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset
-
Updated
Nov 9, 2024 - Dockerfile
A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset
The Zoomcamp MLOps Course covers tools like MLflow, Mage, Flask, Prometheus, Evidently, Grafana, Prefect, Terraform, and GitHub Actions. It emphasizes experiment tracking, model deployment, monitoring, CI/CD, and orchestration, culminating in an end-to-end project integrating best practices in MLOps.
The CNPJ Data ETL Pipeline is designed to automate the download, processing, and storage of public CNPJ data from the Brazilian Federal Revenue. The pipeline is built with Mage.ai and AWS S3 to ensure efficient data management and scalability.
An end-to-end data engineering pipeline project that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.
# 🇧🇷 CNPJ Data PipelineUm script modular e configurável para processar arquivos CNPJ da Receita Federal do Brasil. 🐙 Este projeto oferece suporte a múltiplos bancos de dados e permite o processamento inteligente de mais de 50 milhões de empresas.
End-to-end data engineering project
A data engineering project built around Smogon's Stats API.
Data modeling and ETL pipeline for data analytics on Uber dataset using Google cloud storage, BigQuery, and Looker Studio
Solutions for @DataTalksClub's Data Engineering Zoomcamp 2024.
An end to end Data Engineering Project
Docker image for Mage AI deployment using Docker
A full data pipeline project. from the ETL to the Dashboard
Data Engineering Pipeline for Uber Data Analysis using Google Cloud Platform, Mage-AI and Looker
This repository contains the research, code, and examples related to the orchestration of data pipelines using a microservices architecture. The project explores the challenges of constructing modern data-centric pipelines and evaluates the role of orchestration tools in Data Science workflows.
In this project, I built a data pipeline using Mage.ai for ETL, GCP for storage, BigQuery for querying, and Looker Studio for analytics. This project helped me learn how to process, store, and visualize data effectively using modern tools.
An end-to-end data engineering project using Amazon S3, EC2, mage.io, Google BigQuery and Looker.
Personal project for Data Engineering Zoomcamp
Add a description, image, and links to the mage-ai topic page so that developers can more easily learn about it.
To associate your repository with the mage-ai topic, visit your repo's landing page and select "manage topics."