Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Updated Oct 31, 2024 - Python
Workflow Engine for Kubernetes
ETL pipeline for Amazon product sales data, using Apache Airflow for data orchestration and Supabase for storage. Containerizing the environment with Docker makes the setup scalable and easily deployable, supporting data-driven decision-making.
A Configuration System for Airflow
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
Tasks and projects completed during the Machine Learning Engineer course by karpov.courses and AI Talent Hub (ITMO University)
Apache DolphinScheduler is the modern data orchestration platform for agile, low-code creation of high-performance workflows
The goal of this project is to help train beginners who aim to enter the data field by providing a data-driven view of the skills and knowledge most in demand in the market. By collecting and analyzing job and internship postings, the project aims to answer the question: "How do you become a data professional?"
AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR
Production Grade Terraform for Provisioning Infrastructure
This project was carried out to learn ETL techniques, good development practices, and business intelligence.
Elyra extends JupyterLab with an AI-centric approach.
Python (Scrapy, asyncio), Apache Airflow, GCP (Storage Bucket, Functions, BigQuery, VM), dbt, Terraform, Docker