The goal of this project is to help beginners who want to enter the data field by providing a data-driven view of the skills and knowledge most in demand in the market. By collecting and analyzing job and internship postings, the project aims to answer the question: "How do you become a data professional?"
An end-to-end pipeline that ingests raw data from CSV files through Airflow DAGs into BigQuery. From there, it uses dbt to normalize and clean the data, and then applies the transformations that produce the relevant reports.
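As a rough illustration of this pattern, here is a minimal sketch of such an ingestion DAG, assuming the Airflow Google provider is installed; the bucket, dataset, table, and dbt project paths are hypothetical placeholders, not names from the project itself.

```python
# Sketch: load CSVs from GCS into BigQuery, then hand off to dbt.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator
from airflow.providers.google.cloud.transfers.gcs_to_bigquery import (
    GCSToBigQueryOperator,
)

with DAG(
    dag_id="csv_to_bigquery",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    # Load the raw CSV files from a GCS bucket into a BigQuery staging table.
    load_csv = GCSToBigQueryOperator(
        task_id="load_csv",
        bucket="raw-data-bucket",                 # hypothetical bucket
        source_objects=["exports/*.csv"],         # hypothetical object path
        destination_project_dataset_table="analytics.staging.raw_events",
        source_format="CSV",
        skip_leading_rows=1,
        write_disposition="WRITE_TRUNCATE",
        autodetect=True,
    )

    # Hand off to dbt for normalization, cleaning, and the reporting models.
    run_dbt = BashOperator(
        task_id="run_dbt",
        bash_command="dbt run --project-dir /opt/dbt_project",  # hypothetical path
    )

    load_csv >> run_dbt
```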
EEA Crawler contains the tasks (DAGs) used by Apache Airflow to index content from various EEA-Eionet websites into a central Elasticsearch instance (the "content hub").
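A simplified sketch of a crawl-and-index task of this kind is below; this is not the actual EEA Crawler code, and the Elasticsearch address, site URL, and index name are placeholders.

```python
# Sketch: fetch a page and index its body into Elasticsearch from an Airflow task.
from datetime import datetime

import requests
from airflow import DAG
from airflow.operators.python import PythonOperator
from elasticsearch import Elasticsearch


def index_site(url: str, index: str) -> None:
    """Fetch a page and store its raw body in Elasticsearch."""
    es = Elasticsearch("http://localhost:9200")  # hypothetical content-hub address
    response = requests.get(url, timeout=30)
    response.raise_for_status()
    es.index(index=index, document={"url": url, "body": response.text})


with DAG(
    dag_id="crawl_site",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    crawl = PythonOperator(
        task_id="crawl_site",
        python_callable=index_site,
        op_kwargs={"url": "https://www.eea.europa.eu", "index": "content-hub"},
    )
```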
This project showcases an ELT pipeline that extracts JSON data, loads it into a PostgreSQL database, applies transformations using Python scripts, saves the transformed data to a CSV file, and serves it through a FastAPI endpoint.
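The final sharing step might look like the following minimal sketch, assuming the transformed CSV already exists on disk; the output path and route name are hypothetical.

```python
# Sketch: expose the transformed CSV through a FastAPI endpoint.
from fastapi import FastAPI
from fastapi.responses import FileResponse

app = FastAPI()


@app.get("/transformed-data")
def get_transformed_data() -> FileResponse:
    """Serve the CSV produced by the transformation scripts."""
    return FileResponse(
        "output/transformed.csv",   # hypothetical output path
        media_type="text/csv",
        filename="transformed.csv",
    )
```

Run it with `uvicorn main:app` and download the file from `/transformed-data`.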
A Python script extracts data from Zillow and stores it in an initial S3 bucket. Then, Lambda functions handle the flow: copying the data to a processing bucket and transforming it from JSON to CSV format. The final CSV data resides in another S3 bucket, ready to be loaded into Amazon Redshift for in-depth analysis, with QuickSight used for visualizations.
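For illustration, here is a sketch of the JSON-to-CSV Lambda step, assuming the records arrive as a JSON array of flat objects; the destination bucket name is a placeholder, and the source bucket and key come from the S3 event.

```python
# Sketch: Lambda triggered by an S3 put event; converts a JSON object to CSV.
import csv
import io
import json

import boto3

s3 = boto3.client("s3")

TARGET_BUCKET = "zillow-csv-bucket"  # hypothetical destination bucket


def handler(event, context):
    """Read JSON from the processing bucket, write CSV to the final bucket."""
    record = event["Records"][0]["s3"]
    source_bucket = record["bucket"]["name"]
    key = record["object"]["key"]

    # Read the JSON object produced by the extraction script.
    body = s3.get_object(Bucket=source_bucket, Key=key)["Body"].read()
    rows = json.loads(body)

    # Write the rows out as CSV, using the first record's keys as the header.
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=rows[0].keys())
    writer.writeheader()
    writer.writerows(rows)

    # Land the CSV in the final bucket, ready for a Redshift COPY.
    target_key = key.rsplit(".", 1)[0] + ".csv"
    s3.put_object(Bucket=TARGET_BUCKET, Key=target_key, Body=buffer.getvalue())
```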
This project demonstrates how to build an ELT pipeline using dbt, Snowflake, and Airflow. Follow the steps below to set up your environment, configure dbt, create models, macros, and tests, and deploy on Airflow.
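The deployment step could be as simple as the following sketch, assuming dbt is installed in the Airflow environment and a Snowflake profile is already configured; the project and profiles directories are hypothetical.

```python
# Sketch: orchestrate dbt against Snowflake from Airflow via shell commands.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="dbt_snowflake",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    # Build the models, then run the tests defined in the dbt project.
    dbt_run = BashOperator(
        task_id="dbt_run",
        bash_command="dbt run --project-dir /opt/dbt --profiles-dir /opt/dbt",
    )
    dbt_test = BashOperator(
        task_id="dbt_test",
        bash_command="dbt test --project-dir /opt/dbt --profiles-dir /opt/dbt",
    )

    dbt_run >> dbt_test
```

Sequencing `dbt test` after `dbt run` keeps failed data-quality checks visible in the Airflow UI rather than buried in dbt logs.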