Skip to content

daniel-chang-github/Airflow-Mini-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Airflow-Mini-Project

In this project, you’ll use Apache Airflow to create a data pipeline to extract online stock market data and deliver analytical results. You’ll use Yahoo Finance as the data source. Yahoo Finance provides intra-day market price details down a one-minute interval.

The learning objects of this mini project are:

  • Use text processing techniques in Python to make sense of logs
  • Learn where logs are located in Airflow
  • Learn how to monitor automated Airflow DAGs to ensure they are working properly

Docker

The DAGs are located in mnt/airflow/dags directory

Then execute ./start.sh script. This should build and start all the services. Execute docker-compose ps and you should see below. image Go to localhost:8080 for the airflow UI. The userid and password is airflow. After completing you can use ./stop.sh to stop the services

Graph View

image

Tree View

image

Log of the last task

An example of successful execution

image

Log analyzer

Log analyzer will collect all ERRORs from logs and display them all. It will show the total number of errors, which log the error message is from, and the actual error message.

image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published