notes for building the airflow on ec2 AWS Linux
Please kindly read the doc
Airflow 1.8, CeleryExecutor, Redis, Celery Flower
-
never use datetime.now, keep use fix datetime anytime
-
keep versioning for dag id and update the version number if there is any changes
-
keep in mind to set start_date as future date time
-
set
max_active_runs_per_dag = 1
-
find ways to sync the dags and config file of master and workers (s3/ git)
-
use
catchup_by_default = False
or keep in mind to use backfill
api plugin
https://github.com/teamclairvoyant/airflow-rest-api-plugin
https://deepumohan.com/tech/setting-up-apache-airflow-on-aws-ec2-instance/
https://stlong0521.github.io/20161023%20-%20Airflow.html
http://site.clairvoyantsoft.com/installing-and-configuring-apache-airflow/
http://site.clairvoyantsoft.com/wp-content/uploads/2016/12/GuideToApacheAirflow.pdf