Add dag_run to the task_instance table or create new taskuuid column and use this to uniquely identify a task #7968

@dimberman

Description

Apache Airflow version: None given

Ticket was created 21/Nov/16 18:35

Kubernetes version (if you are using kubernetes) (use kubectl version):

Environment:

  • Cloud provider or hardware configuration:
  • OS (e.g. from /etc/os-release):
  • Kernel (e.g. uname -a):
  • Install tools:
  • Others:

What happened:

We are planning to run around 40,000 tasks a day using Airflow, and some of them are critical for giving developers quick feedback.

Using the execution date to uniquely identify tasks does not work for us. Because we mainly trigger DAGs (instead of running them on a schedule), runs triggered within the same second collide on the execution date's 1-second granularity, and this has happened on several occasions. Adding a task UUID, or associating dag_run with the task_instance table, and using that for scheduling and status updates would help us here.
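
To illustrate the collision, here is a minimal sketch in plain Python. It does not use Airflow's actual models or schema: the helper names (`second_granularity_key`, `uuid_key`) and the DAG and task ids are hypothetical, and the only assumption taken from the report is that the identifying date has 1-second granularity.

```python
import uuid
from datetime import datetime, timedelta


def second_granularity_key(dag_id, task_id, triggered_at):
    """Hypothetical key: (dag_id, task_id, execution date truncated to whole seconds)."""
    return (dag_id, task_id, triggered_at.replace(microsecond=0))


def uuid_key(dag_id, task_id):
    """Hypothetical alternative key: a random UUID instead of the date."""
    return (dag_id, task_id, uuid.uuid4().hex)


# Two triggers of the same DAG 300 ms apart, inside the same wall-clock second.
first = datetime(2016, 11, 21, 18, 35, 0, 100000)
second = first + timedelta(milliseconds=300)

by_date = {
    second_granularity_key("build_feedback", "run_tests", first),
    second_granularity_key("build_feedback", "run_tests", second),
}
by_uuid = {
    uuid_key("build_feedback", "run_tests"),
    uuid_key("build_feedback", "run_tests"),
}

print(len(by_date))  # both triggers map to the same key
print(len(by_uuid))  # each trigger gets its own key
```

With the date-based key, the second trigger is indistinguishable from the first, which is the collision described above. A per-run identifier keeps the two apart regardless of how close together they are triggered.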

What you expected to happen:

How to reproduce it:

Anything else we need to know:

Moved here from https://issues.apache.org/jira/browse/AIRFLOW-642
