How to efficiently generate thousands of KubernetesPodOperator tasks #16565
Unanswered
talna-evogene
asked this question in
Q&A
Replies: 1 comment 4 replies
-
Well, I am not sure Airflow is the limitation here. Are you sure you have 1 TB of memory in total and 1000 CPUs to run 1000 concurrent KubernetesPodOperators? Have you checked that your cluster is sized to run that much workload?
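As a rough sketch of that capacity check: the figures above work out to about 1 CPU and 1 GB of memory per pod. Those per-pod numbers are an assumption implied by the totals, not something stated in the question, and the function names are purely illustrative.

```python
# Back-of-the-envelope capacity check for concurrent pods.
# Per-pod requests (1 CPU, 1 GB) are ASSUMED for illustration only.

def required_capacity(concurrent_pods, cpu_per_pod, mem_gb_per_pod):
    """Total CPU cores and memory (GB) needed to run all pods at once."""
    return {
        "cpu_cores": concurrent_pods * cpu_per_pod,
        "memory_gb": concurrent_pods * mem_gb_per_pod,
    }


def max_concurrent_pods(cluster_cpu, cluster_mem_gb, cpu_per_pod, mem_gb_per_pod):
    """How many pods fit, limited by whichever resource runs out first."""
    by_cpu = cluster_cpu // cpu_per_pod
    by_mem = cluster_mem_gb // mem_gb_per_pod
    return int(min(by_cpu, by_mem))


if __name__ == "__main__":
    # 1000 pods at 1 CPU / 1 GB each need 1000 cores and 1000 GB (about 1 TB).
    print(required_capacity(1000, 1, 1))
    # A hypothetical 64-core, 256 GB cluster fits far fewer pods at once.
    print(max_concurrent_pods(64, 256, 1, 1))
```

If the second number is well below the configured parallelism, the extra pods would wait for cluster resources however Airflow is tuned.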
4 replies
-
Hi everyone,
I have a DAG containing a task that should run 8000 times with different parameters.
Currently this DAG makes the webserver hang, and sometimes the scheduler as well, and random tasks fail.
I would like to know whether there is a better way to generate the tasks, or whether some parameter tuning might help.
I would appreciate any help.
Technical details:
We have Airflow 2.0 running on Kubernetes.
We have made these changes:
Increased the scheduler to 3 replicas.
Changes in the configuration:
AIRFLOW__CORE__SQL_ALCHEMY_POOL_SIZE 1000
AIRFLOW__CORE__MAX_ACTIVE_RUNS_PER_DAG 1000
AIRFLOW__CORE__PARALLELISM 1000
AIRFLOW__CORE__SQL_ALCHEMY_MAX_OVERFLOW 60
AIRFLOW__CORE__DAG_CONCURRENCY 1000
We also increased the PgBouncer connection limits accordingly.
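For reference, Airflow reads configuration overrides from environment variables named `AIRFLOW__{SECTION}__{KEY}`, with double underscores around the section. A minimal sketch for building and sanity-checking such names follows; the helper names are hypothetical and not part of Airflow.

```python
# Illustrative helpers (NOT part of Airflow) for the environment-variable
# naming convention AIRFLOW__{SECTION}__{KEY}.

def airflow_env_var(section: str, key: str) -> str:
    """Build the environment variable name for a config section and key."""
    return f"AIRFLOW__{section.upper()}__{key.upper()}"


def looks_like_airflow_override(name: str) -> bool:
    """Check that a name has the AIRFLOW__SECTION__KEY shape."""
    parts = name.split("__", 2)
    return len(parts) == 3 and parts[0] == "AIRFLOW" and all(parts[1:])


if __name__ == "__main__":
    print(airflow_env_var("core", "parallelism"))
    # A name written with single underscores does not match the pattern.
    print(looks_like_airflow_override("AIRFLOW_CORE_PARALLELISM"))
```

A variable that fails this check would not be picked up as a config override, so the default value would stay in effect.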
I generated this task inside a task group using a loop: