-
Notifications
You must be signed in to change notification settings - Fork 14.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New grid view in Airflow 2.3.0 has very slow performance on large DAGs relative to tree view in 2.2.5 #23772
Comments
Oh yes, 30s is way too long. I would say there are probably 2 issues here:
Could you share the network latency when you click auto-refresh and get a response in the network from |
Also, a temporary work around: changing the number of runs |
Thanks for the quick response @bbovenzi !
|
@bbovenzi Thanks for the progress so far. I tried out the new 2.3.1 release with #23813 in it, but this is still an issue for us. The big DAG mentioned above is now taking >60 seconds to load and then timing out (we have an nginx reverse proxy with a 60 second timeout on it). The timeout is on the Are you still planning more work on this issue? Can we get it into 2.3.2? |
Yes, that change was just for dynamic tasks. I am working on more optimizations. They just didn't make it in time for 2.3.1. |
Going to reopen as we can still do more to improve perfomance for large DAGs |
I can confirm I'm running 2.3.2 and the grid view is not usable (not loading) with large number of tasks and the default of 25 runs. It was no problem in 2.2.x. It loads fine on 2.3.x as 5 runs. The biggest number of tasks I have is 812 in one dag which is essentially 6 datasources loading 7 different days of data for 1-10 tables and about 6 tasks for each so things add up quickly |
@c-thiel / @sbailliez, have you tried with #24284 applied in your environment? I'm curious if that gets it working for you. |
Don't worry that we closed this. We'll keep working on performance improvements. |
Apache Airflow version
2.3.0 (latest released)
What happened
I upgraded a local dev deployment of Airflow from 2.2.5 to 2.3.0, then loaded the new
/dags/<dag_id>/grid
page for a few dag ids.On a big DAG, I’m seeing 30+ second latency on the
/grid
API, followed by a 10+ second delay each time I click a green rectangle. For a smaller DAG I tried, the page was pretty snappy.I went back to 2.2.5 and loaded the tree view for comparison, and saw that the
/tree/
endpoint on the large DAG had 9 seconds of latency, and clicking a green rectangle had instant responsiveness.This is slow enough that it would be a blocker for my team to upgrade.
What you think should happen instead
The grid view should be equally performant to the tree view it replaces
How to reproduce
Generate a large DAG. Mine looks like the following:
Compare against a small DAG, in my case:
The large DAG is unusable, the small DAG is usable.
Operating System
Ubuntu 20.04.3 LTS (Focal Fossa)
Versions of Apache Airflow Providers
No response
Deployment
Docker-Compose
Deployment details
Docker-compose deployment on an EC2 instance running ubuntu.
Airflow web server is nearly stock image from
apache/airflow:2.3.0-python3.9
Anything else
Screenshot of load time:
GIF of click latency:
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: