-
Notifications
You must be signed in to change notification settings - Fork 2.9k
-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Finished queries continue using memory pool on worker #11030
Comments
@sshkvar thanks for the report.
|
@losipiuk thanks for the quick reply, These queries pretty different like simple |
Small update. All stuck queries have reservation only in
Amount of reserved memory is pretty small, up to a few megabytes. |
@arhimondr is this manifestation of #10950? Or different thing? |
And @sshkvar what Trino version did you use before 367? |
We used Trino(Presto) 346 |
And you are not using query/task based retries which were introduced in 367? |
No, we are not using retries. As I know it disabled by default |
Yeah. Tricky - did you maybe have a chance to take a look into logs? Any errors/stacktraces logged? |
@sshkvar Do you know if the queries you see holding memory reservations finished successfully or failed? |
we didn't find any errors
queries finished successfully |
@sshkvar Do your queries contain We've found a race condition that may result in memory not being release properly: #11088. But that shouldn't happen unless you have a Also the likelihood of this problem occurrence should be fairly low. But since you are running thousands of queries it may happen that some of them trigger the race condition mentioned. |
We should also improve our integration tests to verify that memory pools are empty upon completion: #11093 |
Should be fixed by #11088 |
Hi, after update to Trino 367 we found that finished queries doesn't release memory pool on workers.
As a result we have a lot of queries which actually finished but still exists in workers memory pool.
On the chart below you can find amount of such queries and see how these amount grown after cluster restart
As you can see we have 12000 queries which actually finished but using memory pool on workers.
Les't see example of one worker
On this worker we have 166 running queries, but 98 of them already finished but still using memory pool.
For example 20220210_093944_48617_446ev
When I try to click on this query on open in new tab I see query not found
The text was updated successfully, but these errors were encountered: