Fix closing of pools #784

adriangb · 2024-04-10T20:33:39Z

I have a reproducible test where there are still connections left in pool._pool after close() is run. This seems to fix it for my case but I have no idea why and can't really share a reproducible example. My guess is it has something to do with a race condition and/or weirdness with pytest and event loops.

I'm opening this first to see what happens in CI.

adriangb · 2024-04-10T20:51:24Z

@dvarrazzo how would you feel about merging this given test pass? Does the change make sense to you at all?

dvarrazzo · 2024-04-10T20:58:25Z

psycopg_pool/psycopg_pool/pool.py

-        # putconn will just close the returned connection.
-        self._stop_workers(waiting, connections, timeout)
+            # Now that the flag _closed is set, getconn will fail immediately,
+            # putconn will just close the returned connection.


This comment is now misplaced. "Now" refers to the code out of the self._lock block, not to _stop_workers(). (my bad, the lack of a blank line made it deceptive).

Maybe it'd be best you propose a diff / push the change?

Ok, let me try and figure out a different codepath.

dvarrazzo · 2024-04-10T21:10:59Z

psycopg_pool/psycopg_pool/pool.py

+            # putconn will just close the returned connection.
+            self._stop_workers(waiting, connections, timeout)
+
+            self._pool.clear()


As you, I have no idea how you can get it the situation in which there are still connections in the pool. Maybe we call _stop_workers() in a moment in which a worker is busy creating a connection, and it will receive the StopWorker task only after it has added a new connection. Therefore I see how calling _pool.clear() afterwards can make a difference.

One thing that is note done right here is the _pool.clear() which doesn't close a connection eventually left there, which may result in a warning.

Looking at the _stop_worker(), I find the connections parameter extremely weird. It doesn't seem its responsibility to do that. Can you please try to refactor this code by removing the connections parameter from _stop_workers and then, as you did here, clear the pool, but closing the connection, after _stop_worker() call? Something like:

with self._lock: # [snip] self._waiting.clear() self._stop_workers(waiting, timeout) for conn in self._pool: conn.close() self._pool.clear() # Now that the flag _closed is set, getconn will fail immediately, # putconn will just close the returned connection.

I see that _stop_workers() is also called by __del__(), but with no connection list, and also that it takes on itself the responsibility of closing the waiting clients... I don't know if this is really the best organisation of this code: feel free to propose cleanup refactoring if you see any obvious one.

The case has been reported in #784. While not easy to reproduce, it seems that it might be caused by the pool being closed while a worker is still trying to create a connection, which will be put in the _pool state after supposedly no other operation should have been performed. Stop the workers and then empty the pool only after they have stopped to run. Also refactor the cleanup of the pool and waiting queue, moving them to close(). There is no reason why a method called "stop workers" should empty them, and there is no other code path that use such feature. Close #784.

Fix closing of pools

33f20f1

adriangb marked this pull request as ready for review April 10, 2024 20:33

adriangb marked this pull request as draft April 10, 2024 20:33

adriangb marked this pull request as ready for review April 10, 2024 20:44

dvarrazzo reviewed Apr 10, 2024

View reviewed changes

dvarrazzo mentioned this pull request Apr 10, 2024

fix(pool): make sure there are no connection in the pool after close() #786

Merged

dvarrazzo closed this in 910383f Apr 12, 2024

adriangb deleted the fix-close-pool-questionmark branch April 12, 2024 02:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix closing of pools #784

Fix closing of pools #784

adriangb commented Apr 10, 2024 •

edited

adriangb commented Apr 10, 2024

dvarrazzo Apr 10, 2024

adriangb Apr 10, 2024

dvarrazzo Apr 10, 2024

dvarrazzo Apr 10, 2024 •

edited

dvarrazzo Apr 10, 2024

Fix closing of pools #784

Fix closing of pools #784

Conversation

adriangb commented Apr 10, 2024 • edited

adriangb commented Apr 10, 2024

dvarrazzo Apr 10, 2024

Choose a reason for hiding this comment

adriangb Apr 10, 2024

Choose a reason for hiding this comment

dvarrazzo Apr 10, 2024

Choose a reason for hiding this comment

dvarrazzo Apr 10, 2024 • edited

Choose a reason for hiding this comment

dvarrazzo Apr 10, 2024

Choose a reason for hiding this comment

adriangb commented Apr 10, 2024 •

edited

dvarrazzo Apr 10, 2024 •

edited