
Fixed #29984 -- Support prefetch_related() with QuerySet.iterator() #10707

Closed
wants to merge 1 commit

Conversation

RaphaelKimmig
Contributor

@charettes
Member

charettes commented Nov 29, 2018

There seem to be some cursor exhaustion issues on PostgreSQL. I haven't tested it, but I wonder whether psycopg2 allows us to have multiple cursors open on the same connection at the same time. In this case we need a way to keep a server-side cursor open while we iteratively open multiple client-side cursors to perform the prefetches.

Did you investigate it a bit @RaphaelKimmig?

@RaphaelKimmig
Contributor Author

I've just looked into those failures, and I think the issue was simply that the tests used a batch size large enough that no cursor was kept open.

Given that this implementation now takes chunk_size results at a time, we'd only expect a cursor to remain open if there are more results than can be fetched in a single chunk.

I've updated the failing tests to use an appropriate chunk_size.

From what I understand, having multiple cursors open at the same time should not be an issue; in fact, there is a test for that (ServerSideCursorsPostgres.test_server_side_cursor_many_cursors).
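The chunking behavior described above can be sketched with a generic helper (hypothetical names, not the actual Django implementation): the source iterator is drained chunk_size items at a time via itertools.islice, so any server-side cursor backing it only needs to stay open while further chunks remain.

```python
from itertools import islice

def chunked(iterator, chunk_size):
    """Yield lists of up to chunk_size items until the iterator is exhausted.

    Hypothetical sketch of the pattern under discussion: the source iterator
    (and hence any server-side cursor backing it) is consumed chunk by chunk,
    and an empty chunk signals exhaustion.
    """
    iterator = iter(iterator)
    while True:
        chunk = list(islice(iterator, chunk_size))
        if not chunk:
            return
        # A per-chunk step such as prefetching related objects would run here.
        yield chunk
```

With five results and chunk_size=2, this yields chunks of 2, 2 and 1 items; only once the source is exhausted mid-chunk would the cursor be released, which matches the observation that a cursor stays open only when there are more results than fit in a single chunk.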

@taylor-cedar

@charettes What are the steps to getting this merged? This feature would be extremely useful.

Member

@charettes charettes left a comment


@@ -338,7 +338,22 @@ def __or__(self, other):
####################################

def _iterator(self, use_chunked_fetch, chunk_size):
yield from self._iterable_class(self, chunked_fetch=use_chunked_fetch, chunk_size=chunk_size)
clone = self._chain()
Member

It'd be great if we could avoid this and all of the following with an early not self._prefetch_related_lookups check.

e.g.

# Avoid materialization of chunks of objects if no prefetching must take
# place.
if not self._prefetch_related_lookups:
    yield from self._iterable_class(self, chunked_fetch=use_chunked_fetch, chunk_size=chunk_size)
    return

That'll make sure the exact same behavior is preserved when no prefetching is involved and allow you to revert the test_server_side_cursors.py changes.

Contributor Author

Yep, makes sense. I've reverted the test changes.

clone._result_cache = None
return

if clone._prefetch_related_lookups:
Member

This conditional clause can be dropped once we perform an initial not self._prefetch_related_lookups check as described above.

clone._prefetch_done = False

if len(clone._result_cache) == 0:
    clone._result_cache = None
Member

Why is this required? I don't think there's a need to unref an empty list, given clone is never returned?

clone._result_cache = list(islice(iterator, chunk_size))
clone._prefetch_done = False

if len(clone._result_cache) == 0:
Member

@charettes charettes Jan 12, 2019

Could use a boolean not clone._result_cache check as well.

clone._prefetch_related_objects()

for item in clone._result_cache:
    yield item
Member

Could be reduced to yield from iter(clone._result_cache).
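Taken together, the review suggestions sketch a loop of roughly this shape. This is a generic stand-in for illustration (a plain iterable and a callable prefetch hook are assumptions); the real _iterator operates on queryset clones and calls clone._prefetch_related_objects():

```python
from itertools import islice

def iterator_with_prefetch(source, chunk_size, prefetch=None):
    """Generic sketch of the reviewed chunked-prefetch loop (not Django code)."""
    # Early return: preserve the exact original behavior when there is
    # nothing to prefetch, as suggested in the first review comment.
    if prefetch is None:
        yield from source
        return
    source = iter(source)
    while True:
        # Materialize one chunk of results at a time.
        batch = list(islice(source, chunk_size))
        if not batch:  # boolean check instead of len(...) == 0
            return
        prefetch(batch)  # stand-in for clone._prefetch_related_objects()
        yield from batch
```

For example, iterating five items with chunk_size=2 and a hook that records its argument invokes the hook once per chunk while still yielding every item in order.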

@felixxm
Member

felixxm commented Jun 12, 2020

Closing due to inactivity.
