fix: fk resolver permissions leak #1411

superlevure · 2023-05-22T14:55:43Z

This PR re-introduce the changes from #1315 to make sure we go through get_queryset when resolving foreign relationships, with some differences:

It applies the improvement from fix(converter.py): performance fix in `convert_field_to_djangomodel.dynamic_type.CustomField #1391 so we don't perform and additional SQL query when the id of the foreign object can be retrieved from the object itself (via relation_id)
It applies the fix to OneToOneRel as well
It makes sure to use the default resolver when get_queryset was not overloaded by the Type, so we can leverage optimization techniques such as prefetch_related and select_related (see Revert field resolver logic to fix poor query performance #1401)

Left to do:

Update documentation

firaskafri · 2023-05-23T07:06:39Z

@superlevure this seems to be very interesting! waiting to test on my local environment!

superlevure · 2023-05-23T07:20:33Z

@superlevure this seems to be very interesting! waiting to test on my local environment!

Thank you!

If the description was not clear enough, this PR allowsprefetch_related and select_related optimizations when no custom get_queryset is defined for the Type.

When one is defined, though, any filtering on the queryset will invalidate the "cache" offered by those as explained by Django documentation here:

https://docs.djangoproject.com/en/4.2/ref/models/querysets/#prefetch-related

I hope this MR can reconcile the two worlds: those who are dependent on select/prefetch_related and those who rely on custom get_queryset. Note that for the latter, the N+1 problem can be tackled by dataloaders instead.

sjdemartini

@superlevure Thank you for following up on this!

I definitely appreciate your point around this:

I hope this MR can reconcile the two worlds: those who are dependent on select/prefetch_related and those who rely on custom get_queryset

It looks like overall this does the trick for that, and your approach of only using the custom resolver when get_queryset has been overridden seems fairly reasonable.

I was still mostly unconvinced that relying on get_queryset is really the right way to handle authorization, for the reasons around performance that you quoted from the Django docs, in that it tends to mean very expensive N+1 exponential SQL query fan-out. However, your tests around the FilmType/FilmDetailsType show a use-case where you don't actually do any further queryset filtering inside of get_queryset to do your authorization, and that does seem tenable. 👉 Can I suggest that we update the "Global Filtering" graphene-django docs example to be more similar to that, where it doesn't actually change/filter the queryset, but instead just conditionally returns the queryset? I think it would also be worth noting in those docs something like "Warning: if you modify the queryset, such as performing any filtering here, it will result in N+1 queries when this DjangoObjectType is used as a relation from another Django model in graphene." WDYT?

I'm going to try running the tests from graphene-django-optimizer (and from my graphene-django-permissions project) against the changes in this branch to make sure there aren't any separately-revealed performance regressions there.

sjdemartini · 2023-05-24T17:19:47Z

graphene_django/converter.py

+        class CustomField(Field):
+            def wrap_resolve(self, parent_resolver):
+                """
+                Implements a custom resolver which go through the `get_node` method to insure that


minor grammar: go -> goes, insure -> ensure

Same in convert_field_to_djangomodel

Thank you for the catch

sjdemartini · 2023-05-24T17:23:34Z

graphene_django/converter.py

+                    return resolver
+
+                def custom_resolver(root, info, **args):
+                    # Note: this function is used to resolve FK or 1:1 fields


Just out of curiosity, if this is used to resolve 1:1 fields, why does convert_onetoone_field_to_djangomodel also need this similar custom_resolver overriding logic separately?

Good question; for each relation kind, we need to deal with both directions: forward (OneToOneField, ForeignKey, ManyToManyField) and backward (ManyToManyRel, ManyToOneRel, OneToOneRel). The resolving logic depends on which side you're working on, hence the separation.

sjdemartini · 2023-05-24T17:46:24Z

graphene_django/converter.py

    model = field.related_model

    def dynamic_type():
        _type = registry.get_type_for_model(model)
        if not _type:
            return

-        return Field(
+        class CustomField(Field):


As a general comment (and I don't mean this as a slight in any way), the code to achieve this behavior feels quite complex/confusing to me, bordering on it seeming like a code smell. e.g., are we properly handling all scenarios with it? How brittle is it? But I'm still not super familiar with all of the internals of graphene-django, so perhaps that's just how something of this lower-level nature would have to be written. I'd be curious to see @mahdyhamad's thoughts, since you're incorporating some of his efforts he'd described in #1315 (comment) but hadn't completed.

I agree this needs to be tested further and maybe simplified/refactored.
Also, it seems that this part from #1361 by-passes get_queryset when the resolver is awaitable. I plan to tackle this in a future PR.

if is_resolver_awaitable: fk_obj = resolver(root, info, **args) # In case the resolver is a custom awaitable resolver that overwrites # the default Django resolver return fk_obj

sjdemartini

Hm, unfortunately the changes here do break query optimization in graphene-django-optimizer. I overwrote the converter.py file in that project's env to match this PR locally, and the master branch of graphene-django-optimizer then had 8 new tests failing. Specifically:

tests/test_query.py::test_should_reduce_number_of_queries_by_using_select_related FAILED
tests/test_query.py::test_should_optimize_when_using_fragments FAILED
tests/test_query.py::test_should_select_nested_related_fields FAILED
tests/test_query.py::test_should_prefetch_nested_select_related_field FAILED
tests/test_query.py::test_should_select_nested_prefetch_related_field FAILED
tests/test_query.py::test_should_select_nested_prefetch_and_select_related_fields FAILED
tests/test_query.py::test_should_fetch_fields_of_related_field FAILED
tests/test_relay.py::test_should_reduce_number_of_queries_in_relay_schema_by_using_select_related FAILED

For instance, for test_should_reduce_number_of_queries_by_using_select_related:

>       assert_query_equality(items, optimized_items)

tests/test_query.py:39:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

left_query = <QuerySet []>, right_query = <QuerySet []>

    def assert_query_equality(left_query, right_query):
>       assert str(left_query.query) == str(right_query.query)
E       assert 'SELECT "test...."name" = bar' == 'SELECT "tests...."name" = bar'
E         - SELECT "tests_item"."id", "tests_item"."name", "tests_item"."parent_id", "tests_item"."item_id", "tests_item"."value" FROM "tests_item" WHERE "tests_item"."name" = bar
E         + SELECT "tests_item"."id", "tests_item"."name", "tests_item"."parent_id", "tests_item"."item_id", "tests_item"."value", T2."id", T2."name", T2."parent_id", T2."item_id", T2."value" FROM "tests_item" LEFT OUTER JOIN "tests_item" T2 ON ("tests_item"."parent_id" = T2."id") WHERE "tests_item"."name" = bar

It looks to me like the select_related optimization is getting removed. Maybe a quirk of how those tests are written and how this CustomField override is happening? Any ideas how to fix that?

I don't think this should be merged until we can be confident it does not introduce these sorts of performance regressions.

superlevure · 2023-05-25T07:14:00Z

@sjdemartini, thank you for taking the time to test my PR on graphene-django-optimizer.

I'm trying to reproduce the issue, but I can't seem to make the tests pass on master first. What I have done so far:

Cloned https://github.com/tfoxy/graphene-django-optimizer
Followed CONTRIBUTING.md:

virtualenv -p python3 venv
. venv/bin/activate
pip install -r dev-requirements.txt
# run tests:
python setup.py test

I'm getting the following for each test:
AttributeError: 'ExecutionContext' object has no attribute 'collect_fields'

Does that ring a bell to you? Is this the correct repo you are referring to?

superlevure

Thanks for the review @sjdemartini

I was still mostly unconvinced that relying on get_queryset is really the right way to handle authorization, for the reasons around performance that you quoted from the Django docs, in that it tends to mean very expensive N+1 exponential SQL query fan-out.

For our use case, we have to rely on get_queryset to perform auth filtering at the DB level, as pulling all the data from the DB to Python to perform the filter would be a substantial performance/memory cost and would not scale.

We're tackling the N+1 problem with dataloaders instead of select_related/prefetch_related.

Can I suggest that we update the "Global Filtering" graphene-django docs example to be more similar to that, where it doesn't actually change/filter the queryset, but instead just conditionally returns the queryset? I think it would also be worth noting in those docs something like "Warning: if you modify the queryset, such as performing any filtering here, it will result in N+1 queries when this DjangoObjectType is used as a relation from another Django model in graphene."

Updating the documentation is on my to-do for this PR indeed. Also to be noted that the N+1 issue occurs anyway if you use graphene-django without select_related/prefetch_related, so the comment should be more something like "Warning: modifying the queryset in get_queryset will cancel out any optimization technique such as select_related/prefetch_related in the model's manager (link to Django doc)"

superlevure · 2023-05-25T06:48:27Z

graphene_django/converter.py

+        class CustomField(Field):
+            def wrap_resolve(self, parent_resolver):
+                """
+                Implements a custom resolver which go through the `get_node` method to insure that


Thank you for the catch

superlevure · 2023-05-25T08:11:51Z

graphene_django/converter.py

+                    return resolver
+
+                def custom_resolver(root, info, **args):
+                    # Note: this function is used to resolve FK or 1:1 fields


Good question; for each relation kind, we need to deal with both directions: forward (OneToOneField, ForeignKey, ManyToManyField) and backward (ManyToManyRel, ManyToOneRel, OneToOneRel). The resolving logic depends on which side you're working on, hence the separation.

superlevure · 2023-05-25T08:20:15Z

graphene_django/converter.py

    model = field.related_model

    def dynamic_type():
        _type = registry.get_type_for_model(model)
        if not _type:
            return

-        return Field(
+        class CustomField(Field):


I agree this needs to be tested further and maybe simplified/refactored.
Also, it seems that this part from #1361 by-passes get_queryset when the resolver is awaitable. I plan to tackle this in a future PR.

if is_resolver_awaitable: fk_obj = resolver(root, info, **args) # In case the resolver is a custom awaitable resolver that overwrites # the default Django resolver return fk_obj

sjdemartini · 2023-05-25T16:44:25Z

@sjdemartini, thank you for taking the time to test my PR on graphene-django-optimizer.

I'm trying to reproduce the issue, but I can't seem to make the tests pass on master first. What I have done so far:

Cloned https://github.com/tfoxy/graphene-django-optimizer

Followed CONTRIBUTING.md:
virtualenv -p python3 venv
. venv/bin/activate
pip install -r dev-requirements.txt
# run tests:
python setup.py test
I'm getting the following for each test: AttributeError: 'ExecutionContext' object has no attribute 'collect_fields'

Does that ring a bell to you? Is this the correct repo you are referring to?

Thank you for testing it out! Ah, yeah, that is because graphql-core released a newer version that broke some behavior, and the dev-requirements.txt did not pin the version of that package. There is an open PR (tfoxy/graphene-django-optimizer#83) that resolves the problem, so if you use the GitHub CLI, you can gh pr checkout 83 to fix the above error and get tests to pass. (Alternatively, you can add graphql-core==3.1.7 to the dev-env-requirements.txt locally. Caveat that in that situation, there's an unrelated failing test on master test_should_return_valid_result_in_a_relay_query, presumably also due to some graphql-core behavior; that test is patched in the PR I mentioned.)

Updating the documentation is on my to-do for this PR indeed. Also to be noted that the N+1 issue occurs anyway if you use graphene-django without select_related/prefetch_related, so the comment should be more something like "Warning: modifying the queryset in get_queryset will cancel out any optimization technique...

Good point, sounds good to me!

superlevure · 2023-06-09T09:48:24Z

@sjdemartini I've spent some time on graphene-django-optimizer and realized that if you switch the tests fixture BaseItemType base class from OptimizedDjangoObjectType to DjangoObjectType the tests suite still passes on the fix/graphql-3.2 branch. This means that ultimately the custom get_queryset method defined in OptimizedDjangoObjectType is not executed in tests using that object, and thus OptimizedDjangoObjectType has no effect on the test results.

The reason why tests are failing with this branch of graphene-django is precisely because of this custom get_queryset implemented inOptimizedDjangoObjectType, which was previously not being used (due to the bug we are trying to fix with this PR) now getting executed. Switching to DjangoObjectType makes graphene-django-optimizer tests pass with this PR (see this branch: https://github.com/superlevure/graphene-django-optimizer/tree/fix/use-DjangoObjectType).

Some other thoughts on graphene-django-optimizer, the latest commit on main was on Oct 15, 2021, and as of today, the CI is failing on main. I think it's fair to say that the repo is not really maintained anymore. On the contrary, graphene-django still is, and this MR aims to fix a critical bug that can have profound security implications while preserving performance optimization when possible.

sjdemartini

Thanks @superlevure for taking the time to investigate that and figure out what was causing the test failures within graphene-django-optimizer! It looks like if we merge this, the optimizer's OptimizedDjangoObjectType class will effectively be broken, and it's somewhat of a bummer that there will be no way to generically define a DjangoObjectType base class that will be query-optimized for both lists and nodes by default, in the way that was done before there (since overriding get_queryset means query optimization will be "broken" for nested models).

But I definitely take your point on the current maintenance state of that package. It seems to have fizzled out around the time that graphene-django originally did, but never revived sadly. (I've pinged the owner to ask if I can get added as a maintainer, fwiw.) Also fortunately, the OptimizedDjangoObjectType doesn't seem like a critical piece of the package's functionality IMO and is more of a boilerplate-reducer, since you can still use the optimizer directly with resolve_* functions and whatnot.

Since it's still possible to optimize queries in general (and with graphene-django-optimizer, just not using OptimizedDjangoObjectType) with your approach here, your changes seem reasonable to me!

firaskafri · 2023-06-12T11:05:12Z

I believe there had been discussions to have the optimizer built in graphene-django, maybe that is the way to go? to have these optimizations as part of graphene-django? Specially now that the original library is not being maintained.

@jkimbo @ulgens @sjdemartini what do you think guys? or should we just keep things separated? One thought I have is that using these optimizations is actually a "very" django thing so maybe it is the way to go?

sjdemartini · 2023-06-15T17:52:31Z

I'm supportive of the idea of baking in the optimizations into graphene-django itself; it seems like that'd lend itself to cleaner/easier maintenance and likely a better implementation. There's an old issue here #57 where this was discussed a bit and generally landed on the notion that the optimizations should be part of graphene-django (#57 (comment)).

Personally I don't think I'd have the time to dig into the details of how it could be done or port that functionality myself—I imagine it's not a small feat—and sadly the graphene-django-optimizer maintainer hasn't been responsive either. But I generally like the concept if someone else wants to pick that up!

* fix: fk resolver permissions leak * fix: only one query for 1o1 relation * tests: added queries count check * fix: docstring * fix: typo * docs: added warning to authorization * feat: added bypass_get_queryset decorator

fix: fk resolver permissions leak

edb9690

superlevure added permissions security labels May 22, 2023

superlevure added 3 commits May 22, 2023 16:28

fix: only one query for 1o1 relation

013dbbe

tests: added queries count check

a2a95a4

fix: docstring

5079a46

firaskafri requested review from sjdemartini, tcleonard and mahdyhamad May 23, 2023 07:06

sjdemartini reviewed May 24, 2023

View reviewed changes

sjdemartini requested changes May 24, 2023

View reviewed changes

fix: typo

f80bf5f

superlevure commented May 25, 2023

View reviewed changes

docs: added warning to authorization

ccce70e

sjdemartini mentioned this pull request Jun 9, 2023

Fix compatibility with graphql-core v3.2 tfoxy/graphene-django-optimizer#83

Merged

sjdemartini approved these changes Jun 9, 2023

View reviewed changes

superlevure marked this pull request as ready for review June 12, 2023 09:37

Merge branch 'main' into fix-fk-resolver

91ddfbd

feat: added bypass_get_queryset decorator

42d1621

firaskafri self-requested a review July 18, 2023 12:16

firaskafri approved these changes Jul 18, 2023

View reviewed changes

firaskafri merged commit 0de35ca into graphql-python:main Jul 18, 2023
13 checks passed

kiendang mentioned this pull request Dec 6, 2023

Vesion 3.1.5 returns null for an object pointed to non PK in referred table. #1482

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: fk resolver permissions leak #1411

fix: fk resolver permissions leak #1411

superlevure commented May 22, 2023 •

edited

firaskafri commented May 23, 2023

superlevure commented May 23, 2023

sjdemartini left a comment

sjdemartini May 24, 2023

superlevure May 25, 2023

sjdemartini May 24, 2023

superlevure May 25, 2023

sjdemartini May 24, 2023

superlevure May 25, 2023

sjdemartini left a comment

superlevure commented May 25, 2023

superlevure left a comment

superlevure May 25, 2023

superlevure May 25, 2023

superlevure May 25, 2023

sjdemartini commented May 25, 2023

superlevure commented Jun 9, 2023

sjdemartini left a comment

firaskafri commented Jun 12, 2023

sjdemartini commented Jun 15, 2023

fix: fk resolver permissions leak #1411

fix: fk resolver permissions leak #1411

Conversation

superlevure commented May 22, 2023 • edited

firaskafri commented May 23, 2023

superlevure commented May 23, 2023

sjdemartini left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sjdemartini left a comment

Choose a reason for hiding this comment

superlevure commented May 25, 2023

superlevure left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sjdemartini commented May 25, 2023

superlevure commented Jun 9, 2023

sjdemartini left a comment

Choose a reason for hiding this comment

firaskafri commented Jun 12, 2023

sjdemartini commented Jun 15, 2023

superlevure commented May 22, 2023 •

edited