Double call of scope_items method before and after pagination #4253

andrey-lizunov · 2022-11-22T11:41:32Z

Describe the bug

Hello.
I see weird behaviour in my project when resolver with cursor pagination calls scope_items method in type two times.
First time the scope in this method is a full set of data without pagination. Second time the scope with pagination.
It affects the performance because first call works with big amounts of data.
I use scope_items method to remove records that not available for current user and policy checks for thousands of records is not what I want :)

Versions

graphql version: 2.0.15
rails (or other framework): 6.1.5

GraphQL schema

class OfferingType < ::Domains::Types::Base::Object
  ...
  def self.scope_items(items, context)
    items.select do |item|
      ::OfferingPolicy.new(current_user, item).show?
    end
  end
end

class OfferingsResolver < ::Domains::Types::Base::Query
  include SearchObject.module(:graphql)
  type ::OfferingType.connection_type, null: false
  ...
end

GraphQL query

query Offerings {
  offerings(
    first: 3,
  ) {
    edges {
      cursor
    }
    nodes {
      id
    }
  }
}

{
  "data": {
    "offerings": {
      "edges": [
        {
          "cursor": "MQ"
        },
        {
          "cursor": "Mg"
        },
        {
          "cursor": "Mw"
        }
      ],
      "nodes": [
        {
          "id": "1407"
        },
        {
          "id": "486"
        },
        {
          "id": "1190"
        }
      ]
    }
  }
}

Steps to reproduce

Try to call any resolver with cursor pagination that works with type object with scope_items method. Set breakpoint in scope_items method.

Expected behavior

scope_items method call should be made once for paginated data set

Actual behavior

scope_items call made for non paginated and paginated data

Additional context

Add any other context about the problem here.

With these details, we can efficiently hunt down the bug!

The text was updated successfully, but these errors were encountered:

rmosolgo · 2022-11-22T14:16:00Z

Hi, thanks for the detailed report! I agree the list should only be scoped once and that the current behavior is a bug.

I think the reason for this bug is that connection types make an extra call to .scope_items, then the connection's nodes (or edges) list also makes a call to scope items. (Field#scoped? returns true for both list-type fields and connection-type fields.) So, I think the fix is to somehow make Field#scoped? return false when the field is a list-type, but when it belongs to a Connection type.

andrey-lizunov · 2022-11-22T22:39:31Z

@rmosolgo could you check the PR and provide your feedback? Does it look like valid fix?

cheald · 2023-01-10T22:21:15Z

I think this fix ended up producing the opposite of the desired behavior; scope_items is now called only on the unpaginated connection, rather than on the final materialized paginated array. What I want is for scoped_items to not run for the connection, but to only run for the list that results from that connection.

I've got this in my BaseObject, which is how I dealt with the two-pass behavior before:

    def self.scope_items(items, context)
      if items.is_a? Array
        items.select { |i| authorized?(i, context) }
      else
        items
      end
    end

Right now, graphql-ruby's behavior when any item in a list is unauthorized is to null the entire list. This turns out to be a problem in a few cases in my application, where my field resolvers resolve lists of records and then count on type authorization to determine if it's visible or not. What I want to have happen is for a list of items to be loaded, and then any unauthorized items just silently removed. I did this rather than allowing for nullable lists, as nullable nodes connections end up being a pain clientside, where we have to handle T[] | null rather than just applying list operations everywhere.

In 2.0.15, the two-pass nature of scope_items allowed this in a performant manner when dealing with paginated ActiveRecord associations. Obviously, an association might represent many thousands of records, and it is highly inefficient to load them all and authorize each of them. The way this worked it that when the field initially resolves, items is an unpaginated connection, because the pagination constraints are imposed by ActiveRecordRelationConnection later in resolution. Therefore, we just passed the relation through unchanged. However, during the second pass, when the list has been coalesced into an actual limited array of results, we can then do a performant pass of the array to remove any unauthorized entries that would otherwise cause the entire connection to return null.

What's happening now is the worst of both: the ActiveRecord_Relation gets passed to scoped_items (where I can't actually do anything with it, because my authorization logic relies on things that I can't do entirely in the DB, and I need authorized records), so it gets ignored, but then the resulting array of my type does not get passed to scope_items.

cheald · 2023-01-10T22:58:12Z

In my case, this does the trick: I do actually double-scope, but the scope on the connection turns into a no-op, then I re-enable scoping on the nodes/edges. This results in scope_items only being run on my arrays.

class Types::BaseConnection < Types::BaseObject 
    include GraphQL::Types::Relay::ConnectionBehaviors 

  def self.edge_type(*args, field_options: {}, **kwargs)
    field_options[:scope] = true # Add this to all nodes fields
    super(*args, field_options: field_options, **kwargs)
  end

  def self.nodes_field(field_options: {}, **kwargs)
    field_options[:scope] = true # Add this to all nodes fields
    super(field_options: field_options, **kwargs)
  end

  # Don't apply any actual scoping to connections. Instead, we're going to scope the nodes directly.
  def self.scope_items(items, context)
    items
  end
end

andrey-lizunov pushed a commit to andrey-lizunov/graphql-ruby that referenced this issue Nov 22, 2022

Fix for double scope_items method call (Issue rmosolgo#4253)

0bf8a93

andrey-lizunov mentioned this issue Nov 22, 2022

Fix for double scope_items method call (Issue #4253) #4254

Closed

rmosolgo mentioned this issue Dec 6, 2022

don't re-scope nodes or edges #4263

Merged

rmosolgo closed this as completed in #4263 Dec 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Double call of scope_items method before and after pagination #4253

Double call of scope_items method before and after pagination #4253

andrey-lizunov commented Nov 22, 2022 •

edited

Loading

rmosolgo commented Nov 22, 2022

andrey-lizunov commented Nov 22, 2022

cheald commented Jan 10, 2023 •

edited

Loading

cheald commented Jan 10, 2023

Double call of scope_items method before and after pagination #4253

Double call of scope_items method before and after pagination #4253

Comments

andrey-lizunov commented Nov 22, 2022 • edited Loading

rmosolgo commented Nov 22, 2022

andrey-lizunov commented Nov 22, 2022

cheald commented Jan 10, 2023 • edited Loading

cheald commented Jan 10, 2023

andrey-lizunov commented Nov 22, 2022 •

edited

Loading

cheald commented Jan 10, 2023 •

edited

Loading