OverlappingFieldsCanBeMerged is slow #1786

redhead · 2020-02-07T13:50:34Z

OverlappingFieldsCanBeMerged can be very slow when many fragments in the query.
Scala implementation resolved it by optimizing the algorithm.

See sangria-graphql-org/sangria#12.

Can we do something similar in graphql-java?

The text was updated successfully, but these errors were encountered:

bbakerman · 2020-02-09T21:09:28Z

Can we do something similar in graphql-java?

of course we can - its an open source project and you can you submit a PR to get this started.

redhead · 2020-02-09T23:51:18Z

It doesn't follow the spec, so I'm asking if it's the way to go.

I may do it once I will get some free time.

bbakerman · 2020-02-10T02:27:32Z

Are you saying that way the sangria team solved the performance problem was to not follow spec?

if so can you expand on that and help us understand what they did and how it deviates?

I read that issue but there is a lot of reverse engineering involved and it would be great to get a head start on that

redhead · 2020-02-11T12:24:54Z

I didn't read it very much into details, but they looked at the algorithm defined by the spec regarding overlapping-fields-can-be-merged, and introduced some optimizations to it (some caching, and some function call refactorings so they don't check things more than needed).

Currently, it is in "experimental" and providing a way how to replace the original validator with the optimized one using their API for validators. Which is not possible in graphql-java if I saw correctly. So the spec algorithm is still the default one, but there is a way to replace it.

See also
(short comment) sangria-graphql-org/sangria#12 (comment)
(details in blog post) https://tech.xing.com/graphql-overlapping-fields-can-be-merged-fast-ea6e92e0a01

redhead · 2020-03-18T14:18:22Z

Bump.

We would like to sort this our for our application, as it's imposing quite a big performance hit for our API.

andimarek · 2020-03-19T07:45:27Z

@redhead can you provide us with some profiling results?

redhead · 2020-03-19T08:12:36Z

I kind of wanted to start a discussion about it first, so I can't give you profiling results of the rewritten algorithm (I can of the current state).

Also, as it's not following the spec's pseudocode, I wasn't sure you would be ok of merging it at all.

redhead · 2020-03-19T08:14:45Z

This is the state now. As you can see, even constructing quite a big SQL select and running it on the DB server was faster than OverlappingFieldsCanBeMerged.isAlreadyChecked

andimarek · 2020-03-19T21:16:13Z

How big was the query you are running? Can maybe share it? Thanks

redhead · 2020-03-20T09:08:54Z

The query is big (2k lines), it's because we are doing a global search query which can potentially return any type of our app model entity. Because of that, it consists of a lot of inline fragments for "casting" to the right GQL type of the app model entity. Most of the entities don't have a common denominator interface to make it consise.

The gist of it is here, it was edited for clarity and to hide our business domain:

query GlobalSearch {
  _search(size: 10, filter: "foobar") {
    edges {
      node {
        gid
        type
        publishedVersion {
          ... on FooType {
            name
            _ancestors {
              gid
              type
              ancestorVersion {
                ... on AType {
                  name
                  _draftType
                }
                ... on BType {
                  label
                  _draftType
                }
              }
            }
          }
          ... on BarType {
            name
            _ancestors {
              gid
              type
              ancestorVersion {
                ... on XType {
                  label
                  _draftType
                }
                ... on YType {
                  yName
                  _draftType
                }
                ... on ZType {
                  zName
                  _draftType
                }
              }
            }
          }

          #################
          ## ...
          ## THIS CAN GO ON LIKE THIS HUNDRED OF TIMES
          ## ...
          #################

        }
      }
    }
    pageInfo {
      hasNextPage
      endCursor
    }
    totalCount
  }
}

redhead · 2020-03-20T09:12:48Z

Please have a look into the sangria bug report (though written in scala, it probably follows the same algorithm):
sangria-graphql/sangria#296

and the query they reported was:
https://gist.github.com/objmagic/3c881449fcdb3a812a371b86bfa5a3c9

redhead · 2020-04-07T19:12:07Z

(I had to delete my previous post from today as it wasn't fully valid.)

I took a chance to try to rewrite the algorithm from Scala (sangria) implementation to graphql-java. As you can see, it went quite well:

I have a dirty WIP code of the new algorithm rewritten from scala., All the tests from OverlappingFieldsCanBeMergedTest are passing except for these cases:

the error messages are not exactly the same (could be more or less of them, as they can be reported case by case)
types that cannot be resolved according to the schema are ignored (fields' types that are unknown in validator are not considered for the validation - they always pass)

fixes graphql-java#1786

davidcurrie · 2020-09-21T10:53:52Z

+1 for giving this some serious consideration. I'm looking at a Java profile where, of the 3.8s of CPU time spent in the GraphQL endpoint, 3.2s is spent in OverlappingFieldsCanBeMerged.leaveSelectionSet.

andimarek · 2021-08-02T06:34:47Z

hi,

we are planning to address this issue via #2495

Special thanks to @redhead: we really appreciate the effort you put in, but ultimately we could not accept a scala version of the algorithm from a longterm maintainability POV. But we implemented the same algorithm based on the xing article.

We are planning to replace the current validation completely with the more performant one.

Andi

redhead · 2021-08-02T07:26:33Z

Thanks a lot. It looks great.

andimarek · 2021-08-03T07:40:49Z

merged now and will be part of 17.0

redhead mentioned this issue Apr 8, 2020

Experimental fast overlapping-fields-can-be-merged algorithm #1854

Closed

redhead pushed a commit to redhead/graphql-java that referenced this issue Apr 8, 2020

adds experimental fast overlapping-fields-can-be-merged algorithm

ffc0f10

fixes graphql-java#1786

andimarek closed this as completed Aug 3, 2021

andimarek added this to the 17.0 milestone Aug 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OverlappingFieldsCanBeMerged is slow #1786

OverlappingFieldsCanBeMerged is slow #1786

redhead commented Feb 7, 2020

bbakerman commented Feb 9, 2020

redhead commented Feb 9, 2020

bbakerman commented Feb 10, 2020

redhead commented Feb 11, 2020

redhead commented Mar 18, 2020

andimarek commented Mar 19, 2020

redhead commented Mar 19, 2020

redhead commented Mar 19, 2020

andimarek commented Mar 19, 2020

redhead commented Mar 20, 2020

redhead commented Mar 20, 2020

redhead commented Apr 7, 2020 •

edited

Loading

davidcurrie commented Sep 21, 2020

andimarek commented Aug 2, 2021

redhead commented Aug 2, 2021

andimarek commented Aug 3, 2021

OverlappingFieldsCanBeMerged is slow #1786

OverlappingFieldsCanBeMerged is slow #1786

Comments

redhead commented Feb 7, 2020

bbakerman commented Feb 9, 2020

redhead commented Feb 9, 2020

bbakerman commented Feb 10, 2020

redhead commented Feb 11, 2020

redhead commented Mar 18, 2020

andimarek commented Mar 19, 2020

redhead commented Mar 19, 2020

redhead commented Mar 19, 2020

andimarek commented Mar 19, 2020

redhead commented Mar 20, 2020

redhead commented Mar 20, 2020

redhead commented Apr 7, 2020 • edited Loading

davidcurrie commented Sep 21, 2020

andimarek commented Aug 2, 2021

redhead commented Aug 2, 2021

andimarek commented Aug 3, 2021

redhead commented Apr 7, 2020 •

edited

Loading