GraphQL threats detection and protection #3769

vpellan · 2024-07-09T13:56:20Z

What does this PR do?

This PR adds threat detection and protection for GraphQL using libddwaf, the reactive engine, and a (fake) GraphQL-Ruby tracer that acts as middleware around the method we want to instrument.
Instead of instrumenting execute_query and verifying variables and arguments there, we walk every query's AST in execute_multiplex, which enables us to block all queries in the multiplex if a threat is detected, not just the one where the threat is actually located.
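
To illustrate the idea of walking each query's AST and blocking the whole multiplex, here is a minimal sketch. It is not the code from this PR: `Field`, `Argument`, `collect_arguments`, `threat?` and `block_multiplex?` are hypothetical stand-ins, the real integration walks GraphQL-Ruby language nodes, and the real decision comes from libddwaf rather than a string match.

```ruby
# Stand-in AST node types (assumption for illustration only).
Field = Struct.new(:name, :arguments, :selections)
Argument = Struct.new(:name, :value)

# Iteratively walk one query's selections and collect arguments per field name.
def collect_arguments(root_selections)
  args = Hash.new { |hash, key| hash[key] = [] }
  stack = root_selections.dup
  until stack.empty?
    node = stack.pop
    unless node.arguments.empty?
      args[node.name] << node.arguments.to_h { |arg| [arg.name, arg.value] }
    end
    stack.concat(node.selections)
  end
  args
end

# Hypothetical detector standing in for the WAF decision.
def threat?(arguments)
  arguments.values.flatten.any? do |arg_hash|
    arg_hash.values.any? { |value| value.to_s.include?("' OR 1=1") }
  end
end

# Block every query in the multiplex as soon as one of them carries a threat.
def block_multiplex?(queries)
  queries.any? { |selections| threat?(collect_arguments(selections)) }
end

safe = [Field.new('user', [Argument.new('id', 1)], [Field.new('name', [], [])])]
evil = [Field.new('userByName', [Argument.new('name', "x' OR 1=1")], [Field.new('id', [], [])])]

puts block_multiplex?([safe])       # false
puts block_multiplex?([safe, evil]) # true
```

Because the decision is taken once for the whole multiplex, the safe query in the second call is blocked along with the malicious one.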

Motivation:

There has been some customer demand for this feature. The goal is to dogfood this on Datadog's GitLab.

Additional Notes:

This PR also includes some complementary changes:

- Added support for GraphQL 2.3, and added Rails to the GraphQL appraisals to run integration tests on a full Rails app.
  - This means that many of the changed files are just gemfiles and gemfile locks.
- Refactored WAF result specs that were in:
  - appsec/contrib/rack/reactive/*
  - appsec/contrib/rails/reactive/*
  - appsec/contrib/sinatra/reactive/*
  - appsec/monitor/reactive/*

  into appsec/reactive/shared_examples.rb
- Fixed a typo in fetch_configuration in:
  - appsec/processor/actions.rb and associated sig & spec
  - appsec/response.rb
- Fixed indentation in docs/GettingStarted.md

The files that are directly connected to GraphQL threat detection are:
- Rakefile
- Matrixfile
- lib/datadog/appsec.rb
- lib/datadog/appsec/response.rb
- lib/datadog/appsec/contrib/graphql/*
- sig/datadog/appsec/contrib/graphql/*
- spec/datadog/appsec/contrib/graphql/*
- spec/datadog/tracing/contrib/graphql/support/*
- spec/datadog/tracing/contrib/rails/support/*

How to test the change?

bundle exec appraisal ruby-X.X-graphql-X.X rake spec:appsec:graphql

maycmlee

👍 for docs

lloeki

Overall it looks good.

I see a factorisation opportunity, plus a few small notes+questions for clarification.

lloeki · 2024-07-11T08:50:56Z

lib/datadog/appsec/contrib/graphql/appsec_trace.rb

+                multiplex_return = []
+                gateway_multiplex.queries.each do |query|
+                  query_result = ::GraphQL::Query::Result.new(
+                    query: query,
+                    values: JSON.parse(AppSec::Response.content_json)
+                  )
+                  multiplex_return << query_result


Could this be extracted into AppSec::Response.negotiate?

We could extract it, but I believe it would become unnecessarily complex. AppSec::Response.negotiate was made for HTTP-level frameworks, selecting the type of response the client configured (JSON, HTML or plain text). But GraphQL only returns JSON, so we'd have to add a special case to always enforce JSON for GraphQL, and also duplicate the resulting JSON once per query in the multiplex.

Maybe this then?

multiplex_return = appsec_response(some, args)

and

    private def appsec_response(some, args)
      gateway_multiplex.queries.map do |query|
        query_result = ::GraphQL::Query::Result.new(
          query: query,
          values: JSON.parse(AppSec::Response.content_json)
        )
      end
    end

To split concerns
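
A runnable sketch of that split, for illustration only: `QueryResult` stands in for `::GraphQL::Query::Result`, `BLOCKED_JSON` is a made-up payload standing in for `AppSec::Response.content_json`, and `BlockingTracer` with its `blocked:` flag is hypothetical. The point is only that detection decides whether to block, while a private helper builds one identical result per query.

```ruby
require 'json'

# Stand-in for ::GraphQL::Query::Result so the sketch runs without the graphql gem.
class QueryResult
  attr_reader :query, :values

  def initialize(query:, values:)
    @query = query
    @values = values
  end
end

# Hypothetical payload; the real content comes from AppSec::Response.content_json.
BLOCKED_JSON = '{"errors":[{"title":"Blocked","detail":"Request blocked by security rules"}]}'

class BlockingTracer
  # Detection decides whether to block; response generation is delegated.
  def execute_multiplex(queries, blocked:)
    return blocked_response(queries) if blocked

    queries.map { |query| QueryResult.new(query: query, values: { 'data' => {} }) }
  end

  private

  # Build one identical blocked result per query in the multiplex.
  def blocked_response(queries)
    queries.map do |query|
      QueryResult.new(query: query, values: JSON.parse(BLOCKED_JSON))
    end
  end
end

results = BlockingTracer.new.execute_multiplex(%w[q1 q2], blocked: true)
puts results.length                               # 2
puts results.all? { |r| r.values.key?('errors') } # true
```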

lloeki · 2024-07-11T08:51:46Z

lib/datadog/appsec/contrib/graphql/ext.rb

+      module GraphQL
+        # GraphQL integration constants
+        # @public_api Changing resource names, tag names, or environment variables creates breaking changes.
+        module Ext


Since this Ext is empty, you might as well remove the file.

lloeki · 2024-07-11T08:52:44Z

lib/datadog/appsec/contrib/graphql/gateway/multiplex.rb

+            private
+
+            def create_arguments_hash
+              require 'graphql/language/nodes'


Is there a reason why this require is dynamic instead of top-level?

Also, if it needs to be dynamic for some reason, it might be worth doing the require out of a hot code path.
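
For context, a small generic sketch of the difference (not code from this PR; it uses `json` from the standard library and hypothetical method names). A repeated `require` is effectively a no-op that returns `false`, but it is still evaluated on every call when it sits inside a method.

```ruby
# Top-level require: resolved once, when this file is loaded.
require 'json'

# Dynamic require inside a method: the call is evaluated on every invocation,
# even though the library is already loaded.
def parse_with_dynamic_require(payload)
  require 'json'
  JSON.parse(payload)
end

# With the top-level require above, the method body only does the work.
def parse(payload)
  JSON.parse(payload)
end

puts require('json')                              # false, already loaded
puts parse('{"a":1}') == parse_with_dynamic_require('{"a":1}') # true
```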

lloeki · 2024-07-11T08:58:13Z

spec/datadog/appsec/contrib/graphql/appsec_trace_spec.rb

+    bits = schema.execute('query test{ user(id: 1) { name } }')
+    expect(bits.to_h).to eq({ 'data' => { 'user' => { 'name' => 'Bits' } } })
+
+    caniche = schema.execute('query test{ user(id: 10) { name } }')


😆

Suggested change

caniche = schema.execute('query test{ user(id: 10) { name } }')

poodle = schema.execute('query test{ user(id: 10) { name } }')

(j/k)

lloeki · 2024-07-11T09:00:34Z

docs/GettingStarted.md

+| `schemas`                |                            | `Array`  | Array of `GraphQL::Schema` objects (that support class-based schema only) to trace. If you do not provide any, then tracing will applied to all the schemas.                                                    | `[]`             |
+| `with_unified_tracer`    |                            | `Bool`   | Enable to instrument with `UnifiedTrace` tracer, enabling support for API Catalog. `with_deprecated_tracer` has priority over this. Default is `false`, using `GraphQL::Tracing::DataDogTrace` (Added in v2.2)  | `false`          |
+| `with_deprecated_tracer` |                            | `Bool`   | Enable to instrument with deprecated `GraphQL::Tracing::DataDogTracing`. This has priority over `with_unified_tracer`. Default is `false`, using `GraphQL::Tracing::DataDogTrace`                               | `false`          |
+| `service_name`           |                            | `String` | Service name used for graphql instrumentation                                                                                                                                                                   | `'ruby-graphql'` |


'ruby-graphql'

The ruby- prefix seems odd. Shouldn't it simply be graphql? (Really I don't know)

I agree, but this was added 6 years ago. Maybe we should open a separate PR for it?

codecov-commenter · 2024-07-11T15:37:27Z

Codecov Report

Attention: Patch coverage is 96.81021% with 20 lines in your changes missing coverage. Please review.

Project coverage is 97.89%. Comparing base (ca006e9) to head (9295cad).
Report is 45 commits behind head on master.

Files	Patch %	Lines
...dog/tracing/contrib/graphql/support/application.rb	90.90%	7 Missing ⚠️
lib/datadog/appsec/contrib/graphql/appsec_trace.rb	83.33%	4 Missing ⚠️
...cing/contrib/graphql/support/application_schema.rb	90.90%	4 Missing ⚠️
...atadog/appsec/contrib/graphql/gateway/multiplex.rb	97.14%	1 Missing ⚠️
.../datadog/appsec/contrib/graphql/gateway/watcher.rb	97.29%	1 Missing ⚠️
lib/datadog/appsec/contrib/graphql/integration.rb	95.00%	1 Missing ⚠️
lib/datadog/appsec/contrib/graphql/patcher.rb	94.44%	1 Missing ⚠️
...ec/datadog/tracing/contrib/rails/support/models.rb	75.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3769      +/-   ##
==========================================
- Coverage   97.91%   97.89%   -0.02%     
==========================================
  Files        1243     1256      +13     
  Lines       74763    74983     +220     
  Branches     3608     3667      +59     
==========================================
+ Hits        73205    73408     +203     
- Misses       1558     1575      +17

vpellan added 30 commits July 9, 2024 14:32

Update docs

551f932

Add GraphQL AppSec integration architecture (without reactive engine)

7266a20

Add rbs files for GraphQL AppSec

1cc1d9a

Add throw/catch to stop execution of query

e52d022

Add reactive_engine for GraphQL resolvers

3c64cf2

Add graphql reactive engine sig

0873678

Fix undefined var in error_query

5708dfd

Fix typo for fetch_configuration

85a73e4

Add GraphQL custom JSON block response

785d122

Added iterative tree traversal to get arguments on execute_multiplex

10158e1

Add graphql.server.all_resolvers blocking

2028b55

Remove blocking on individual resolvers

b182075

Add reactive engine multiplex tests

336a0e8

Fixed typing

b419dbe

Factorize reactive engine specs

b9127c0

Extracted multiplex creation in separate helper

8fa09d0

Add multiplex gateway tests

04460fb

Added userByName in test schema

b222c21

Added basic GraphQL query & multiplex tests

372a99e

Added integration tests & rake task for ruby 3.2

86bd19f

Added GraphQL 2.3 appraisals & added Rails to GraphQL appraisals (for integration tests)

f977319

Added more integration tests

a28d284

Update jruby gemfiles

e6c83da

Removed code that belongs to rack

6fdd46a

Added more integration tests

4727301

Add custom JSON + fix blocking query test

246907e

Added multiplex integration test

58fcaa5

Added mutation testing

d6285df

Update libdatadog in ruby-3.3-graphql-2.3 gemfile

6e274dc

Add support for Ruby 3.4

5aa2ed6

vpellan added 3 commits July 9, 2024 14:32

Fix appraisals gemfile.lock

4798b11

Add type signature

16cf884

Removed redundant comment

6ed85c9

vpellan requested review from a team as code owners July 9, 2024 13:56

github-actions bot added the appsec and integrations labels Jul 9, 2024

vpellan self-assigned this Jul 9, 2024

github-advanced-security bot found potential problems Jul 9, 2024

spec/datadog/tracing/contrib/graphql/support/application.rb (dismissed)

maycmlee approved these changes Jul 9, 2024

vpellan requested a review from lloeki July 10, 2024 11:38

lloeki reviewed Jul 11, 2024

vpellan added 2 commits July 11, 2024 15:37

remove ext file

93ef101

moved dynamically loaded require to top level loaded

5f4f191

Separate GraphQL response generation from blocking detection

9295cad

lloeki approved these changes Jul 23, 2024

vpellan merged commit 26a5abf into master Jul 24, 2024
170 checks passed

vpellan deleted the vpellan/graphql-threat-detection branch July 24, 2024 08:34

github-actions bot added this to the 2.3.0 milestone Jul 24, 2024

TonyCTHsu mentioned this pull request Aug 22, 2024

Bump to version 2.3.0 #3861

Merged
