fix: update pg instrumentation to handle non primitive argument #1146

ericmustin · 2022-03-12T23:38:56Z

Summary

Addresses #1139

The pg connection_adapter .execute method can be invoked with arguments that are not plain String instances, such as an Arel::Nodes::SqlLiteral.

While Arel::Nodes::SqlLiteral is a String subclass, our OTLP exporter doesn't gracefully handle subclasses of accepted attribute types (see here). Instead, it returns an error and drops the attribute when attempting to encode an invalid type.

I'm hesitant to fiddle with the portion of our OTLP exporter code that handles encoding, as we'd have to be careful not to introduce a regression.

Instead, since we know which instrumentation and which attribute are problematic, it's more straightforward to coerce that specific attribute to a String, which is what this PR does (it calls .to_s defensively).
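To illustrate the subclass problem and the .to_s coercion, here is a minimal sketch. SqlLiteralLike and strictly_a_string? are hypothetical stand-ins, not Arel's class or the exporter's actual encoding code; the only assumption is that the encoder accepts an exact String and rejects subclasses.

```ruby
# Hypothetical stand-in for Arel::Nodes::SqlLiteral, which subclasses String.
class SqlLiteralLike < String; end

# Hypothetical strict check: accepts only an exact String, not a subclass.
# This is an assumption for illustration, not the exporter's real code.
def strictly_a_string?(value)
  value.instance_of?(String)
end

literal = SqlLiteralLike.new('SELECT 1')

literal.is_a?(String)             # true: it is a String subclass
strictly_a_string?(literal)       # false: the exact class differs
strictly_a_string?(literal.to_s)  # true: String#to_s returns a plain String
```

String#to_s returns the receiver itself for a plain String and a plain String copy for a subclass instance, so the coercion costs nothing in the common case.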

ahayworth

I had a question re: the defensive programming, but I don't think it's anything worth blocking the PR over.

This is a good fix - and thank you to @danielmbarlow for finding the issue! As an avid Arel.sql fan myself, we should have been testing for this from the beginning. 🤦 😆

instrumentation/pg/lib/opentelemetry/instrumentation/pg/patches/connection.rb

plantfansam

👌 lovely! Had an inline nit that you can take or leave as you wish 😄

plantfansam · 2022-03-14T21:49:18Z

instrumentation/pg/lib/opentelemetry/instrumentation/pg/patches/connection.rb

@@ -84,7 +84,7 @@ def span_attrs(kind, *args) # rubocop:disable Metrics/AbcSize
            end

            attrs = { 'db.operation' => validated_operation(operation), 'db.postgresql.prepared_statement_name' => statement_name }
-            attrs['db.statement'] = sql unless config[:db_statement] == :omit
+            attrs['db.statement'] = sql.to_s if config[:db_statement] != :omit && sql&.to_s


Nit, but: thoughts on changing this to attrs['db.statement'] = sql&.to_s if config[:db_statement] != :omit && !sql.nil? or just attrs['db.statement'] = sql&.to_s if config[:db_statement] != :omit?

I've updated this to attrs['db.statement'] = sql&.to_s unless config[:db_statement] == :omit

I think this is as concise as I can get it; it handles all the edge cases and doesn't anger rubocop.
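For comparison, here is a small sketch of how the variants in this thread behave when sql is nil. The three helper methods are hypothetical wrappers written for illustration, and :include is used only as an example of a setting other than :omit.

```ruby
# Hypothetical helpers wrapping the variants discussed in this thread.

# Variant from the first revision of the diff: skip the key when sql is nil.
def guarded_to_s(sql, config)
  attrs = {}
  attrs['db.statement'] = sql.to_s if config[:db_statement] != :omit && sql&.to_s
  attrs
end

# Variant settled on above: the key is set, but its value may be nil.
def safe_navigation(sql, config)
  attrs = {}
  attrs['db.statement'] = sql&.to_s unless config[:db_statement] == :omit
  attrs
end

# Unguarded variant: nil.to_s turns a missing statement into "".
def plain_to_s(sql, config)
  attrs = {}
  attrs['db.statement'] = sql.to_s unless config[:db_statement] == :omit
  attrs
end

config = { db_statement: :include } # any value other than :omit

guarded_to_s(nil, config)     # {}
safe_navigation(nil, config)  # {"db.statement"=>nil}
plain_to_s(nil, config)       # {"db.statement"=>""}
```

The safe-navigation form leaves a nil value in the hash, so it relies on nil values being dropped before the attributes reach the span.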

fbogsany · 2022-03-17T17:34:56Z

instrumentation/pg/lib/opentelemetry/instrumentation/pg/patches/connection.rb

@@ -84,7 +84,7 @@ def span_attrs(kind, *args) # rubocop:disable Metrics/AbcSize
            end

            attrs = { 'db.operation' => validated_operation(operation), 'db.postgresql.prepared_statement_name' => statement_name }
-            attrs['db.statement'] = sql unless config[:db_statement] == :omit
+            attrs['db.statement'] = sql&.to_s unless config[:db_statement] == :omit


Is there a circumstance where sql would be nil? That would have been problematic previously, since attribute values have to be non-nil, and validation would have dropped it and logged an error.

Note: I realize this has been beaten to death, but I really don't think this level of defensiveness is warranted here.

This test case

opentelemetry-ruby/instrumentation/pg/test/opentelemetry/instrumentation/pg/instrumentation_test.rb

Lines 137 to 138 in 06b36a6

# We should have evicted the statement from the cache

_(last_span.attributes['db.statement']).must_be_nil

Without the defensiveness, it seems possible for lru_cache to return nil, which then gets converted to "" by .to_s. This failed test case is what prompted the defensiveness in the first place: https://github.com/open-telemetry/opentelemetry-ruby/runs/5524602649?check_suite_focus=true

I hadn't really looked into the lru cache details or the instrumentation implementation, but it does seem possible, in some cases, for sql to be nil. We are currently testing for that case for some reason, so we should have a nil check.

It feels like that's more a case of the cache not working correctly. If the lru_cache returns nil, we should handle that cache miss and update the cache rather than papering over the problem with a &..

The lru_cache is a little unusual here - the expectation is that we write into the cache in the PREPARE case and read from it in the EXECUTE case. I think we should handle the sql.nil? case on line 81, where we read from the cache. By the time we get here, sql should be non-nil.

I think the idea (@ahayworth, correct me if I'm wrong) is that this specific case is for handling prepared statements. Reviewing the test here, with an lru cache of size 50:

user prepares 51 statements

each time prepare is called, we perform the obfuscation and cache <statement-name>:<obfuscated-prepared-statement> k:v pair.

The 51st call evicts the least recently used <statement-name>:<obfuscated-prepared-statement> k:v pair

when the user later attempts to execute that already-evicted <statement-name>:<obfuscated-prepared-statement> k:v pair via .execute(<statement-name>), it causes a cache miss and we can't update the cache because we no longer have access to the <obfuscated-prepared-statement> to set as a value, so the lru_cache returns nil.

I don't know what the improvement would be here, this seems like an expected behavior, albeit one with tradeoffs.
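The eviction scenario described above can be sketched as follows. TinyLruCache is a hypothetical minimal implementation built on Ruby's insertion-ordered Hash, not the instrumentation's actual cache class, and the statement names and SQL strings are made up.

```ruby
# Hypothetical minimal LRU cache; not the instrumentation's actual class.
class TinyLruCache
  def initialize(size)
    @limit = size
    @store = {}
  end

  # Reading re-inserts the entry so it becomes the most recently used.
  # A miss returns nil.
  def [](key)
    return nil unless @store.key?(key)

    value = @store.delete(key)
    @store[key] = value
  end

  def []=(key, value)
    @store.delete(key)
    @store[key] = value
    # Ruby hashes keep insertion order, so the first pair is the least
    # recently used one; drop it once the limit is exceeded.
    @store.shift if @store.size > @limit
  end
end

cache = TinyLruCache.new(50)

# PREPARE: 51 statements go into a cache that holds 50.
51.times { |i| cache["stmt_#{i}"] = "SELECT #{i}" }

# EXECUTE: only the statement name is available at this point.
cache['stmt_0']   # nil: evicted by the 51st prepare
cache['stmt_50']  # "SELECT 50"
```

On the miss there is nothing to repopulate the cache with, since execute receives the statement name but not the SQL text.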

🤦 we handle nil attributes on the next line. This is complicated and confusing IMO. I think we should .to_s the result from obfuscate_sql, either in that method or on lines 72 and 77. Then we always cache strings.
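A rough sketch of that suggestion, normalizing at the cache write so every read yields either a plain String or nil. The obfuscate_sql here is a hypothetical pass-through stand-in, SqlLiteralLike is a hypothetical String subclass, a plain Hash stands in for the LRU cache, and Hash#compact stands in for whatever the next line does to drop nil attributes.

```ruby
# Hypothetical String subclass standing in for Arel::Nodes::SqlLiteral.
class SqlLiteralLike < String; end

# Hypothetical pass-through stand-in; the real obfuscate_sql is not shown here.
def obfuscate_sql(sql)
  sql
end

cache = {} # a plain Hash standing in for the LRU cache

# Write side (PREPARE): normalize once so the cache only holds plain Strings.
cache['stmt_a'] = obfuscate_sql(SqlLiteralLike.new('SELECT 1')).to_s

# Read side (EXECUTE): a hit is a plain String, a miss is nil.
# compact is an assumed stand-in for the existing nil handling.
hit  = { 'db.statement' => cache['stmt_a'] }.compact
miss = { 'db.statement' => cache['stmt_b'] }.compact
```

With this shape, a cache miss stays nil rather than becoming an empty string, and no &. is needed at the point where the attribute is assigned.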

@ericmustin your understanding of the LRU cache is correct, both in functionality and in purpose.

plantfansam

LGTM

fix: update pg instrumentation to handle non primitive argument

1309002

ericmustin requested review from fbogsany, mwear, robertlaurin, dazuma, arielvalentin and ahayworth as code owners March 12, 2022 23:38

ericmustin added 2 commits March 12, 2022 19:50

chore: fix tests

ca3e457

chore: linting

ec4e58c

ahayworth approved these changes Mar 14, 2022

ahayworth linked an issue Mar 14, 2022 that may be closed by this pull request

PG Instrumentation: Invalid span attribute value type Arel::Nodes::SqlLiteral #1139

Closed

arielvalentin reviewed Mar 14, 2022

Update instrumentation/pg/lib/opentelemetry/instrumentation/pg/patche…

b0bfe61

…s/connection.rb Co-authored-by: Ariel Valentin <arielvalentin@users.noreply.github.com>

plantfansam reviewed Mar 14, 2022

ericmustin added 2 commits March 14, 2022 18:11

chore: the most middleground change on pg instrumentation nobody aske…

74d4958

…d for

chore: rubocops

1d1e0b0

fbogsany reviewed Mar 17, 2022

chore: make sure we call to_s on obufscate_sql return

d386b7f

fbogsany approved these changes Mar 18, 2022

plantfansam approved these changes Mar 23, 2022

Merge branch 'main' into update_ar_instrumentation

f40c066

robertlaurin merged commit 8eac4a1 into main Mar 29, 2022

robertlaurin deleted the update_ar_instrumentation branch March 29, 2022 14:36

plantfansam mentioned this pull request Apr 7, 2022

Only allow certain types of Numeric values as attribute values. #1173

Merged
