Skip to content

feat(bqjdbc): extend OpenTelemetry instrumentation for metadata and pagination#12918

Merged
keshavdandeva merged 7 commits intojdbc/feature-branch-otelfrom
jdbc/add-otel-to-metadata
Apr 28, 2026
Merged

feat(bqjdbc): extend OpenTelemetry instrumentation for metadata and pagination#12918
keshavdandeva merged 7 commits intojdbc/feature-branch-otelfrom
jdbc/add-otel-to-metadata

Conversation

@keshavdandeva
Copy link
Copy Markdown
Contributor

@keshavdandeva keshavdandeva commented Apr 24, 2026

b/491245568

Key Changes

Core Instrumentation Logic

  • Database Metadata Tracing: Added OTel spans to key methods in BigQueryDatabaseMetaData.java (getCatalogs, getSchemas, getTables, getColumns) to capture underlying API calls.
  • Pagination Span Links: Captured the parent span context at the start of fetchNextPages in BigQueryStatement.java and linked background pagination spans back to it, avoiding timeline anomalies.
  • Cross-Thread Context Propagation: Stored the SpanContext in BigQueryBaseResultSet.java at creation time and made it current during next() in BigQueryJsonResultSet.java and BigQueryArrowResultSet.java to survive thread hops.
  • Tracer Reuse: Extracted getSafeTracer to BigQueryJdbcOpenTelemetry.java as a static utility to ensure consistent fallback behavior across the driver.
  • Lambda Extraction: Extracted the large lambda function in populateArrowBufferedQueue in BigQueryStatement.java to its own private method processArrowStream to improve readability and maintainability.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request integrates OpenTelemetry tracing into the BigQuery JDBC driver, focusing on ResultSet iteration and DatabaseMetaData operations. Key changes include capturing span contexts during result set initialization and propagating them during next() calls, as well as adding tracing to metadata methods like getTables, getColumns, and getSchemas. Review feedback identifies significant performance overhead from wrapping high-frequency next() methods in telemetry scopes and points out that spans for asynchronous metadata operations end prematurely, leading to inaccurate duration metrics and fragmented traces.

@keshavdandeva
Copy link
Copy Markdown
Contributor Author

/gemini review

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request integrates OpenTelemetry tracing across the BigQuery JDBC driver, specifically targeting metadata operations and result set processing. Key changes include wrapping blocking operations in appropriate tracing scopes, propagating span contexts to background threads, and standardizing tracer acquisition via a new utility method. Feedback focuses on refining the tracing implementation, such as correctly handling background span parents to avoid timeline anomalies, ensuring span statuses are updated on errors, and replacing redundant null checks on SpanContext with validity checks.

@keshavdandeva keshavdandeva marked this pull request as ready for review April 24, 2026 16:25
@keshavdandeva keshavdandeva requested review from a team as code owners April 24, 2026 16:25
try {
// Advance the cursor,Potentially blocking operation
this.cursor = this.buffer.take();
try (Scope scope = Context.current().with(Span.wrap(originalSpanContext)).makeCurrent()) {
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as for Arrow

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment from Gemini about this as well:

Similar to the Arrow implementation, wrapping the entire next() method in a Scope creates substantial performance overhead for every row processed. Since next() is a hot path, the cost of managing the OpenTelemetry context for every iteration can lead to a noticeable regression in throughput for large result sets. It is recommended to limit the scope of context propagation to the parts of the method that actually perform work requiring context, such as the blocking buffer.take() call. Ensure the scope is closed in an exception-safe manner to prevent leaks.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can expand to the try block (162-180), there is not much else happening either way

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually in this version, this.cursor = this.buffer.take() happens on every call to next(), so it'd already create new span each time. Should we check performance & see if it is needed? All we do here is taking an item from the queue.. So maybe no real tracing in next function needed. (in arrow tracing makes sense because we're reading next block in the same thread.. So maybe we can squeeze more from Arrow by reading in the background thread)

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I guess you are right. Since this.buffer.take() is called for every row in and it doesn't trigger any new spans or heavy processing (unlike Arrow where we deserialize a whole batch), maintaining the scope here only adds overhead without any real benefit.
I will remove it for now, and create a task to investigate performance with and without to finalise it later

@keshavdandeva keshavdandeva requested a review from logachev April 27, 2026 17:52
@logachev
Copy link
Copy Markdown
Contributor

Please use bqjdbc in the title, so it will be easier to find PRs in the future (e.g. spanner has their own jdbc in that repo)

@keshavdandeva keshavdandeva changed the title feat(jdbc): extend OpenTelemetry instrumentation for metadata and pagination feat(bqjdbc): extend OpenTelemetry instrumentation for metadata and pagination Apr 28, 2026
@keshavdandeva keshavdandeva merged commit 57a0207 into jdbc/feature-branch-otel Apr 28, 2026
96 checks passed
@keshavdandeva keshavdandeva deleted the jdbc/add-otel-to-metadata branch April 28, 2026 13:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants