Skip to content

Conversation

@cookiedough77
Copy link
Contributor

@cookiedough77 cookiedough77 commented Oct 21, 2025

What changes were proposed in this pull request?

This PR introduces a new protobuf message, PipelineAnalysisContext, in ect/common/src/main/protobuf/spark/connect/pipelines.proto.

Why are the changes needed?

Special handling is needed for spark.sql in certain contexts. This proto provides a foundation for passing such context in future.

Does this PR introduce any user-facing change?

No, it only adds an internal protobuf message.

How was this patch tested?

Verified through protobuf compilation and existing test coverage.

Was this patch authored or co-authored using generative AI tooling?

No.

@cookiedough77 cookiedough77 changed the title add analysis context proto Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 21, 2025
@cookiedough77 cookiedough77 changed the title Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution [LDP Spark Connect Frontend] [SC-208390] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 21, 2025
@sryza sryza changed the title [LDP Spark Connect Frontend] [SC-208390] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution [SDP] [SC-208390] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 22, 2025
@sryza sryza changed the title [SDP] [SC-208390] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution [SC-208390][SDP] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 22, 2025
@cookiedough77 cookiedough77 marked this pull request as draft October 22, 2025 20:09
@cookiedough77 cookiedough77 marked this pull request as ready for review October 22, 2025 20:23
@cookiedough77 cookiedough77 changed the title [SC-208390][SDP] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 22, 2025
optional string dataflow_graph_id = 1;

// Defines the path assigned at runtime when the pipeline initializes
optional string definition_path = 2;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We determined that we need a corresponding field on SourceCodeLocation, right? Also, perhaps we should give this consistent naming with the file_name field there? Perhaps outer_file_name?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Discussed to be definition_path. added to SourceCodeLocation.

Copy link
Contributor

@sryza sryza left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@sryza sryza closed this in c707f59 Oct 23, 2025
@sryza sryza changed the title Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution [SDP] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 23, 2025
@sryza sryza changed the title [SDP] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution [SPARK-52762][SDP] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution Oct 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants