Skip to content

Conversation

@AHeise
Copy link
Contributor

@AHeise AHeise commented Nov 26, 2025

What is the purpose of the change

Advanced ScanProviders may return multiple transformations and hence overwriting just the last name is meaningless and potentially dangerous. It is fair to assume that the users of the advanced ScanProviders set names and uids on their own.

Brief change log

  • Removed overwriting of transformation name.

Verifying this change

Trivial fix (deletion of 2 lines).

Does this pull request potentially affect one of the following parts:

  • Dependencies (does it add or upgrade a dependency): (yes / no)
  • The public API, i.e., is any changed class annotated with @Public(Evolving): (yes / no)
  • The serializers: (yes / no / don't know)
  • The runtime per-record code paths (performance sensitive): (yes / no / don't know)
  • Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (yes / no / don't know)
  • The S3 file system connector: (yes / no / don't know)

Documentation

  • Does this pull request introduce a new feature? (yes / no)
  • If yes, how is the feature documented? (not applicable / docs / JavaDocs / not documented)

…oviders

Advanced ScanProviders may return multiple transformations and hence overwriting just the last name is meaningless and potentially dangerous. It is fair to assume that the users of the advanced ScanProviders set names and uids on their own.
@flinkbot
Copy link
Collaborator

flinkbot commented Nov 26, 2025

CI report:

Bot commands The @flinkbot bot supports the following commands:
  • @flinkbot run azure re-run the last Azure build

((DataStreamScanProvider) provider)
.produceDataStream(createProviderContext(config), env)
.getTransformation();
meta.fill(sourceTransform);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am curious - could we add a test that has the advanced ScanProviders "setting names and uids on their own". I think it would be useful, if this is expected practise.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree with David here. This PR needs proper testing in org.apache.flink.table.planner.plan.nodes.exec.TransformationsTest

@github-actions github-actions bot added the community-reviewed PR has been reviewed by the community. label Nov 26, 2025
Copy link
Contributor

@twalthr twalthr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the PR @AHeise. In general, I agree with the changes. But we need more tests. This is a serious bug that will need good release notes. Maybe we should also offer table option to restore the old behavior until sources are updated.

((DataStreamScanProvider) provider)
.produceDataStream(createProviderContext(config), env)
.getTransformation();
meta.fill(sourceTransform);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree with David here. This PR needs proper testing in org.apache.flink.table.planner.plan.nodes.exec.TransformationsTest

@AHeise AHeise marked this pull request as draft November 27, 2025 12:59
@twalthr
Copy link
Contributor

twalthr commented Nov 27, 2025

This issue is larger than originally thought. I assigned this ticket to me and will close this PR.

@twalthr twalthr closed this Nov 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

community-reviewed PR has been reviewed by the community.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants