New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BEAM-5145] Make PTransform names stable in Join/CoGroupByKey #6212
[BEAM-5145] Make PTransform names stable in Join/CoGroupByKey #6212
Conversation
78ac217
to
a2118e1
Compare
The failed test UnboundedEventSourceTest.resumeFromCheckpoint doesn't really seem related |
Run Java PreCommit |
Normally names aren't required if there is exactly one type of transform being used inside of Are you getting warnings that this isn't stable in some way? |
We are inner joining a PCollection on itself in a recursive manner (a configurable level deep), and then TestPipeline gives an error that the transforms do not have stable unique names. We use pipeline.getOptions().setStableUniqueNames(PipelineOptions.CheckEnabled.OFF), but it would be nice if we can remove that line. |
So names are based upon the names of the parents.
should generate Also, this change looks fine to me I'm just trying to better understand whether there is a class of bug through recursion that we are hitting which is a larger problem in general. |
Run Java PreCommit |
R: @lukecwik |
This is an example of the case we are seeing (in our case with Join and CoGroupByKey). If the setStableUniqueNames call is removed, the test breaks. When adding names in all PTransform.apply calls, the test runs fine again.
|
Run Java PreCommit |
Thanks for the example. Should the following apply lines also have a name?
|
If I change
to
then the pipeline runs fine, and has stable/unique names. This is what I wanted to do inside Join/CoGroupByKey |
Thanks for the details, I'm just trying to get our tests to pass as several are flaky. I'll merge once I can get a green run. |
In Join and CoGroupByKey there were some instances where apply was called without a name. This fixes this, so pipelines that uses these PTransforms have stable names.
Follow this checklist to help us incorporate your contribution quickly and easily:
[BEAM-XXX] Fixes bug in ApproximateQuantiles
, where you replaceBEAM-XXX
with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.It will help us expedite review of your Pull Request if you tag someone (e.g.
@username
) to look at it.Post-Commit Tests Status (on master branch)