[BEAM-2966] Allow subclasses of tuple, list, and dict as pvaluish inputs/outputs. by robertwb · Pull Request #3831 · apache/beam

robertwb · 2017-09-09T00:53:35Z

Follow this checklist to help us incorporate your contribution quickly and easily:

Make sure there is a JIRA issue filed for the change (usually before you start working on it). Trivial changes like typos do not require a JIRA issue. Your pull request should address just this issue, without pulling in other changes.
Each commit in the pull request should have a meaningful subject line and body.
Format the pull request title like [BEAM-XXX] Fixes bug in ApproximateQuantiles, where you replace BEAM-XXX with the appropriate JIRA issue.
Write a pull request description that is detailed enough to understand what the pull request does, how, and why.
Run mvn clean verify to make sure basic checks pass. A more thorough check will be performed on your pull request automatically.
If this contribution is large, please file an Apache Individual Contributor License Agreement.

robertwb · 2017-09-09T00:55:32Z

R: @KesterTong

robertwb · 2017-09-11T20:40:26Z

jenkins: retest this please

robertwb · 2017-09-13T22:38:00Z

jenkins: retest this please

coveralls · 2017-09-13T23:53:58Z

Coverage decreased (-0.2%) to 69.509% when pulling 2c98ab8 on robertwb:pvalueish into e9d3a4a on apache:master.

coveralls · 2017-09-16T00:11:17Z

Coverage decreased (-0.2%) to 69.523% when pulling 541b4b3 on robertwb:pvalueish into e9d3a4a on apache:master.

KesterTong · 2017-09-16T18:17:44Z

sdks/python/apache_beam/transforms/ptransform.py

-    return {key: self.visit(value, *args) for (key, value) in node.items()}
+  def visit_nested(self, node, *args):
+    if isinstance(node, (tuple, list)):
+      # namedtuples require unpacked arguments in their constructor,


It's not clear that this supports subclasses of a namedtuple. Such subclasses will inherit the _make class method from the namedtuple, but the _make method will produce the namedtuple not the subclass. Instead, could we test for the existence of _make (as a test of whether we are dealing with a subclass of namedtuple or a direct subclass of tuple, which we assume has the usual tuple constructor) but still invoke the constructor node.__class__ when dealing with a subclass of a namedtuple? i.e.

if isinstance(node, tuple) and hasattr(node.__class__, '_make'):
  # node is an instance of a subclass of a namedtuple.
  return node.__class__(*[self.visit(x, *args) for x in node])
elif isinstance(node, (tuple, list)):
  ...
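For reference, the suggestion above can be tried outside of Beam with a small standalone sketch. The names `Point`, `LabeledPoint`, and `rebuild` are hypothetical and are not part of `ptransform.py`; the helper only illustrates the idea of detecting `_make` and then calling the node's own class with unpacked fields, not the code that was eventually merged.

```python
import collections

# Hypothetical types used only for this illustration.
Point = collections.namedtuple('Point', ['x', 'y'])


class LabeledPoint(Point):
  """A subclass of a namedtuple; it inherits _make from Point."""
  __slots__ = ()


def rebuild(node, fn):
  """Applies fn to every leaf, preserving container types (hypothetical helper)."""
  if isinstance(node, tuple) and hasattr(type(node), '_make'):
    # namedtuple (or a subclass of one): the constructor takes unpacked fields.
    return type(node)(*[rebuild(x, fn) for x in node])
  elif isinstance(node, (tuple, list)):
    # Assumes plain tuple/list subclasses keep the usual one-iterable constructor.
    return type(node)([rebuild(x, fn) for x in node])
  elif isinstance(node, dict):
    # Assumes the dict subclass accepts an iterable of (key, value) pairs.
    return type(node)((k, rebuild(v, fn)) for k, v in node.items())
  else:
    return fn(node)


result = rebuild(LabeledPoint(1, [2, 3]), lambda v: v * 10)
nested = rebuild(collections.OrderedDict([('a', (1, 2))]), lambda v: v + 1)
```

Here `result` is a `LabeledPoint` rather than a bare `Point`, and `nested` stays an `OrderedDict`. One caveat worth checking against the Python version in use: in CPython, `_make` is a classmethod that builds the tuple through `cls`, so calling it on a subclass also yields the subclass, but it bypasses any `__new__` the subclass defines, whereas calling the constructor runs it. Subclasses whose constructors take other arguments (for example `collections.defaultdict`) would need separate handling.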

KesterTong

Regarding "Support multiple materializations of the same pvalue.", can you clarify what the new behavior of the cache is? It seems to me that the cache no longer really functions as a cache, because we never decrement the refcount. If so, that's not a problem; I just want to understand the new behavior.

KesterTong · 2017-09-16T18:35:12Z

Is there a JIRA issue for this PR? If not, I could open one. It would be helpful as a reference for tf.Transform release notes etc., since tf.Transform will rely on older versions of Beam, which will hit the bug this PR fixes.

robertwb · 2017-09-18T17:47:03Z

PTAL

KesterTong · 2017-09-18T19:54:01Z

Thanks. I think you missed my question above regarding the commit titled "Support multiple materializations of the same pvalue."?

robertwb · 2017-09-18T20:12:02Z

Sorry, I added comments but did not address it directly. The cache is solely an implementation detail that is thrown away as soon as the pipeline is gc'd (and, hopefully, will simply go away completely when we clean things up). In particular, the ref-counting is only used during pipeline execution.

robertwb · 2017-09-19T17:59:36Z

retest this please

coveralls · 2017-09-19T19:34:10Z

Changes Unknown when pulling cbe8dd8 on robertwb:pvalueish into apache:master.

Allow subclasses of tuple, list, and dict as pvaluish inputs/outputs.

ff97905

lint

2c98ab8

KesterTong reviewed Sep 16, 2017

robertwb added 2 commits September 18, 2017 10:44

Support multiple materializations of the same pvalue.

4c6b672

Address comments.

cbe8dd8

robertwb force-pushed the pvalueish branch from 541b4b3 to cbe8dd8 Compare September 18, 2017 17:45

robertwb changed the title from "Allow subclasses of tuple, list, and dict as pvaluish inputs/outputs." to "[BEAM-2966] Allow subclasses of tuple, list, and dict as pvaluish inputs/outputs." Sep 18, 2017

KesterTong approved these changes Sep 18, 2017

asfgit closed this in cfbdb61 Sep 19, 2017

3 participants