Raise a Type Value error when result is not iterable #6242

h4rr21 · 2018-08-16T21:30:10Z

Check the result should be a list Type

Follow this checklist to help us incorporate your contribution quickly and easily:

Format the pull request title like [BEAM-XXX] Fixes bug in ApproximateQuantiles, where you replace BEAM-XXX with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.
If this contribution is large, please file an Apache Individual Contributor License Agreement.

It will help us expedite review of your Pull Request if you tag someone (e.g. @username) to look at it.

Post-Commit Tests Status (on master branch)

Lang	Apex	Dataflow	Flink	Gearpump	Samza	Spark
Go	---	---	---	---	---	---
Java
Python	---		---	---	---	---

boyuanzz · 2018-08-29T03:52:22Z

Hey @h4rr21 , could you please take care of this python precommit failures?

robertwb · 2018-08-29T08:53:27Z

Thanks for looking at this.

One thing we have to be very careful of here is that this code runs for every element in at every step for every pipeline, so it's quite performance critical. Running the benchmarks at https://github.com/apache/beam/blob/master/sdks/python/apache_beam/tools/map_fn_microbenchmark.py (you might need #6293) I see

Before

Fixed cost   0.6794422043757005
Per-element  7.113301970741965e-07
R^2          0.9817527457194359

After

Fixed cost   0.6880206489042804
Per-element  7.551892020485618e-07
R^2          0.9807901397710658

which is about a 5% regression. This may be worse if iter(results) is expensive.

Perhaps change the string test to "type(results) is str" and omit the iter(...) test as it already gives a more informative TypeError: 'T' object is not iterable in that case.

h4rr21 · 2018-08-29T13:39:25Z

@boyuanzz I was trying to open the details, but this url :

https://builds.apache.org/job/beam_PreCommit_Python_Commit/888/

doesn't work for me

h4rr21 · 2018-08-29T14:53:31Z

Hello @robertwb thanks for pointing this microbenchmark.

I'm trying to validate "results" as any itererable python object, I also now there is still under discussion if "string" should be a valid output...

so we just be looking at __iter__ attribute instead of "iter(results)

# validate results is not iterable or string
    if not hasattr(results,"__iter__")
        raise TypeError("This is not an iterable")

this shouldn't impact the performance

robertwb · 2018-08-29T15:24:35Z

At this level, even __hasattr__ can have a performance impact. But given a type error is raised for non-iterables on the next line, there's no reason to add an extra test here.

stale · 2018-11-24T16:54:26Z

This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@beam.apache.org list. Thank you for your contributions.

stale · 2018-12-01T17:51:44Z

This pull request has been closed due to lack of activity. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

Raise a Type Value error when result is not a list

c0a6be7

pabloem self-requested a review August 16, 2018 23:43

h4rr21 changed the title ~~Raise a Type Value error when result is not a list~~ Raise a Type Value error when result is not iterable Aug 17, 2018

Juan Carlos and others added 8 commits August 19, 2018 03:31

Raise a Type Value error when result is not a list or if it's a string

51be9c0

Merge branch 'BT-3530' of github.com:h4rr21/beam into BT-3530

6d70c75

Merge branch 'BT-3530' of github.com:h4rr21/beam into BT-3530

254bbb6

check pipeline output is not a string

e60c48f

Merge branch 'BT-3530' of github.com:h4rr21/beam into BT-3530

5df2b61

ammend commit error

ba22e8f

fix errors

df0a415

fix typo

451af8e

Omit extra iter check.

17c877f

stale bot added the stale label Nov 24, 2018

stale bot closed this Dec 1, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Raise a Type Value error when result is not iterable #6242

Raise a Type Value error when result is not iterable #6242

Uh oh!

h4rr21 commented Aug 16, 2018

Uh oh!

boyuanzz commented Aug 29, 2018

Uh oh!

robertwb commented Aug 29, 2018

Uh oh!

h4rr21 commented Aug 29, 2018

Uh oh!

h4rr21 commented Aug 29, 2018 •

edited

Loading

Uh oh!

robertwb commented Aug 29, 2018

Uh oh!

stale bot commented Nov 24, 2018

Uh oh!

stale bot commented Dec 1, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Raise a Type Value error when result is not iterable #6242

Raise a Type Value error when result is not iterable #6242

Uh oh!

Conversation

h4rr21 commented Aug 16, 2018

Post-Commit Tests Status (on master branch)

Uh oh!

boyuanzz commented Aug 29, 2018

Uh oh!

robertwb commented Aug 29, 2018

Uh oh!

h4rr21 commented Aug 29, 2018

Uh oh!

h4rr21 commented Aug 29, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

robertwb commented Aug 29, 2018

Uh oh!

stale bot commented Nov 24, 2018

Uh oh!

stale bot commented Dec 1, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

h4rr21 commented Aug 29, 2018 •

edited

Loading