Add take_while experimental dataset op #24645

Squadrick · 2018-12-31T20:02:48Z

Addresses #24105

Squadrick · 2018-12-31T20:07:59Z

Still need to add test cases, but any changes required in the current commits?

@jsimsa @mrry Most other dataset ops that take a function as a parameter check for short circuits using ComputeShortCircuitIndices. Will I need to do that here as well?

Also, right now take_while is state-less, similar to map. Do I need to also build the scan equivalent for take_while, i.e. take_while with a state?
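For context, the stateless behaviour described here can be modelled in a few lines of plain Python. This is only a sketch of the semantics, not the tf.data kernel added in this PR; the generator name `take_while` and the sample data are illustrative. The contrast with a filter is that iteration stops at the first element that fails the predicate, so later elements that would pass are never produced.

```python
def take_while(elements, predicate):
    """Yield elements until the predicate first returns False, then stop."""
    for element in elements:
        if not predicate(element):
            return  # stop iterating entirely; later elements are never seen
        yield element


data = [1, 2, 3, 10, 4, 5]
below_five = lambda x: x < 5

taken = list(take_while(data, below_five))
filtered = [x for x in data if below_five(x)]

print(taken)     # [1, 2, 3]
print(filtered)  # [1, 2, 3, 4]
```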

jsimsa

Overall, this looks promising. I left (mostly minor) comments throughout the PR.

As for your questions, 1) implementing the short-circuit path is optional and 2) take_while should be stateless. If a user needs a stateful version, they can build that out of existing transformations (scan + take_while + map).
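The scan + take_while + map composition mentioned above can be sketched in plain Python. This is a model of the idea under stated assumptions, not tf.data code: `scan` and `take_while_budget` are hypothetical helpers, and `itertools.takewhile` stands in for the stateless op. The state (a running total) is threaded through by the scan step, the predicate only looks at the emitted pair, and a final map drops the state again.

```python
from itertools import takewhile


def scan(elements, initial_state, scan_fn):
    """Thread a state through the elements; scan_fn returns (new_state, output)."""
    state = initial_state
    for element in elements:
        state, output = scan_fn(state, element)
        yield output


def take_while_budget(elements, budget):
    """Keep elements while their running total stays within the budget."""
    def step(total, element):
        new_total = total + element
        # Emit the state next to the element so a stateless predicate can see it.
        return new_total, (new_total, element)

    with_state = scan(elements, 0, step)                                    # scan
    within_budget = takewhile(lambda pair: pair[0] <= budget, with_state)   # take_while
    return [element for _, element in within_budget]                        # map


result = take_while_budget([3, 4, 5, 6, 1], budget=12)
print(result)  # [3, 4, 5]
```

The predicate itself carries no state, which is why the op can stay stateless while still supporting this kind of use.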

tensorflow/core/kernels/data/experimental/take_while_dataset_op.cc

tensorflow/python/data/experimental/ops/take_while_ops.py

Squadrick · 2019-01-01T16:59:31Z

I've made the required changes. Adding tests in the next couple of hours.

Squadrick · 2019-01-02T14:00:48Z

@jsimsa I've added tests and made the changes mentioned. Take a look, and let me know if further changes are required.

jsimsa

Thanks @Squadrick. Left a couple more small comments. I am going to trigger presubmit tests which might require additional fixes.

tensorflow/core/kernels/data/experimental/take_while_dataset_op.cc

...python/data/experimental/kernel_tests/serialization/take_while_dataset_serialization_test.py

tensorflow/python/data/experimental/kernel_tests/take_while_test.py

Squadrick · 2019-01-04T10:13:33Z

@jsimsa Made the changes and fixed whatever was failing in the tests. I wasn't sure what the api_def should look like for this, so I copied and edited api_def_FilterDataset. Can you take a look and tell me whether it's right?

Also, what's the difference between base_api and python_api?

jsimsa

A couple more minor comments.

tensorflow/core/kernels/data/experimental/take_while_dataset_op.cc

tensorflow/python/data/experimental/kernel_tests/take_while_test.py

Squadrick · 2019-01-05T12:08:57Z

@jsimsa Made the changes you asked and fixed whatever was causing the CI to fail. I think it should be good to go unless I've overlooked something.

jsimsa · 2019-01-10T18:17:55Z

@Squadrick One of the test failures, tensorflow/tools/api/tests:api_compatibility_test, is in fact related.

Can you run the following:

    $ bazel build tensorflow/tools/api/tests:api_compatibility_test
    $ bazel-bin/tensorflow/tools/api/tests/api_compatibility_test \
          --update_goldens True

and then update the PR?

Squadrick · 2019-01-11T17:23:15Z

@jsimsa I pulled master and ran the commands. This is what my git status looks like now.

    On branch take-while
    Changes not staged for commit:
      (use "git add/rm <file>..." to update what will be committed)
      (use "git checkout -- <file>..." to discard changes in working directory)

        modified:   tensorflow/tools/api/golden/v2/tensorflow.data.experimental.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-baseline-classifier.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-baseline-estimator.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-baseline-regressor.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-boosted-trees-classifier.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-boosted-trees-regressor.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-checkpoint-saver-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-checkpoint-saver-listener.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-d-n-n-classifier.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-d-n-n-estimator.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-d-n-n-linear-combined-classifier.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-d-n-n-linear-combined-estimator.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-d-n-n-linear-combined-regressor.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-d-n-n-regressor.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-estimator.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-feed-fn-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-final-ops-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-global-step-waiter-hook.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-linear-classifier.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-linear-estimator.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-linear-regressor.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-logging-tensor-hook.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.-mode-keys.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-nan-loss-during-training-error.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-nan-tensor-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-profiler-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-second-or-step-timer.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-session-run-args.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-session-run-context.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-session-run-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-session-run-values.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-step-counter-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-stop-at-step-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.-summary-saver-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.experimental.-in-memory-evaluator-hook.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.experimental.-linear-s-d-c-a.pbtxt
        deleted:    tensorflow/tools/api/golden/v2/tensorflow.estimator.experimental.pbtxt
        modified:   tensorflow/tools/api/golden/v2/tensorflow.estimator.pbtxt

    Untracked files:
      (use "git add <file>..." to include in what will be committed)

        tensorflow/tools/api/golden/v2/tensorflow.estimator.inputs.pbtxt

    no changes added to commit (use "git add" and/or "git commit -a")

I also got this warning:

    WARNING:tensorflow:Golden file update requested!
    All test failures have been skipped, see the logs for detected diffs.
    This test is now going to write new golden files.
    Make sure to package the updates together with your change.

    You will need an explicit API approval. This may take longer than a normal
    review.

And this seems to be the diff:

    +   member_method {
    +     name: "take_while"
    +     argspec: "args=[\'predicate\'], varargs=None, keywords=None, defaults=None"
    +   }

I have no idea how to proceed. How do I go about getting API approval?

jsimsa · 2019-01-11T17:34:45Z

The only pbtxt file you should add to your PR is tensorflow/tools/api/golden/v2/tensorflow.data.experimental.pbtxt. Ignore all the tensorflow/tools/api/golden/v2/tensorflow.estimator.* files (I am not sure why they showed up).

I will take care of the API approval (it happens during the internal review of the PR).

jsimsa

The internal review surfaced a couple of issues.

tensorflow/tools/api/golden/v2/tensorflow.data.experimental.pbtxt

tensorflow/python/data/experimental/kernel_tests/serialization/BUILD

Squadrick · 2019-01-16T15:21:08Z

@jsimsa Running the update script didn't change any of the v1 scripts.

In tensorflow/api_template.__init__.py, line 56 calls _compat.enable_v2_behavior(). This sets _force_enable in tensorflow/python/tf2.py to True, so every call to tf2.enabled() returns True, and setting TF2_BEHAVIOR=0 has no effect on which TensorFlow version runs. Because tensorflow/api_template.__init__.py is one of the srcs for api_compatibility_test, I had to manually remove @test_util.run_v1_only('b/120545219') on lines 317 and 336 of api_compatibility_test.py to get the v1 golden API files to update.

Is there something I'm overlooking?
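The interaction described above can be modelled with a small stand-alone sketch. It is a simplified reconstruction from the description in this comment, not a copy of tensorflow/python/tf2.py; in particular, the way the `TF2_BEHAVIOR` environment variable is parsed here is an assumption. The point it illustrates is that once the force flag is set, the environment variable is no longer consulted.

```python
import os

# Module-level override flag, modelled on the _force_enable flag mentioned above.
_force_enable = False


def enable_v2_behavior():
    """Force v2 behaviour on, regardless of the environment."""
    global _force_enable
    _force_enable = True


def enabled():
    """Return True if v2 behaviour is forced or requested via TF2_BEHAVIOR."""
    # Assumption: any value other than "0" counts as "on".
    return _force_enable or os.environ.get("TF2_BEHAVIOR", "0") != "0"


os.environ["TF2_BEHAVIOR"] = "0"
before = enabled()      # the environment variable is honoured here
enable_v2_behavior()    # what the API template's __init__ is described as doing
after = enabled()       # the environment variable is now ignored

print(before, after)  # False True
```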

Squadrick · 2019-01-16T15:42:07Z

@jsimsa Pushed the requested changes.

jsimsa · 2019-01-16T17:23:44Z

@Squadrick thanks, I will resume the internal review.

Add take_while experimental dataset op (tests pending)

ceb64c1

googlebot added the cla: yes label Dec 31, 2018

jsimsa self-requested a review December 31, 2018 20:30

jsimsa requested changes Dec 31, 2018

ymodak self-assigned this Jan 1, 2019

ymodak added the awaiting review Pull request awaiting review label Jan 1, 2019

ymodak added this to Assigned Reviewer in PR Queue via automation Jan 1, 2019

Make required changes

5356b68

- Remove preserve_cardinality
- Update license year
- Propagate predicate's error to the caller
- Inline _transformation_name

Add take_while serialization test

4e07d4a

ymodak added the size:L CL Change Size: Large label Jan 2, 2019

tensorflowbutler removed the awaiting review Pull request awaiting review label Jan 2, 2019

Add take_while tests

ab7465e

jsimsa requested changes Jan 2, 2019

PR Queue automation moved this from Assigned Reviewer to Reviewer Requested Changes Jan 2, 2019

jsimsa added the kokoro:run label Jan 2, 2019

kokoro-team removed the kokoro:run label Jan 2, 2019

Squadrick added 3 commits January 4, 2019 15:19

Add api_def for ExperimentalTakeWhileDataset

f76dc80

Format files according to TF style guide and minor fixes

2456898

Use parameterized test case

28edbb8

Add ShortCircuit optimization for take_while and corresponding test

7a830da

jsimsa requested changes Jan 4, 2019

jsimsa added the kokoro:run label Jan 4, 2019

kokoro-team removed the kokoro:run label Jan 4, 2019

Minor bug fixes

186f573

- `take_while` ShortCircuit tests are parameterized
- Fix errors in `BUILD` files using `buildifier`
- LoopIteratorPredicate takes `vector<Tensor>&`

PR Queue automation moved this from Reviewer Requested Changes to Approved by Reviewer Jan 7, 2019

ymodak removed the kokoro:run label Jan 10, 2019

Merge branch 'master' of https://github.com/squadrick/tensorflow into…

0d37687

… take-while

Add take_api golden API to pbtxt

93a4a83

Squadrick dismissed jsimsa’s stale review via 93a4a83 January 11, 2019 18:03

PR Queue automation moved this from Approved by Reviewer to Reviewer Requested Changes Jan 11, 2019

PR Queue automation moved this from Reviewer Requested Changes to Approved by Reviewer Jan 11, 2019

jsimsa approved these changes Jan 11, 2019

jsimsa added the kokoro:run label Jan 11, 2019

kokoro-team removed kokoro:run labels Jan 11, 2019

ymodak added the ready to pull PR ready for merge process label Jan 12, 2019

Add take_while golden API to pbtxt

4eb7eaa

PR Queue automation moved this from Approved by Reviewer to Reviewer Requested Changes Jan 14, 2019

jsimsa requested changes Jan 14, 2019

Merge branch 'master' of https://github.com/tensorflow/tensorflow int…

3c6b5d7

…o take-while

Squadrick added 4 commits January 16, 2019 20:53

Add take_while to v1 golden API

2834c69

Add absl_py parameterized testing dependency to take_while_serializat…

6255167

…ion test

Merge branch 'master' of https://github.com/tensorflow/tensorflow int…

ce955e3

…o take-while

Merge branch 'take-while' of https://github.com/squadrick/tensorflow …

ef15adf

…into take-while

tensorflow-copybara merged commit ef15adf into tensorflow:master Jan 16, 2019

PR Queue automation moved this from Reviewer Requested Changes to Merged Jan 16, 2019

tensorflow-copybara pushed a commit that referenced this pull request Jan 16, 2019

Merge pull request #24645 from Squadrick:take-while

f44c6e0

PiperOrigin-RevId: 229625006

Squadrick deleted the take-while branch January 16, 2019 22:53

tilakrayal mentioned this pull request Jan 18, 2023

Add an ability to terminate tf.data.choose_from_datasets/sample_from_datasets early #24105

Closed
