Add new op BytesInUse, similar to MaxBytesInUse #13107

yaroslavvb · 2017-09-17T23:02:48Z

Adding BytesInUse.
This is more useful than MaxBytesInUse for getting peak memory for a given session.run call, because the latter gives the maximum memory usage over the lifetime of the allocator, which can span multiple session.run calls and multiple session objects.
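
To make the distinction concrete, here is a minimal sketch of the two counters. `ToyAllocator` is a hypothetical stand-in written for illustration, not TensorFlow's allocator, and the 1000-byte buffers are made-up sizes. It only shows why a high-water mark stays "sticky" across runs while the current count does not.

```python
class ToyAllocator:
    """Toy stand-in for a device allocator (an assumption, not TensorFlow code)."""

    def __init__(self):
        self.bytes_in_use = 0      # memory currently held
        self.max_bytes_in_use = 0  # high-water mark since construction

    def allocate(self, nbytes):
        self.bytes_in_use += nbytes
        self.max_bytes_in_use = max(self.max_bytes_in_use, self.bytes_in_use)

    def free(self, nbytes):
        self.bytes_in_use -= nbytes


alloc = ToyAllocator()

# First "run": three buffers are live at once, then all are released.
for _ in range(3):
    alloc.allocate(1000)
for _ in range(3):
    alloc.free(1000)

# Second "run": a single buffer is live at the moment we sample.
alloc.allocate(1000)
current, peak = alloc.bytes_in_use, alloc.max_bytes_in_use
alloc.free(1000)

print(current, peak)  # 1000 3000
```

The sample taken during the second run reports 1000 bytes in use, while the high-water mark still reflects the first run.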

mention-bot · 2017-09-17T23:02:49Z

@yaroslavvb, thanks for your PR! By analyzing the history of the files in this pull request, we identified @wujingyue, @lukeiwanski and @tensorflower-gardener to be potential reviewers.

tensorflow-jenkins · 2017-09-17T23:02:49Z

Can one of the admins verify this patch?

yaroslavvb · 2017-09-18T00:06:21Z

Note that bytes_in_use is also what's used by timeline for memory tracking -- 7b5f590

wujingyue · 2017-09-18T00:12:35Z

tensorflow/contrib/memory_stats/kernels/memory_stats_ops.cc

+  }
+};
+
+// register this op on GPU only, see comment for MaxBytesInUse for reason


Nit: "// Register"

wujingyue · 2017-09-18T00:12:40Z

tensorflow/contrib/memory_stats/kernels/memory_stats_ops.cc

+
+#ifdef TENSORFLOW_USE_SYCL
+REGISTER_KERNEL_BUILDER(
+    Name("MaxBytesInUse").Device(DEVICE_SYCL).HostMemory("out"),


Do you mean "BytesInUse" instead of "MaxBytesInUse"?

wujingyue · 2017-09-18T00:13:19Z

tensorflow/contrib/memory_stats/kernels/memory_stats_ops.cc

@@ -16,6 +16,31 @@ limitations under the License.

 namespace tensorflow {

+// Op that measures current memory in bytes.
+class BytesInUseOp : public MemoryStatsOp {


For consistency, I suggest moving this op after MemoryStatsOp.

wujingyue · 2017-09-18T00:17:57Z

tensorflow/contrib/memory_stats/python/kernel_tests/memory_stats_ops_test.py

      self.assertGreaterEqual(max_bytes_in_use, matrix_size_in_bytes * 3)
      self.assertLess(max_bytes_in_use, matrix_size_in_bytes * 4)
+      self.assertGreaterEqual(bytes_in_use, matrix_size_in_bytes * 3)


The tests you added don't distinguish BytesInUse from MaxBytesInUse. One idea is to launch a second session with a smaller graph. MaxBytesInUse should return the larger size, and BytesInUse should return the smaller size.

yaroslavvb · 2017-09-18T17:31:47Z

OK, I guess to add a new test I'll actually need to set up a TensorFlow development env :) I'll ping this thread after updating.

yaroslavvb · 2017-09-23T04:27:27Z

Added a test to make sure intermediate memory usage gets measured correctly, PTAL.

sb2nov · 2017-09-26T18:09:43Z

Jenkins, test this please.

wujingyue · 2017-09-27T04:00:02Z

tensorflow/contrib/memory_stats/python/kernel_tests/memory_stats_ops_test.py

+      # intermediate result allocates 1 matrix, max usage is at least 2
+      self.assertGreaterEqual(bytes_in_use, matrix_size_in_bytes * 1)
+      self.assertLess(bytes_in_use, matrix_size_in_bytes * 2)
+      self.assertGreaterEqual(max_bytes_in_use, matrix_size_in_bytes * 2)


Can we make this more strict? max_bytes_in_use should be >= matrix_size_in_bytes * 3, because as you said it's sticky to the allocator.

yaroslavvb · 2017-09-27

I left it that way because I wasn't sure what is guaranteed about the lifetime of the allocator. If the allocator gets reset between run calls, this test would be too strict. I'll assume the allocator lifetime is guaranteed to span 2 session.run calls and update this in a bit.
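
The dependence of the stricter bound on allocator lifetime can be sketched as follows. This is a toy replay of allocation events, not the actual test: the `replay` helper, the 1024x1024 float32 matrix shape, and the 256 bytes of overhead are all hypothetical values chosen for illustration.

```python
def replay(events, in_use=0, peak=0):
    """Apply signed byte deltas and return (in_use, peak) afterwards."""
    for delta in events:
        in_use += delta
        peak = max(peak, in_use)
    return in_use, peak


# Hypothetical sizes: one 1024x1024 float32 matrix plus a little bookkeeping.
matrix_size_in_bytes = 1024 * 1024 * 4
overhead = 256

# Run 1: three matrices are live together, then everything is released.
run1 = ([matrix_size_in_bytes] * 3 + [overhead]
        + [-matrix_size_in_bytes] * 3 + [-overhead])
in_use, peak = replay(run1)

# Run 2, up to the sample point: one intermediate matrix is live.
run2_until_sample = [matrix_size_in_bytes, overhead]

# Case A: the allocator survives between runs, so the peak carries over.
bytes_in_use, max_bytes_in_use = replay(run2_until_sample, in_use, peak)
assert matrix_size_in_bytes * 1 <= bytes_in_use < matrix_size_in_bytes * 2
assert max_bytes_in_use >= matrix_size_in_bytes * 3

# Case B: the allocator is reset between runs, so the peak starts from zero
# and the stricter bound on max_bytes_in_use would no longer hold.
_, reset_peak = replay(run2_until_sample)
assert reset_peak < matrix_size_in_bytes * 3
```

In this model the `* 3` lower bound passes only when the allocator state from the first run is still present during the second, which is the assumption stated above.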

wujingyue

LGTM with minor comments.

sb2nov · 2017-09-28T20:04:57Z

Jenkins, test this please.

yaroslavvb · 2017-09-28T21:34:27Z

macOS failures are unrelated to this CL (GCS cloud tests).

sb2nov · 2017-09-28T22:17:09Z

That is a known flaky test so re-running the tests.

Jenkins, test this please.

googlebot added the cla: yes label Sep 17, 2017

yaroslavvb requested a review from wujingyue September 17, 2017 23:02

wujingyue reviewed Sep 18, 2017

yaroslavvb added the stat:awaiting response Status - Awaiting response from author label Sep 18, 2017

yaroslavvb added 3 commits September 22, 2017 16:04

Add new op BytesInUse, similar to MaxBytesInUse

1b909c5

incorporate PR suggestions

088cdea

improve test + fix

7e6fd8a

yaroslavvb force-pushed the bytes_in_use branch from dafaf99 to 7e6fd8a Compare September 23, 2017 04:26

yaroslavvb removed the stat:awaiting response Status - Awaiting response from author label Sep 23, 2017

sb2nov assigned wujingyue Sep 26, 2017

sb2nov added the awaiting review Pull request awaiting review label Sep 26, 2017

wujingyue reviewed Sep 27, 2017

wujingyue approved these changes Sep 27, 2017

make test more strict

6c42648

sb2nov self-assigned this Sep 28, 2017

sb2nov merged commit 0b13150 into tensorflow:master Sep 29, 2017

yaroslavvb deleted the bytes_in_use branch September 29, 2017 23:30

yaroslavvb mentioned this pull request Feb 8, 2018

Feature request: get available GPU memory #7537

Closed

copybara-service bot mentioned this pull request May 29, 2024

PR #13107: [ROCm] Fix reduce_row_vectorized.hlo.test #68849

Merged
