
Eliminate dummy batches and init_cuda_buffer. #3732

Merged
1 commit merged on Jun 21, 2021

Conversation

stephenroller
Contributor

Patch description
Eliminate the dreaded _dummy_batch and _init_cuda_buffer.

Users have complained about the presence of these methods. When debugging, the first batch being a dummy causes confusion (why am I getting gibberish data?), and having to implement _dummy_batch manually is annoying (why is my model breaking on the first pass?).

Adopt Fairseq's newer approach (sketched below):

  • Cache the very first batch we ever see. If it's invalid, we should be raising an exception anyway; if it's valid, it will be good enough.
  • Do NOT initialize the CUDA buffer. Just keep that batch around.
  • If we need to recover from an OOM, use the cached dummy batch as the data.
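A minimal sketch of that flow, assuming hypothetical method names (_cache_dummy_batch, _real_train_step, and the signature of _fake_forward_pass are illustrative, not necessarily ParlAI's exact API):

    import torch

    class CachedDummyBatchSketch:
        def _cache_dummy_batch(self, batch):
            # Remember the very first batch we ever see; it is known-valid,
            # so it can be replayed later as a stand-in batch.
            if not hasattr(self, '_dummy_batch'):
                self._dummy_batch = batch

        def train_step(self, batch):
            self._cache_dummy_batch(batch)
            try:
                return self._real_train_step(batch)
            except RuntimeError as e:
                if 'out of memory' not in str(e):
                    raise
                # OOM recovery: free cached blocks, then run a throwaway
                # forward/backward on the cached batch so distributed
                # gradient sync stays in lockstep with other workers.
                torch.cuda.empty_cache()
                return self._fake_forward_pass(self._dummy_batch)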

Testing steps
CI.

klshuster (Contributor) left a comment

Thanks for making this change. I'm approving but did have one perhaps pertinent question.

    Cache a batch to be used as a dummy during _fake_forward_pass.
    """
    if not hasattr(self, '_dummy_batch'):
        self._dummy_batch = batch
klshuster (Contributor)

Does this unnecessarily use up memory? What if the first batch is huge, for example?

stephenroller (Contributor, Author)

Could be a worse problem for images. For text, imagine it's a 2x2048x1024 LongTensor (2x for content and label; batch size 2048; string length 1024). A long is 8 bytes, so we'd be wasting 32 MiB of memory.
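For reference, the back-of-envelope arithmetic behind that estimate:

    # Rough estimate for the cached text batch described above (illustrative only).
    elements = 2 * 2048 * 1024                   # content + label tensors, batch size 2048, length 1024
    bytes_per_long = 8                           # torch.long is 64-bit
    print(elements * bytes_per_long / 2 ** 20)   # 32.0 (MiB)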
