This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

Remove vocab from cuda #955

Closed
snisarg wants to merge 1 commit from the export-D17114398 branch

Conversation

@snisarg (Contributor) commented Sep 6, 2019

Summary:
We have users who can't train models with extremely large embeddings because we try to allocate space for the entire embedding table on the GPU.

With this diff, we add a flag that users can set explicitly to keep the embedding layer on the CPU even when the rest of the model is trained on GPUs. This is not the default because users need to know that there is a cost associated with moving tensors on and off the GPU.

Note that this only applies during training.
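
For a concrete picture, here is a minimal PyTorch sketch of the idea (only an illustration of the approach described above, not the PyText implementation; the class and parameter names are made up): the embedding table stays in host memory, and only the looked-up vectors for each batch are copied to the GPU, which is the transfer cost mentioned above.

```python
import torch
import torch.nn as nn

class CpuEmbeddingModel(nn.Module):
    """Keeps a huge embedding table in host memory while the rest of the
    network lives on the GPU (hypothetical example, not PyText code)."""

    def __init__(self, vocab_size, embed_dim, hidden_dim, device="cuda"):
        super().__init__()
        self.device = device
        self.embedding = nn.Embedding(vocab_size, embed_dim)        # stays on CPU
        self.encoder = nn.Linear(embed_dim, hidden_dim).to(device)  # lives on the GPU

    def forward(self, token_ids):
        # The lookup runs on CPU; only the small batch of embedding vectors
        # (not the whole table) is copied to the GPU each step.
        embedded = self.embedding(token_ids.cpu())
        return self.encoder(embedded.to(self.device))

device = "cuda" if torch.cuda.is_available() else "cpu"
model = CpuEmbeddingModel(vocab_size=5_000_000, embed_dim=64, hidden_dim=128, device=device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

tokens = torch.randint(0, 5_000_000, (32, 16))
loss = model(tokens).sum()
loss.backward()   # the embedding's gradients also stay on CPU
optimizer.step()
```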

Differential Revision: D17114398

@facebook-github-bot added the CLA Signed label on Sep 6, 2019
snisarg added a commit to snisarg/pytext that referenced this pull request Sep 21, 2019
Summary:
Pull Request resolved: facebookresearch#955

We have users who can't train models with extremely large embeddings because we try to allocate space for the entire embedding table on the GPU.

With this diff, we add a flag that users can set explicitly to keep the embedding layer on the CPU even when the rest of the model is trained on GPUs. This is not the default because users need to know that there is a cost associated with moving tensors on and off the GPU.

Note that this only applies during training.

Also note that this does not work in a multi-GPU environment because of the way the weights are synced via NCCL.

Differential Revision: D17114398

fbshipit-source-id: ba7b004c6e2e75af1ee9cff64eee563cf3e52435
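
To make the multi-GPU caveat above concrete, a rough sketch follows (assumed setup, not the PyText trainer): DistributedDataParallel's NCCL backend only communicates CUDA tensors, so a module whose parameters are split between CPU and GPU cannot be wrapped and synced across workers.

```python
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

def demo(rank=0, world_size=1):
    # Single-process process group, just to illustrate the failure mode.
    os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
    os.environ.setdefault("MASTER_PORT", "29500")
    dist.init_process_group("nccl", rank=rank, world_size=world_size)
    torch.cuda.set_device(rank)

    model = nn.ModuleDict({
        "embedding": nn.Embedding(5_000_000, 64),    # deliberately left on CPU
        "encoder": nn.Linear(64, 128).cuda(rank),    # lives on the GPU
    })

    # DDP over NCCL expects all parameters on the given CUDA device, so the
    # mixed CPU/GPU module above is rejected, which is the limitation noted
    # in the summary.
    DDP(model, device_ids=[rank])  # raises an error for mixed-device modules

if __name__ == "__main__":
    demo()
```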
snisarg added a commit to snisarg/pytext that referenced this pull request Sep 24, 2019
Summary: same as above (fbshipit-source-id: a9f1791d83d67f331094e64f1574cf1c149deabf)
snisarg added a commit to snisarg/pytext that referenced this pull request Sep 24, 2019
Summary: same as above (fbshipit-source-id: e28b2981fbcbb248a6a704fd3c6e325fd45490e9)
snisarg added a commit to snisarg/pytext that referenced this pull request Sep 24, 2019
Summary: same as above (fbshipit-source-id: 840f37f77c70089137f2cf23a262dc503e5e2080)
@snisarg force-pushed the export-D17114398 branch 2 times, most recently from 8a15419 to a5e9775 on September 26, 2019 at 20:36
snisarg added a commit to snisarg/pytext that referenced this pull request Sep 26, 2019
Summary: same as above (fbshipit-source-id: 56343dd90a9e05d021650b9d765274a721dffa13)
snisarg added a commit to snisarg/pytext that referenced this pull request Sep 26, 2019
Summary: same as above (fbshipit-source-id: 8da9f9628c64f23ba751d6ceb63ffe1ce9b05c17)
Summary: same as above.
Pull Request resolved: facebookresearch#955

Reviewed By: chenyangyu1988

Differential Revision: D17114398

fbshipit-source-id: 1d4c41940af0d69415b8e606899afcecc843b064
@facebook-github-bot (Contributor) commented

This pull request has been merged in 84adc39.
