GH-1079: Increase CPU training speed by pinning tensors #1082

alanakbik · 2019-09-10T13:33:18Z

This PR makes minor modifications to the .to() method of the DataPoint base class and all implementing classes, namely adding the option of moving a data point tensor to pinned memory. Pinning a tensor is a one-time cost but then allows all subsequents GPU tensor copy operations to work faster. When training a model with embeddings_storage_mode = 'cpu', we at each epoch move tensors from CPU to GPU, so this PR increases overall training speed (closes #1079).

This PR also adds a check if the .to() operation is even necessary (not needed if a tensor is already on the relevant device), leading to a small increase in training speed.

kashif · 2019-09-10T15:30:55Z

👍

alanakbik · 2019-09-10T15:31:31Z

👍

aakbik added 2 commits September 9, 2019 14:52

GH-1079: add option for pinning DataPoints to memory

39494e2

GH-1079: pin only if training on GPU

f304018

alanakbik merged commit 3c93339 into master Sep 10, 2019

alanakbik deleted the GH-1079-pinned-tensors branch September 11, 2019 11:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-1079: Increase CPU training speed by pinning tensors #1082

GH-1079: Increase CPU training speed by pinning tensors #1082

alanakbik commented Sep 10, 2019

kashif commented Sep 10, 2019

alanakbik commented Sep 10, 2019

GH-1079: Increase CPU training speed by pinning tensors #1082

GH-1079: Increase CPU training speed by pinning tensors #1082

Conversation

alanakbik commented Sep 10, 2019

kashif commented Sep 10, 2019

alanakbik commented Sep 10, 2019