
Fix FlauBERT GPU test #6142

Merged (2 commits, Jul 30, 2020)
Conversation

LysandreJik (Member)

No description provided.

codecov bot commented Jul 29, 2020

Codecov Report

Merging #6142 into master will decrease coverage by 0.03%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##           master    #6142      +/-   ##
==========================================
- Coverage   78.35%   78.32%   -0.04%     
==========================================
  Files         146      146              
  Lines       26403    26403              
==========================================
- Hits        20689    20679      -10     
- Misses       5714     5724      +10     
Impacted Files                             Coverage Δ
src/transformers/modeling_flaubert.py      86.61% <100.00%> (ø)
src/transformers/generation_tf_utils.py    83.95% <0.00%> (-2.26%) ⬇️
src/transformers/file_utils.py             82.20% <0.00%> (-0.29%) ⬇️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 54f9fbe...7034997. Read the comment docs.

@LysandreJik LysandreJik requested a review from sgugger July 29, 2020 19:08
if lengths is None:
    if input_ids is not None:
        lengths = (input_ids != self.pad_index).sum(dim=1).long()
    else:
-       lengths = torch.LongTensor([slen] * bs)
+       lengths = torch.tensor([slen] * bs, device=device)
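For context, a minimal sketch of the first branch above, showing how per-example lengths are derived from input_ids when padding is present (the pad_index value of 0 is an assumption for this example; FlauBERT reads it from its config):

```python
import torch

pad_index = 0  # assumed padding id for this sketch
input_ids = torch.tensor([[5, 6, 7, 0],
                          [8, 9, 0, 0]])

# Count non-padding tokens per row, as in the diff above.
lengths = (input_ids != pad_index).sum(dim=1).long()
# lengths is tensor([3, 2]): row 0 has 3 real tokens, row 1 has 2.
```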
Collaborator

Why not LongTensor?

Member (Author)
LongTensor isn't a factory for creating tensors but the tensor class itself, so you can't pass a device to it directly. You would have to do:

lengths = torch.LongTensor([slen] * bs).to(device)

If I understand correctly, this first creates the tensor on the CPU and then creates a new one on the target device (if the device differs). The factory method creates the tensor only once, which should be more efficient. I didn't check the source code directly, though, so I may be wrong.
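The distinction can be sketched as follows (device pinned to CPU so the snippet runs anywhere; on a GPU host you would use "cuda"):

```python
import torch

bs, slen = 2, 4
device = torch.device("cpu")  # "cuda" on a GPU machine

# Class constructor: the tensor is materialized on CPU first, then
# copied by .to() if the target device differs.
a = torch.LongTensor([slen] * bs).to(device)

# Factory function: the tensor is allocated directly on the target
# device in a single step.
b = torch.tensor([slen] * bs, device=device)

# Both yield the same values and dtype (int64).
assert a.tolist() == b.tolist() == [4, 4]
assert a.dtype == b.dtype == torch.int64
```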

Collaborator

Makes sense, thanks for clarifying!

@LysandreJik LysandreJik merged commit ec02674 into master Jul 30, 2020
@LysandreJik LysandreJik deleted the fix-flaubert-gpu-test branch July 30, 2020 15:11