Checking size attribute of dst when dst is None #2

Closed
hrishikeshvganu opened this issue Sep 23, 2017 · 1 comment

hrishikeshvganu commented Sep 23, 2017

In the code below, if dst is None, the dst.sizes[idx] access inside the Exception message will throw an unhandled AttributeError: the condition short-circuits around a None dst, but the format call in the raise does not.

This is around https://github.com/facebookresearch/fairseq-py/blob/master/fairseq/data.py#L222

```
for idx in indices:
    # - 2 here stems from make_positions() where we offset positions
    # by padding_value + 1
    if src.sizes[idx] < 2 or \
            (dst is not None and dst.sizes[idx] < 2) or \
            sizes[idx] > max_positions - 2:
        raise Exception("Unable to handle input id {} of "
                        "size {} / {}.".format(idx, src.sizes[idx], dst.sizes[idx]))
```

To fix this, (dst is not None and dst.sizes[idx] < 2) can be modified to (False if dst is None else dst.sizes[idx] < 2).
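
That guard keeps the condition itself safe, but the format call in the raise still dereferences dst. A minimal sketch of a version that also guards the message (the dst_size local is introduced here purely for illustration; it is not in the original code):

```
for idx in indices:
    if src.sizes[idx] < 2 or \
            (False if dst is None else dst.sizes[idx] < 2) or \
            sizes[idx] > max_positions - 2:
        # avoid dereferencing dst in the message when dst is None
        dst_size = dst.sizes[idx] if dst is not None else None
        raise Exception("Unable to handle input id {} of "
                        "size {} / {}.".format(idx, src.sizes[idx], dst_size))
```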

edunov (Contributor) commented Sep 24, 2017

Thank you @hrishikeshvganu for finding this. It is now fixed.

@edunov edunov closed this as completed Sep 24, 2017
myleott pushed a commit that referenced this issue Sep 26, 2017
yqwangustc pushed a commit to yqwangustc/fairseq that referenced this issue May 3, 2019
…ain_step (facebookresearch#2)

Summary:
Pull Request resolved: fairinternal/fairspeq#2

Pull Request resolved: facebookresearch#689

We found that not raising OOMs during trainer.train_step causes various
issues, including NCCL hangs / gloo sync errors, because gradients are not
synced properly. Until we find the root cause, let's give users an option
to raise OOMs.

Reviewed By: jmp84

Differential Revision: D15170357

fbshipit-source-id: 1c3defd70bf97b2f4e2f1b39661c735907258194
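
For context, a minimal sketch of what such an option can look like in a generic PyTorch training step; the train_step signature and raise_oom flag here are illustrative, not fairseq's actual API:

```
import torch

def train_step(model, criterion, optimizer, batch, raise_oom=False):
    # A worker that silently swallows an OOM skips its backward pass
    # while the other workers still enter the gradient all-reduce,
    # which is how NCCL hangs / gloo sync errors arise; raise_oom
    # surfaces the error so the caller can abort the step consistently.
    try:
        optimizer.zero_grad()
        loss = criterion(model(batch["input"]), batch["target"])
        loss.backward()
        optimizer.step()
        return loss.item()
    except RuntimeError as e:
        if "out of memory" in str(e):
            if raise_oom:
                raise  # propagate instead of silently skipping the batch
            torch.cuda.empty_cache()  # best-effort local recovery
            return None
        raise
```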
facebook-github-bot pushed a commit that referenced this issue May 3, 2019
taylanbil referenced this issue in taylanbil/fairseq Aug 15, 2019
pmichel31415 pushed a commit to pmichel31415/fairseq that referenced this issue Aug 24, 2020
moussaKam pushed a commit to moussaKam/language-adaptive-pretraining that referenced this issue Sep 29, 2020
facebook-github-bot pushed a commit that referenced this issue Apr 14, 2021
Summary:
Motivation:

I want to save checkpoints frequently, because jobs on the FB cluster are unreliable and restart often. I want to do this without spamming Manifold storage, while still keeping some historical checkpoints (e.g., every 10k updates) so I can track how WER evolves over time.

To save frequently, I can use a small --save-interval-updates.

To delete old checkpoints to save storage, I can use --keep-interval-updates.

However, this deletes all old checkpoints. This is where --keep-interval-updates-pattern comes in. If I now do:

```
--save-interval-updates 1000
--keep-interval-updates 1
--keep-interval-updates-pattern 10000
```

This will:
1. checkpoint every 1000 updates, so that job restarts don't set us back significantly
2. keep only the latest checkpoint, to avoid saving a bunch of huge models in Manifold
3. make an exception to rule 2 every 10k updates, so we can track WER over time

Reviewed By: myleott

Differential Revision: D27578403

fbshipit-source-id: 5aec2dc9a22778015f7a3daa017210190af81240
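
To illustrate the retention semantics described above, a sketch in Python; prune_checkpoints and its arguments are invented for this example and are not fairseq's implementation:

```
def prune_checkpoints(saved_updates, keep_interval_updates, keep_pattern):
    # saved_updates: sorted update counts that currently have a checkpoint.
    # Keep the newest `keep_interval_updates` checkpoints, plus every
    # checkpoint whose update count is a multiple of `keep_pattern`;
    # return the update counts whose checkpoints should be deleted.
    keep = set(saved_updates[-keep_interval_updates:])
    keep |= {u for u in saved_updates if keep_pattern and u % keep_pattern == 0}
    return [u for u in saved_updates if u not in keep]

# With the flags above, after 12k updates only the checkpoints at
# 10000 (pattern match) and 12000 (latest) survive:
print(prune_checkpoints(list(range(1000, 13000, 1000)), 1, 10000))
# -> [1000, 2000, ..., 9000, 11000] are deleted
```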
st-vincent1 added a commit to st-vincent1/fairseq that referenced this issue Jul 25, 2022
sunyt32 pushed a commit to sunyt32/fairseq that referenced this issue Mar 23, 2024