In https://github.com/jingtaozhan/DRhard/blob/dc17f3d1f7f59d13d15daa1a728dc8d6efc48b92/dataset.py, if we take a look at the data collator:

```python
def get_collate_function(max_seq_length):
    cnt = 0
    def collate_function(batch):
        nonlocal cnt
        length = None
        # For the first 10 batches, pad to the full max_seq_length;
        # afterwards, length stays None and padding is dynamic.
        if cnt < 10:
            length = max_seq_length
            cnt += 1
        input_ids = [x["input_ids"] for x in batch]
        attention_mask = [x["attention_mask"] for x in batch]
        data = {
            "input_ids": pack_tensor_2D(input_ids, default=1,
                                         dtype=torch.int64, length=length),
            "attention_mask": pack_tensor_2D(attention_mask, default=0,
                                             dtype=torch.int64, length=length),
        }
        ids = [x['id'] for x in batch]
        return data, ids
    return collate_function
```
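For context, `pack_tensor_2D` is the padding helper defined elsewhere in dataset.py. Roughly, it pads a list of variable-length token lists into a 2D tensor; a minimal sketch of that behavior (not the repo's exact code):

```python
import torch

def pack_tensor_2D(lstlst, default, dtype, length=None):
    # Pad a list of variable-length lists into a (batch, length) tensor,
    # filling unused positions with `default`. When `length` is None,
    # the longest list in the batch determines the width.
    batch_size = len(lstlst)
    length = length if length is not None else max(len(lst) for lst in lstlst)
    tensor = default * torch.ones((batch_size, length), dtype=dtype)
    for i, lst in enumerate(lstlst):
        tensor[i, :len(lst)] = torch.tensor(lst, dtype=dtype)
    return tensor
```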
we see that there is a `cnt` variable which decides whether `collate_function` should pad to the maximum length or not. I couldn't work out why it is needed. Could you please explain the significance of `cnt`?

Thank you
AM
It is a simple trick I used. Some inappropriate hyperparameters may trigger an out-of-memory error during training. This code therefore forces the input to have the maximum sequence length at the beginning of training: if the batch size or the max sequence length is too big, the error is triggered right from the start and I can easily tell.
You can also delete this code.
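To illustrate the effect, here is a hypothetical usage sketch; `train_dataset` and the hyperparameter values are placeholders, not the repo's actual training setup:

```python
from torch.utils.data import DataLoader

# Assumed setup: train_dataset yields dicts with "input_ids",
# "attention_mask", and "id"; 512 is a placeholder max_seq_length.
collate_fn = get_collate_function(max_seq_length=512)
loader = DataLoader(train_dataset, batch_size=32, collate_fn=collate_fn)

# The first 10 batches are padded to the full 512 tokens, so an OOM caused
# by batch_size * max_seq_length being too large surfaces immediately.
# Later batches are padded only to the longest sequence they contain.
for data, ids in loader:
    ...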