Bugs in evaluation code for processing language. #182

Silverster98 · 2023-12-13T04:19:37Z

Hi,

I have noticed that there may be a bug in your modified evaluation code as follows.

motion-diffusion-model/data_loaders/humanml/motion_loaders/comp_v6_model_dataset.py

Line 214 in af061ca

'cap_len': len(tokens[bs_i]),

Because the tokens are all padded, if you use len(tokens[bs_i]) to obtain the cap_len, then all sentence lengths will be max_text_len=20 + 2. This will influence the language feature extraction for computing metrics.

And the following code is the original code in HumanML3D, which uses the right token length.

motion-diffusion-model/data_loaders/humanml/motion_loaders/comp_v6_model_dataset.py

Line 100 in af061ca

'cap_len': cap_lens[0].item(),

I think this bug may lead to a performance drop in MatchingScore, R-Precision, and so on.

The text was updated successfully, but these errors were encountered:

GuyTevet · 2023-12-13T07:56:09Z

That's interesting, can you share the performance of the published model after your bug fix?

Fixed evaluation bug #182 - wrong cap_len calculation

GuyTevet · 2024-01-31T08:48:37Z

Fixed. Thanks!

GuyTevet added a commit that referenced this issue Jan 31, 2024

Merge pull request #189 from roey1rg/main

63edacf

Fixed evaluation bug #182 - wrong cap_len calculation

GuyTevet closed this as completed Jan 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugs in evaluation code for processing language. #182

Bugs in evaluation code for processing language. #182

Silverster98 commented Dec 13, 2023

GuyTevet commented Dec 13, 2023

GuyTevet commented Jan 31, 2024

Bugs in evaluation code for processing language. #182

Bugs in evaluation code for processing language. #182

Comments

Silverster98 commented Dec 13, 2023

GuyTevet commented Dec 13, 2023

GuyTevet commented Jan 31, 2024