Dramatic performance degradation w/o top-k trick #25

FrontierBreaker · 2021-07-17T11:25:43Z

In my retraining, I got ~84 J&F with top-k but ~80 J&F without it. The trick seems to have a large effect on overall performance. Have you observed this in your experiments?

FrontierBreaker · 2021-07-17T11:27:19Z

By the way, have you studied how to choose the hyperparameter k in top-k filtering?

hkchengrex · 2021-07-17T13:05:43Z

Our s03 model drops to 84.1 without top-k. Top-k is a pretty neat trick, and we also applied it to the baseline in all ablation studies. We just tried a few values of k and picked 20.
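
For readers unfamiliar with the trick: top-k filtering, as it is usually described for memory readout, keeps only the k largest affinity scores for each query position, applies the softmax over those, and gives every other memory position zero weight. Below is a minimal plain-Python sketch of that idea for a single query. It is not the repository's code; the function name `top_k_softmax` and the toy affinity values are hypothetical and chosen only for illustration.

```python
import math


def top_k_softmax(scores, k):
    """Softmax over the k largest scores; all other positions get weight 0.

    Hypothetical helper for illustration, not taken from the repository.
    """
    if not 1 <= k <= len(scores):
        raise ValueError("k must be between 1 and len(scores)")
    # Indices of the k largest scores (ties keep the earlier index first).
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    # Subtract the maximum kept score for numerical stability.
    m = max(scores[i] for i in top)
    exps = {i: math.exp(scores[i] - m) for i in top}
    total = sum(exps.values())
    weights = [0.0] * len(scores)
    for i, e in exps.items():
        weights[i] = e / total
    return weights


# Toy affinities of one query position against five memory positions.
affinity = [2.0, -1.0, 0.5, 3.0, 0.0]
weights = top_k_softmax(affinity, k=2)      # only positions 0 and 3 are kept
full = top_k_softmax(affinity, k=len(affinity))  # same as a plain softmax
```

With k equal to the number of memory positions the function reduces to an ordinary softmax, which is one way to compare the "with top-k" and "without top-k" settings discussed in this thread.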

hkchengrex · 2021-08-22T17:28:09Z

Please see the updated README for further help with reproducibility.

hkchengrex closed this as completed Aug 22, 2021

zhouweii234 mentioned this issue Aug 31, 2021

Do you use top-k trick in the code? #53

Closed
