
How many train steps are needed to get the performance of the paper when finetuning TVR dataset? #14

Closed
liveseongho opened this issue Mar 10, 2021 · 2 comments


@liveseongho

Hi, I'm trying to finetune on the TVR dataset with the HERO pretrained model, but with 5000 or 10000 training steps I fail to reach the performance reported in the paper.

  1. How many training steps are needed to finetune on the TVR dataset?
  2. Is the number of GPUs critical to performance? I'm running this finetuning with 4 GPUs.

Also, the paper doesn't describe the hard negative sampling at all, but it seems to be important.
3. Have you done an ablation study on hard negatives? Could you share your experience?

@linjieli222
Owner

Hi,

Thanks for your interest in this project.

  1. We have provided the best training config. The performance reported in the paper is from 5000 steps on 8 GPUs.

  2. The number of GPUs will affect performance, as our hard negative sampling is conducted across all GPUs. With fewer GPUs, the model sees fewer examples in a single training step (see the sketch below).

  3. For hard negatives, we strictly followed how the model is trained in the original TVR work. Please check their repo.

Thanks.
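To make point 2 concrete, here is a minimal sketch of how hard-negative pooling across GPUs can work, assuming a standard PyTorch `torch.distributed` setup. The function names (`gather_candidates`, `hardest_negative_scores`) are hypothetical illustrations, not HERO's actual API:

```python
import torch
import torch.distributed as dist


def gather_candidates(local_emb: torch.Tensor) -> torch.Tensor:
    """Concatenate candidate embeddings from all GPUs.

    Falls back to the local batch when not running distributed,
    so the pool of negatives scales with the number of GPUs."""
    if not (dist.is_available() and dist.is_initialized()):
        return local_emb
    gathered = [torch.zeros_like(local_emb) for _ in range(dist.get_world_size())]
    dist.all_gather(gathered, local_emb)
    # all_gather does not carry gradients from other ranks; re-insert the
    # local tensor so its own gradient path is preserved.
    gathered[dist.get_rank()] = local_emb
    return torch.cat(gathered, dim=0)


def hardest_negative_scores(query_emb: torch.Tensor, cand_emb: torch.Tensor) -> torch.Tensor:
    """Score each query against the pooled candidates and return, per query,
    the highest-scoring non-matching candidate (assumes query i matches
    candidate i within this rank's slice of the pool)."""
    pool = gather_candidates(cand_emb)          # (B * world_size, D)
    scores = query_emb @ pool.t()               # (B, B * world_size)
    b = query_emb.size(0)
    rank = dist.get_rank() if dist.is_initialized() else 0
    idx = torch.arange(b, device=scores.device)
    scores[idx, rank * b + idx] = float("-inf")  # mask out the positives
    return scores.max(dim=1).values              # hardest negative per query
```

With 4 GPUs instead of 8 at the same per-GPU batch size, the pooled candidate set above is half as large, so the hardest negative found per query tends to be easier; training longer or increasing the per-GPU batch size may partially compensate, but may not exactly reproduce the 8-GPU numbers.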

@liveseongho
Author

Thanks for your quick response! 😃
