It seems that STARK doesn't mention anything about a dynamically updated template (DUT for short) in its training procedure. Is this a deliberate design choice, or am I missing something?
I reckon the DUT is actually something like a short-term memory, so the transformer should not treat it the same as the normal template from the first frame; the DUT should be explicitly accounted for during training. However, this is not how STARK is implemented.
So I'm curious: what is the intuition or reasoning behind STARK's current training protocol of leaving the DUT out?
Hi, we use the get_frame_ids_trident function to sample the initial template frame, the search region frame, and the dynamic template frame. Specifically, we first randomly sample two frames as the initial template and the search region; the interval between them ranges over [0, L] (L is the sequence length). Then we sample an extra frame as the dynamic template; the interval between the search frame and the dynamic frame ranges over [0, max_interval] (the default max_interval is 200).
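For anyone reading along, the sampling scheme described above can be sketched roughly as follows. This is an illustrative reimplementation, not the actual get_frame_ids_trident code; the function name and signature here are made up for clarity.

```python
import random

def sample_trident_frames(seq_len, max_interval=200):
    """Illustrative sketch of the trident-style sampling described above
    (NOT the real get_frame_ids_trident implementation).

    Returns indices for the initial template frame, the search frame,
    and the dynamic template frame.
    """
    # Initial template and search region: any two distinct frames in the
    # sequence, so their interval can range over [0, L].
    template_id, search_id = random.sample(range(seq_len), 2)
    # Dynamic template: a frame within max_interval of the search frame.
    lo = max(0, search_id - max_interval)
    hi = min(seq_len - 1, search_id + max_interval)
    dynamic_id = random.randint(lo, hi)
    return template_id, search_id, dynamic_id
```

Note that under this scheme the dynamic template is drawn by a fixed heuristic, independently of the network's own predictions.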
Got it. So I guess the gradient of the loss won't be backpropagated through time via the dynamic templates, since they are sampled in a non-differentiable way. This is still confusing: during inference the dynamic templates are chosen by the network, but during training they seem to be independent of the rest of the model. Will such a gap affect accuracy?
Yep, during the training stage the dynamic template is sampled heuristically rather than selected by the network. Ideally, the training process should be completely consistent with the test process, which runs sequentially over the video. However, due to the memory limit, we don't implement backpropagation through time.