Some questions in your paper #13

liming-ai · 2021-01-20T13:16:05Z

Thanks for your contribution, I tried again and could reproduce your result! It is really an amazing work!

I read your paper carefully, but there are still some details I cannot understand, could you please answer me if you have time?

Could you please explain the figure 2 in your paper? What does the Y-axis Density mean?
Can I understand the original features are obtained from embedded feature only use main pipeline as the whole model while separated features use both main pipeline and Uncertainty modeling as final model?
Does the softmax score used in table 3 of ablation study means only use main pipeline in figure3 to obtain result?
If i understand correctly, the softmax score is obtained by the original features, which means they are not separated, have unconstrained magnitudes, so is the description in the figure below wrong? It should be For the **first**, as the original......
Could you please provide your extracted features and pretrained models for ActivityNet 1.2 and ActivityNet 1.3?

Thanks again for your contribution and patience, hope you can reply to me!

The text was updated successfully, but these errors were encountered:

Pilhyeon · 2021-01-25T07:40:08Z

Hello, thanks for your interest!
I hope the replies below would help.

Density in Fig. 2 indicates the portion of samples, e.g., the value of 0.008 in density means 0.8 % of the samples are located there. It plays the exactly same role as normalization, which is necessary as the amounts of action and background frames are quite different.
Yes, you're right.
Yes, it is. To clarify more, the softmax score uses the first term (softmax score) in Eq. 3.
In fact, the softmax score is unrelated to the feature magnitudes, as they are never used. On the other hand, suppose the case where fusion score is used without uncertainty modeling loss. As you mentioned, the magnitudes are not separated, so we need to perform min-max normalization rather than using m. Therefore, it should be "the second".
For some reason, we put the ActivityNet features on hold. They may be released after the conference. We are sorry for the delay.

If you have further questions, feel free to let me know.
Thanks!

liming-ai · 2021-01-25T08:04:26Z

Thanks for your reply!

liming-ai · 2021-01-25T08:29:38Z

Hello, thanks for your interest!
I hope the replies below would help.

Density in Fig. 2 indicates the portion of samples, e.g., the value of 0.008 in density means 0.8 % of the samples are located there. It plays the exactly same role as normalization, which is necessary as the amounts of action and background frames are quite different.

Yes, you're right.

Yes, it is. To clarify more, the softmax score uses the first term (softmax score) in Eq. 3.

In fact, the softmax score is unrelated to the feature magnitudes, as they are never used. On the other hand, suppose the case where fusion score is used without uncertainty modeling loss. As you mentioned, the magnitudes are not separated, so we need to perform min-max normalization rather than using m. Therefore, it should be "the second".

For some reason, we put the ActivityNet features on hold. They may be released after the conference. We are sorry for the delay.

If you have further questions, feel free to let me know.
Thanks!

@Pilhyeon Could you please tell me that if there are also videos are excluded during training or testing in ActivityNet v1.2 or v1.3?

xumh-9 · 2021-01-26T08:01:42Z

Hello, thanks for your interest!
I hope the replies below would help.

Density in Fig. 2 indicates the portion of samples, e.g., the value of 0.008 in density means 0.8 % of the samples are located there. It plays the exactly same role as normalization, which is necessary as the amounts of action and background frames are quite different.

Yes, you're right.

Yes, it is. To clarify more, the softmax score uses the first term (softmax score) in Eq. 3.

In fact, the softmax score is unrelated to the feature magnitudes, as they are never used. On the other hand, suppose the case where fusion score is used without uncertainty modeling loss. As you mentioned, the magnitudes are not separated, so we need to perform min-max normalization rather than using m. Therefore, it should be "the second".

For some reason, we put the ActivityNet features on hold. They may be released after the conference. We are sorry for the delay.

If you have further questions, feel free to let me know.
Thanks!

@Pilhyeon Could you please tell me that if there are also videos are excluded during training or testing in ActivityNet v1.2 or v1.3?

Can you reproduce the result by training the model in your environment by yourself not using the pre-trained model ?

Pilhyeon · 2021-01-27T07:07:28Z

@mitming In fact, some of the ActivityNet videos are unavailable at this time, so the entries of training/validation videos used for experiments are slightly different depending on the papers. In our case, 9,272 training videos and 4,541 validation videos were available.

liming-ai · 2021-03-09T11:07:13Z

Hello, thanks for your interest!
I hope the replies below would help.

Density in Fig. 2 indicates the portion of samples, e.g., the value of 0.008 in density means 0.8 % of the samples are located there. It plays the exactly same role as normalization, which is necessary as the amounts of action and background frames are quite different.

Yes, you're right.

Yes, it is. To clarify more, the softmax score uses the first term (softmax score) in Eq. 3.

In fact, the softmax score is unrelated to the feature magnitudes, as they are never used. On the other hand, suppose the case where fusion score is used without uncertainty modeling loss. As you mentioned, the magnitudes are not separated, so we need to perform min-max normalization rather than using m. Therefore, it should be "the second".

For some reason, we put the ActivityNet features on hold. They may be released after the conference. We are sorry for the delay.

If you have further questions, feel free to let me know.
Thanks!

@Pilhyeon Could you please tell me that if there are also videos are excluded during training or testing in ActivityNet v1.2 or v1.3?

Can you reproduce the result by training the model in your environment by yourself not using the pre-trained model ?

Sorry, I have tried many times, but I still cannot reproduce the result in paper without pre-trained model, could you reproduce it?

Pilhyeon closed this as completed Jan 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some questions in your paper #13

Some questions in your paper #13

liming-ai commented Jan 20, 2021

Pilhyeon commented Jan 25, 2021

liming-ai commented Jan 25, 2021

liming-ai commented Jan 25, 2021 •

edited

Loading

xumh-9 commented Jan 26, 2021

Pilhyeon commented Jan 27, 2021

liming-ai commented Mar 9, 2021

Some questions in your paper #13

Some questions in your paper #13

Comments

liming-ai commented Jan 20, 2021

Pilhyeon commented Jan 25, 2021

liming-ai commented Jan 25, 2021

liming-ai commented Jan 25, 2021 • edited Loading

xumh-9 commented Jan 26, 2021

Pilhyeon commented Jan 27, 2021

liming-ai commented Mar 9, 2021

liming-ai commented Jan 25, 2021 •

edited

Loading