You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Violet uses most commen answers as candidates, but there are other answers in the test set. How do you deal with them? Are they abandoned according to the txt_msvd.json?
The text was updated successfully, but these errors were encountered:
I have checked the code and data. It seems that the authors just keep the Top-1k answers (MSVD-QA) and remove the unseen-answer samples . It is certainly not a standard and will give rise to unreasonably higher results than others. I thus doubt about the fairness of the comparison in the paper.
Please correct me if I am wrong, or othewise the authors need to revise their evaluation & results..
Super thanks for the clarification!
After checking it, there is exactly a performance error for QAOE in the original VIOLET.
In our new Empirical-MVM, we have fixed it (treat all questions in the testing set), and the accuracy is correct.
We will release the code of Empirical-MVM soon 😊
Violet uses most commen answers as candidates, but there are other answers in the test set. How do you deal with them? Are they abandoned according to the txt_msvd.json?
The text was updated successfully, but these errors were encountered: