Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evaluation results #1

Open
ed-fish opened this issue May 4, 2021 · 13 comments
Open

Evaluation results #1

ed-fish opened this issue May 4, 2021 · 13 comments

Comments

@ed-fish
Copy link

ed-fish commented May 4, 2021

Hi,

Thanks for your work making a Pytorch version of the paper - much appreciated!

How does this implementation compare to results in the original paper. Specifically on the Moments in Time dataset.

Thanks,

Ed

@marco-hmc
Copy link

marco-hmc commented May 28, 2021

I am also interested at this topic.
If there is anyone could provide me more information about the model parameters that might help me fix the problem, I would be thankful for that because using the default parameters is always overfitting.

Thanks.
Marco

@Linwei94
Copy link

I run the model on a very small dataset(51 classes with 20 video clips per class) and the result is very strange. Always output the same prediction. I wonder if it can be better if I load the pre-trained weights. I would appreciate anyone who can give me some tips.

Thanks,
Dylan

@marco-hmc
Copy link

I run the model on a very small dataset(51 classes with 20 video clips per class) and the result is very strange. Always output the same prediction. I wonder if it can be better if I load the pre-trained weights. I would appreciate anyone who can give me some tips.

Thanks,
Dylan

I am facing with the same situation with you. I still don't have any ideat about that now. Waiting for the reply for authors.

@Linwei94
Copy link

I run the model on a very small dataset(51 classes with 20 video clips per class) and the result is very strange. Always output the same prediction. I wonder if it can be better if I load the pre-trained weights. I would appreciate anyone who can give me some tips.
Thanks,
Dylan

I am facing with the same situation with you. I still don't have any ideat about that now. Waiting for the reply for authors.

I wonder whether the problem results from the code or from my too-small dataset.

@vaibhavsah
Copy link

I tried it with a nearly 2000 Videos. Run different Epochs. But still the accuracy is not more than 21.09%. Strange thing is that it's same for most of the runs. No change in figures.

@Linwei94
Copy link

I tried it with a nearly 2000 Videos. Run different Epochs. But still the accuracy is not more than 21.09%. Strange thing is that it's same for most of the runs. No change in figures.

Your dataset is too small. You can try run ViViT with ViT’s weight loaded for both temporal and spatial part.

@vaibhavsah
Copy link

@DylanTao94 Can you share how I can do that.

@Linwei94
Copy link

Sry mate, my code is not allowed to share. You can follow the steps in ViViT paper.

@seandatasci
Copy link

yes this model works fine i've tested it on a dataset of 50k videos

@vaibhavsah
Copy link

@seandatasci i think I might be doing something wrong with the code. Can you help me out here. My code is here

@Mark-Dou
Copy link

@seandatasci i think I might be doing something wrong with the code. Can you help me out here. My code is here

i have the same problems with you, and i wonder you have resolved the problems whether or not, the acc or auc results is lower than 50%, the dataset size is also 2000, Thank u

@mx-mark
Copy link

mx-mark commented Dec 11, 2021

Inspired from the implementation of the ViViT by the author, we have reimplement the TimeSformer and ViViT, and release the pretrain-model weights on Kinetics600 can be found here

@mnauf
Copy link

mnauf commented Jun 8, 2023

The model isn't learning. Trained on 2 classes of UCF101 dataset. Adam optimizer, CrossEntropyLoss

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants