-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Confirm results of pretrained models #8
Comments
Hi @escorciav, Thanks for raising this, you're right this is what you'll get from the currently released models 😧.
I must have retrained the models with the erroneous configurations. I have reverted the change in 731db0d and replaced the checkpoints on dropbox. You should now be able to replicate the results in the README following the same steps you did previously Apologies for the inconvenience, I should have checked this yesterday when I made the change. Here are the results of the other reverted checkpoints TRN RGB
TSN RGB
|
I updated the std manually last night but I got the same result 😅 . I assumed the std was loaded from the ckpt.
In the meantime, I will launch again by syncing my repo with origin/master, origin=this-repo. |
The weights are the same, just the std in the hyperparameters dict is updated. No need to redownload. I might try and retrain the models over christmas with the original imagenet mean and then update the models, although I don't expect to see much change in the metrics.
…On Dec 17 2020, at 10:05 am, Victor Escorcia Castillo ***@***.***> wrote:
I updated the std manually last night but I got the same result 😅 . I assumed the std was loaded from the ckpt.
Should I download new weights?
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub (#8 (comment)), or unsubscribe (https://github.com/notifications/unsubscribe-auth/AAHL4PNJJ65PDCPR67LSQD3SVHJYJANCNFSM4U6WFKDQ).
|
Hi Will! As I said before, it seems that the std is loaded from the ckpt. I'm using commit: 731db0d. Take a look at the config below:
|
Sure, the parameters are loaded from the checkpoint, you need to update the hyperparameters dict in the checkpoint. Or you can just redownload the models, that's all I did to update them.
…On Dec 17 2020, at 11:11 am, Victor Escorcia Castillo ***@***.***> wrote:
Hi Will!
As I said before, it seems that the std is loaded from the ckpt.
I'm using commit: 731db0d. Take a look at the config below:
INFO:test:Disabling distributed backend
INFO:test:Number of GPUs 1
INFO:test:Config:
modality: RGB
seed: 42
data:
frame_count: 8
segment_length: 1
train_gulp_dir: ${data._root_gulp_dir}/rgb_train
val_gulp_dir: /fast_scratch/datasets/epic-kitchens-100/data/processed/gulp/rgb_validation
test_gulp_dir: ${data._root_gulp_dir}/rgb_test
worker_count: 8
pin_memory: true
preprocessing:
bgr: false
rescale: true
input_size: 224
scale_size: 256
mean:
- 0.485
- 0.456
- 0.406
std:
- 0.229
- 0.224
- 0.225
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub (#8 (comment)), or unsubscribe (https://github.com/notifications/unsubscribe-auth/AAHL4PPXSUOGXWTP7EK5WQ3SVHRPHANCNFSM4U6WFKDQ).
|
What was the conclusion?
|
I have trained some new RGB models with setting the std to the ImageNet mean and I get the following results (action top-1 accuracy on test set):
These are all lower than the models I have released where I trained with the std set to the ImageNet mean (results are available in the README). I will leave the original model checkpoints available since they have better performance despite their odd training regime. |
I trained these models on the training set only, not on train+val. |
#8 has more details about why we have kept our unconventional training strategy of setting the std in preprocessing to the ImageNet mean (better results across all models)
Cool, thanks for letting me know. BTW, I've not read the instructions for the evaluation server yet. |
Thanks for your note. Would be helpful if you read the instructions before raising a Q. |
Noted with thanks. Have a beautiful day and week ☀️ ! |
Hi!
I was testing the pre-trained model, TSM RGB, and I got odd results in the validation set.
For action@1, I got 28.23 while you reported 35.75
commit: d58e695
Steps
The text was updated successfully, but these errors were encountered: