Cannot reproduce accuracy 84% (after step2) #11

asanakoy · 2018-10-31T00:15:26Z

Hi Hao,

Thank you for a neat implementation.

I wonder whether training with the hyperparameters written in the README

 --base_lr 1e-2 \
 --batch_size 64 --epochs 25 --weight_decay 1e-5 \
 --model "model.pth"

gives 84.17% test accuracy?

I used exactly the commands you provide in the README:

    Step 1.
    $ CUDA_VISIBLE_DEVICES=0,1,2,3 ./src/bilinear_cnn_fc.py --base_lr 1.0 \
          --batch_size 64 --epochs 55 --weight_decay 1e-8 \
          | tee "[fc-] base_lr_1.0-weight_decay_1e-8-epoch_.log"

    Step 2. 
    $ CUDA_VISIBLE_DEVICES=0,1,2,3 ./src/bilinear_cnn_all.py --base_lr 1e-2 \
          --batch_size 64 --epochs 25 --weight_decay 1e-5 \
          --model "model.pth" \
          | tee "[all-] base_lr_1e-2-weight_decay_1e-5-epoch_.log"

I trained the step 1 model and got 76.67% accuracy on the test set. I then used it as the initialization for the step 2 model and fine-tuned all the layers, but the accuracy saturates at 76.61% and does not improve further.

Are there any extra tricks to get the desired performance?

rohitgajawada · 2018-10-31T08:51:50Z

Have you tried doing step-2 directly?

asanakoy · 2018-10-31T14:32:23Z

@rohitgajawada Yes, the accuracy is even lower that way, about 57%.

rohitgajawada · 2018-10-31T14:59:16Z

Oh, that is sad. I also need a reproducible bilinear CNN in PyTorch.
In this code, after the bilinear operation, the output x only goes through a sqrt. The original paper applies sign(x) * sqrt(|x|) instead. Could this be the cause of the reduced accuracy, or am I missing something?

rohitgajawada · 2018-10-31T15:04:30Z

Never mind, saw issue #4

asanakoy · 2018-10-31T15:07:36Z

That's not a problem: they compute the Gram matrix after ReLU, so every entry is non-negative and sign(x) * sqrt(|x|) reduces to sqrt(x). Do you have any other good repository in mind?
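To make the argument concrete, here is a minimal sketch in plain Python (not the repository's code). The toy feature values, the helper names `relu`, `gram` and `signed_sqrt`, and the division by the number of spatial positions are all assumptions made for illustration. It only checks that a Gram matrix built from ReLU outputs has no negative entries, so the signed square root and the plain square root agree on it.

```python
import math


def relu(v):
    """Clamp negative activations to zero."""
    return max(v, 0.0)


def signed_sqrt(v):
    """sign(v) * sqrt(|v|), the normalization described in the paper."""
    return math.copysign(math.sqrt(abs(v)), v)


def gram(features):
    """Gram matrix of C channels, each a list of N spatial positions.

    Dividing by N is an assumed normalization for this toy example.
    """
    n = len(features[0])
    return [
        [sum(a * b for a, b in zip(fi, fj)) / n for fj in features]
        for fi in features
    ]


# Hypothetical pre-activation features: 2 channels, 3 spatial positions.
features = [[-1.0, 2.0, 0.5], [3.0, -0.5, 1.0]]
activated = [[relu(v) for v in channel] for channel in features]

g = gram(activated)
plain = [[math.sqrt(v) for v in row] for row in g]
signed = [[signed_sqrt(v) for v in row] for row in g]

# Sums of products of non-negative numbers are non-negative,
# so both normalizations give the same matrix here.
same = plain == signed
print("all entries non-negative:", all(v >= 0.0 for row in g for v in row))
print("sqrt equals signed sqrt:", same)
```

The two would differ only if the bilinear features could be negative, for example if the outer product were taken before the ReLU.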

rohitgajawada · 2018-10-31T17:14:41Z

Ya my bad, realized it immediately after commenting :P If I find another repo that is able to reach the required accuracy, I'll notify you

rohitgajawada · 2018-11-02T16:11:16Z

Hi @HaoMood, any updates on this problem? Were you able to obtain the 84% test accuracy?

HaoMood · 2019-01-06T10:42:14Z

That is strange, since the random seeds are fixed and I ran it several times before this submission to make sure the result could be reproduced.

Could you give some details about your setup, such as the GPU used and the CUDA and cuDNN versions?
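For anyone replying with these details, a small script along the following lines could collect them in one go. It is only a sketch: the function name `collect_env` is made up here, and it assumes PyTorch is the framework in use, falling back to Python and platform information when `torch` cannot be imported. Fixed seeds alone generally do not guarantee identical results across different GPUs or CUDA/cuDNN versions, so these values are worth including in a report.

```python
import platform
import sys


def collect_env():
    """Gather version details that are useful in a reproducibility report."""
    info = {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
    }
    try:
        import torch
    except ImportError:
        # PyTorch is not installed; report only what is available.
        info["torch"] = None
        return info

    info["torch"] = torch.__version__
    info["cuda"] = torch.version.cuda  # None for CPU-only builds
    info["cudnn"] = torch.backends.cudnn.version()  # None if cuDNN is unavailable
    info["gpus"] = [
        torch.cuda.get_device_name(i) for i in range(torch.cuda.device_count())
    ]
    return info


if __name__ == "__main__":
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```

Pasting the printed lines into a comment, together with the exact commands used, gives the maintainer enough to compare setups.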

HaoMood closed this as completed Jan 17, 2019
