I ran the model, but it doesn't converge. Is there anything I need to pay attention to? #3

Open
sccbhxc opened this issue Mar 16, 2018 · 3 comments

Comments

@sccbhxc

sccbhxc commented Mar 16, 2018

I ran the following command:

# transductive 1-shot 5-way Omniglot.
python -u run_omniglot.py --shots 1 --inner-batch 25 --inner-iters 3 --meta-step 1 --meta-batch 10 --meta-iters 100000 --eval-batch 25 --eval-iters 5 --learning-rate 0.001 --meta-step-final 0 --train-shots 15 --checkpoint ckpt_o15t --transductive

The output is "batch XXX: train=0.000000 test=0.000000".
Is something wrong?

@unixpickle
Contributor

Could you show multiple lines of the output? Each line in the output corresponds to a single task evaluation, not an average, so there will be some zeros unless the model is perfect. Does every single line look like that?

By the way, a better way to see the results is to let the run finish, at which point a full evaluation is performed. With those arguments, this will be after 100K iterations. You can also use TensorBoard to see smoothed learning curves during training.
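Since each output line is a single task evaluation, averaging over many lines gives a more meaningful number than any one line. Here is a minimal sketch of doing that by hand, assuming the log lines look exactly like the one quoted above ("batch N: train=X test=Y"). The function name `average_accuracy` and the `window` parameter are hypothetical and not part of the repository.

```python
import re

# Assumed log format, taken from the line quoted in this issue:
#   batch 123: train=0.000000 test=1.000000
LINE_RE = re.compile(r"batch (\d+): train=([0-9.]+) test=([0-9.]+)")


def average_accuracy(lines, window=100):
    """Return (mean_train, mean_test) over the last `window` matching lines.

    Lines that do not match the assumed format are skipped.
    Returns None if no line matches.
    """
    rows = []
    for line in lines:
        match = LINE_RE.search(line)
        if match:
            rows.append((float(match.group(2)), float(match.group(3))))
    recent = rows[-window:]
    if not recent:
        return None
    count = len(recent)
    mean_train = sum(row[0] for row in recent) / count
    mean_test = sum(row[1] for row in recent) / count
    return mean_train, mean_test
```

To use it, redirect the training output to a file and pass the file's lines to `average_accuracy`. If the average stays at zero over hundreds of lines, that points to a real problem rather than per-task noise.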

@sccbhxc
Author

sccbhxc commented Mar 17, 2018

@unixpickle
I failed to upload the picture of the TensorBoard curves yesterday. The training accuracy curve I get is as follows.
[Image: 2018-03-16_170534]

@unixpickle
Contributor

TensorBoard has a smoothing option, which should make the curves easier to read.

Some things to check:

  • Did you definitely download all of Omniglot? There should be ~30K images.
  • If you let the script run for the full 100K iterations, what accuracy does it output at the end?
  • What version of TensorFlow/Python are you using?
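For the first check, a quick way to count the images is to walk the dataset directory. This is only a sketch: it assumes the Omniglot images are stored as .png files somewhere under a single root directory, and `count_images` is a hypothetical helper, not something the repository provides.

```python
import os


def count_images(root, extensions=(".png",)):
    """Count files under `root` (recursively) whose names end with one of
    `extensions`, ignoring case. Assumes Omniglot images are .png files."""
    total = 0
    for _dirpath, _dirnames, filenames in os.walk(root):
        total += sum(1 for name in filenames if name.lower().endswith(extensions))
    return total
```

Call it with the path to wherever you downloaded the data, for example `count_images("path/to/omniglot")` (a placeholder path), and compare the result with the ~30K figure above. For the version check, `python --version` and `python -c "import tensorflow as tf; print(tf.__version__)"` print the two versions.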
