Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Strange fluctuation on curves even after large #seqs have been trained with #10

Closed
marcwww opened this issue Jun 4, 2018 · 6 comments
Closed
Assignees

Comments

@marcwww
Copy link

marcwww commented Jun 4, 2018

7711528080263_ pic_hd
7701528080263_ pic_hd

(random seed=10)
As the plots show, after 120,000 seqs, there still occurs some fluctuation of cost, which seems not to match that of the results in your experiments and the original authors'.
What could probably be the reasons?
How to copy with this?
THANKS A LOT.

@loudinthecloud
Copy link
Owner

Hey, can you please test after reverting d7b3840?

@marcwww
Copy link
Author

marcwww commented Jun 6, 2018

image
image
it seems that change does not helpy

@loudinthecloud
Copy link
Owner

Interesting, perhaps it's related to the seed (initialization and random training samples). Can you please test using a different seed? My test involved averaging 4 different seeds. I'll try to reproduce with seed=10 as well.

@loudinthecloud loudinthecloud self-assigned this Jun 26, 2018
@loudinthecloud
Copy link
Owner

It's seems to be a seed issue, I attached plots for the training of the copy task with seed=1000. Results may vary based on the seed as it controls the initialization and the training examples as well (which are random sequences of bits).

Loss

Cost

@marcwww
Copy link
Author

marcwww commented Jun 28, 2018

Could u please list a detailed param setting? Thanks a lot.

@loudinthecloud
Copy link
Owner

Hi, sure.
It appears in the copy notebook, at the beginning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants