question about "visual sentinel" #13

Open
wzn0828 opened this issue Feb 27, 2018 · 3 comments

Comments

@wzn0828

wzn0828 commented Feb 27, 2018

Dear Jiasen Lu,
Thank you for your work on "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning".
I am writing to ask about the "visual sentinel": what is the difference between your "visual sentinel" and the hidden state ht?
I think your visual sentinel "st" and the LSTM's hidden state "ht" are the same except for the different symbols. Am I right? If I am wrong, could you kindly give me some further explanation?

Many thanks in advance for your answer.
Kind regards
Zhennan Wang

@jamiechoi1995

@wzn0828
In my opinion, the formulations of "ht" and "st" are similar, but they are affected by different variables when the loss is backpropagated, which is why they end up having different effects.
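
For reference, the two formulations can be written side by side. This is a sketch based on the sentinel definition in the paper; the output-gate weight names W_o and U_o are assumed labels for a standard LSTM, and bias terms are omitted for brevity.

```latex
% Hidden state of a standard LSTM (W_o, U_o are assumed names for the output-gate weights)
o_t = \sigma(W_o x_t + U_o h_{t-1}), \qquad h_t = o_t \odot \tanh(m_t)

% Visual sentinel, with its own gate g_t and its own weights W_x, W_h
g_t = \sigma(W_x x_t + W_h h_{t-1}), \qquad s_t = g_t \odot \tanh(m_t)

% The sentinel enters the loss through the adaptive context vector,
% mixed with the attended image context c_t by the sentinel weight \beta_t
\hat{c}_t = \beta_t s_t + (1 - \beta_t) c_t
```

Both quantities read from the same memory cell m_t, but they use different gates with separate parameters, and s_t receives its gradient through the mixing with c_t rather than through the usual hidden-state path.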

@jiasenlu
Owner

Yes, I agree with @jamiechoi1995. Based on the different inductive biases you impose on the model, the weights will learn different functions.
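
As a rough numerical illustration of this point (a minimal NumPy sketch, not the code from this repository): the hidden state and the sentinel share the memory cell but pass it through gates with separate weights, so they generally take different values. The weight names `W_o` and `U_o`, the hidden size, and the random values are all assumptions made for the example, and bias terms are left out.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
d = 4  # hidden size, chosen arbitrarily for the example

# Shared inputs at time step t (random stand-ins, not real model activations)
x_t = rng.standard_normal(d)     # LSTM input
h_prev = rng.standard_normal(d)  # previous hidden state h_{t-1}
m_t = rng.standard_normal(d)     # memory cell, shared by h_t and s_t

# Separate parameters: output gate (assumed names) vs. sentinel gate
W_o, U_o = rng.standard_normal((d, d)), rng.standard_normal((d, d))
W_x, W_h = rng.standard_normal((d, d)), rng.standard_normal((d, d))

o_t = sigmoid(W_o @ x_t + U_o @ h_prev)  # output gate
g_t = sigmoid(W_x @ x_t + W_h @ h_prev)  # sentinel gate

h_t = o_t * np.tanh(m_t)  # hidden state
s_t = g_t * np.tanh(m_t)  # visual sentinel

print("h_t:", h_t)
print("s_t:", s_t)
```

During training the two gates would also receive different gradients, since s_t is used only when it is mixed with the attended image features, which is what lets the weights learn different functions.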

@wzn0828
Author

wzn0828 commented Apr 24, 2018

@jamiechoi1995 @jiasenlu Thank you all, I think I have gotten a deeper understanding of this sentinel with your help.


3 participants