Which paper is implemented by this repository? "what you get".. or "coarse to fine" #99

Open
ganliqiang opened this issue Jun 21, 2019 · 1 comment

Comments

@ganliqiang

The paper link does not point to "What You Get Is What You See"; it points to "Image-to-Markup Generation with Coarse-to-Fine Attention". I want to know which paper I should read to understand the principle. Thanks.

@da03
Owner

da03 commented Jun 21, 2019

Hi @ganliqiang, "image-to-markup generation with coarse-to-fine attention" is just an updated version of "what you get is what you see". However, coarse-to-fine attention is not implemented in this repo, so reading the earlier version of the paper is enough for understanding this implementation.

However, note that there is a difference: in this repo the attention is over columns of features (so every feature in a column is given the same weight), while in the paper we used a more powerful attention that attends over individual features. Which one is better (attending over columns or over individual features) depends on the problem: for normal OCR, column-wise attention is probably enough if the images are not distorted, while for more complicated structures such as math formulas or tables we should attend to individual positions. It also depends on the amount of training data available: since attending to individual features is more powerful, we'd expect it to need more training data to work well.
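
To make the distinction concrete, here is a minimal sketch of the two weighting schemes on a tiny feature grid. It is not the code from this repo or from the paper: the dot-product scoring, the mean pooling of each column, and all function names are assumptions chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a flat list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def column_attention(grid, query):
    """Attend over columns; grid[h][w] is a feature vector.

    Returns an H x W weight map in which every feature of a column
    shares the same weight.
    """
    height, width, dim = len(grid), len(grid[0]), len(query)
    # Assumption: each column is summarized by the mean of its features.
    pooled = [
        [sum(grid[h][w][d] for h in range(height)) / height for d in range(dim)]
        for w in range(width)
    ]
    col_weights = softmax([dot(col, query) for col in pooled])
    # Spread each column weight evenly over the rows of that column.
    return [[col_weights[w] / height for w in range(width)] for _ in range(height)]


def position_attention(grid, query):
    """Attend over individual features; returns an H x W weight map."""
    height, width = len(grid), len(grid[0])
    flat = softmax(
        [dot(grid[h][w], query) for h in range(height) for w in range(width)]
    )
    return [flat[h * width:(h + 1) * width] for h in range(height)]


# Hypothetical 2 x 3 grid of 2-dimensional features; only the feature at
# row 1, column 1 matches the query.
grid = [
    [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]],
    [[0.0, 0.0], [4.0, 0.0], [0.0, 0.0]],
]
query = [1.0, 0.0]

col_map = column_attention(grid, query)
pos_map = position_attention(grid, query)
```

In `col_map` both rows are identical, so the model cannot tell which row of column 1 carries the signal. In `pos_map` the weight concentrates on the single matching position, which is the extra flexibility (and the extra thing to learn) described above.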
