New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[LayoutLM] How to reproduce FUNSD result #134
Comments
Hi @nv-quan ,
|
Thank you, here are my preprocessing scripts:
Testing:
|
Also, I can see a lot of "WARNING maximum sequence length exceeded: No prediction for" in the log, is that normal? |
I don't think so. The documents longer than 512 should be split into chunks to fit the max sequence length. So these warnings are abnormal. I can correctly generate data with the preprocessing commands you provided. Please check if the commands have been correctly executed. |
Hi @nv-quan where you able to resolve this issue? |
@marythomaa98 not yet, I was kind of busy so I didn't look at it yet, but I'll try to fix this bug tomorrow. |
@nv-quan okay sure! Do let me know if it works out, I am getting the same support number as you. |
@marythomaa98 The preprocessing is totally fine, but for some reason there are less prediction labels than input |
@nv-quan The dataset contains empty text but with non-empty labels. I think you may need to remove them. |
@wolfshow I'm comparing 2 files output/test_predictions.txt and data/test.txt, everything seems ok until line 181, the test data is still continue for that example_id while in test_predictions it prints '\n' (end of example_id). And the text in the testing data is not empty at all. |
Hi @nv-quan , |
@ranpox thank you, now the number of support is correct but the result is still off:
|
Hi @nv-quan on adding --do_lower_case and --fp16 works for me |
@marythomaa98 thanks a lot, it works when I add --do_lower_case to my test script. And also remove the data/cached_test_model_512 |
@marythomaa98 @nv-quan @ranpox can you paste your final command here to predict? I am having a bit of trouble understanding where I place my test input, where i place test output, and where the trained model sits. Also @ranpox I had to set the max sequence length to 128 or the cuda would run out of memory, is that an issue? |
could you explain the difference between the different labels. I know the difference of Answer vs Header vs Question vs Other. but what does B-ANSWER vs E-ANSWER vs I-ANSWER vs S-ANSWER mean?? |
As far as I understand, B is beginning, E is end I is in-between (or something similar), S is single. |
Hello,
I have run fine tuning for the Sequence Labeling Task with FUNSD dataset but my I couldn't achieve the result presented in the paper (precision is only 40%), here are some scripts and log that I used, any idea about what could be wrong?
Thank you very much.
Training:
Testing:
Some log:
The text was updated successfully, but these errors were encountered: