Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to handle short sentences/contexts #13

Closed
raman-r-4978 opened this issue Jan 30, 2020 · 1 comment
Closed

How to handle short sentences/contexts #13

raman-r-4978 opened this issue Jan 30, 2020 · 1 comment

Comments

@raman-r-4978
Copy link

Hi @seominjoon @jhyuklee

  • The default model performs well for SQUAD_v1.1 dataset (where context length is ~700 chars)
    But It fails to perform, when I try to index my custom data which has small paragraph/contexts (length ~100-150 characters).
    • The problem is, irrespective of the query, the same result (wrong) is being returned as the output
    • Most the time, the result is just single random character like ? . (end of the context)
    • I have debugged into this and realized that the problem stays in start vectors which we generate from model output

Ques:

  1. May I know why this scenario occurs?
  2. What is the solution?

Setting:
All the results are obtained using the commands mentioned in README.

@jhyuklee
Copy link
Collaborator

Reference: #2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants