-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better handling for long text #11
Comments
Check if appendix is included for this example https://browse.arxiv.org/html/2401.00437v1 |
Moved to 13,500 99add83 Still need to improve handling. |
Now skips when too long, but still unsatisfactory. Here are some current ones:
|
Found that many are due to equations. Removed |
Removed This seems to improve unexpected lengths |
Added LangChain plus MapReduce #23 |
Currently use approx word counts by whitespace before sending to API; will truncate.
Also skips if OpenAI comes back with error.
Need a better solution to check with OpenAI's tokenizer.
The text was updated successfully, but these errors were encountered: