Skip to content
Discussion options

You must be logged in to vote

Thanks for the detailed report - the logs made this easy to pin down.

What's happening

The truncation is on the output side, and graphify is actually recovering from it: when a chunk's JSON comes back truncated/unparseable, it bisects the chunk (splitting into halves of 2 and 2) and re-extracts the smaller halves. So those warnings are noisy but not data loss - the affected files get re-extracted on smaller inputs.

But there was a real bug underneath it. The OpenAI-compatible backends (ollama, openai, deepseek, kimi) define their output cap as max_tokens: 16384 in the backend config, but the request dispatch only read a max_completion_tokens key - which only the gemini config defines. So …

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@designermonkey
Comment options

Answer selected by safishamsi
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants