v1.7.0
DeepDoc 1.7.0 adds real token-by-token streaming to the chatbot's Fast and Deep Research modes, delivering the same live-answer experience already available in Code-aware mode.
Added
- Added
POST /query/streamSSE endpoint that streams the Fast mode answer token-by-token before emitting a finalresultevent. - Added
POST /deep-research/streamSSE endpoint that streams the Deep Research synthesis answer token-by-token before emitting a finalresultevent. - Added
complete_stream()method toLiteLLMChatClientusinglitellm.completion(stream=True), yielding token strings as they arrive. - Added
token_callbackparameter to_complete_with_continuation(),query(),deep_research(), and_run_research_mode()so the final answer generation can push tokens to any caller. - Added
synthesis_token_callbacktoDeepResearcherso only the synthesis step streams tokens (sub-question expansions remain non-streaming).
Changed
- Updated the generated chatbot UI so Fast and Deep modes fetch from the new
/streamendpoints and progressively render the answer withReactMarkdownas tokens arrive, falling back to the non-streaming endpoints if the stream is unavailable.