fix(litellm): parse DeepSeek-V3 proprietary inline tool-call tokens by fuchun1010 · Pull Request #5654 · google/adk-python

fuchun1010 · 2026-05-10T15:01:09Z

Problem

DeepSeek-V3 emits tool calls using proprietary special tokens embedded in the content field:

<｜tool▁calls▁begin｜><｜tool▁call▁begin｜>function<｜tool▁sep｜>analysis_input
```json
{"work_dir_name":"..."}
```<｜tool▁call▁end｜><｜tool▁calls▁end｜>

When LiteLLM does not translate these into structured tool_calls (intermittent), ADK's fallback JSON parser finds the JSON object but rejects it because the function name (analysis_input) is embedded in the tokens (<｜tool▁sep｜>analysis_input) rather than as a name key inside the JSON payload.

Result: tool call is silently dropped and the raw tokens appear as text content.

Solution

Added _parse_deepseek_tool_calls_from_text — detects the proprietary token format, extracts function name + arguments, and emits standard ChatCompletionMessageToolCall objects
Added _extract_json_from_deepseek_args helper — handles optional Markdown code fences (```json ```) around the arguments payload
Integrated into the existing _parse_tool_calls_from_text as the first-pass parser, with fallback to generic inline JSON parsing
Supports: single tool calls, multi-tool calls, code-fenced JSON, bare JSON, surrounding text, mixed formats

Testing Plan

Unit Tests: Added 8 new tests covering:

Single tool call with code-fenced JSON args
Multiple tool calls in a single wrapped block
Bare JSON args (no code fences)
Tool call embedded in surrounding text
Text without DeepSeek tokens (no false positives)
Empty/whitespace-only text
Integration test via _parse_tool_calls_from_text
Mixed formats (DeepSeek tokens + standard inline JSON)

Regression: Full test_litellm.py: 264 passed, 0 failed

Files Changed

File	Changes
`src/google/adk/models/lite_llm.py`	+147 lines (2 new functions + integration)
`tests/unittests/models/test_litellm.py`	+124 lines (8 new test functions)

google-cla · 2026-05-10T15:01:13Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

DeepSeek-V3 emits tool calls using proprietary special tokens (<｜tool▁calls▁begin｜>…<｜tool▁call▁begin｜>function<｜tool▁sep｜>NAME) embedded in the content field. When LiteLLM does not translate these into structured tool_calls (intermittent), the existing fallback JSON parser rejects the payload because the function name is stored inside the tokens rather than as a 'name' key in the JSON object. Add _parse_deepseek_tool_calls_from_text that detects the proprietary token format, extracts the function name and arguments, and emits standard ChatCompletionMessageToolCall objects. Integrate it into the existing _parse_tool_calls_from_text pipeline. Also add _extract_json_from_deepseek_args helper to handle optional Markdown code fences (json … ) that DeepSeek wraps around the arguments payload. Closes google#5024

rohityan · 2026-05-12T03:49:06Z

Hi @fuchun1010 , Thank you for your contribution! We appreciate you taking the time to submit this pull request. Your PR has been received by the team and is currently under review. We will provide feedback as soon as we have an update to share.

rohityan · 2026-05-12T03:49:45Z

Hi @xuanyang15 , can you please review this.

xuanyang15 · 2026-05-12T18:07:21Z

@GWeale Could you please help review?

fuchun1010 · 2026-05-13T16:57:26Z

Hi @GWeale — gentle ping for review on this PR when you have a moment.
This fixes #5024 (DeepSeek-V3 tool calls silently dropped when LiteLLM doesn't translate proprietary inline tokens). 271 additions across 2 files: parser + 8 unit tests. CI is green.
Happy to address any feedback. Thanks!

fuchun1010

Review Summary

Thanks for this PR! The DeepSeek inline tool-call format is a real pain point when LiteLLM's translation is inconsistent, and this parser is a clean solution.

What Works Well ✅

Well-documented: Clear references to DeepSeek API docs and inline comments explaining the token format
Comprehensive test coverage: 8 test cases covering single/multi calls, plain JSON args (no code fences), surrounding text, mixed formats (DeepSeek + standard inline JSON), empty/whitespace-only input, and integration with the generic parser
Clean remainder handling: Surrounding text is correctly preserved and returned, matching the existing _parse_tool_calls_from_text contract
Recursive mixed-format support: When both DeepSeek tokens and standard inline JSON appear in the same text, the fallback recursion in _parse_tool_calls_from_text handles both correctly — nice touch
Quick guard optimization: The _DS_TCALLS_BEGIN not in text_block and _DS_TCALL_BEGIN not in text_block check avoids regex overhead on normal responses

Suggestions / Questions

_extract_json_from_deepseek_args round-trip: The function does json.loads(raw_decode(...)) → json.dumps(candidate, ensure_ascii=False). While functionally correct (JSON objects are unordered by spec), this round-trip could theoretically reorder keys. Is there a reason not to return the raw substring from raw_decode? Something like:
```
candidate, end = _JSON_DECODER.raw_decode(args_text, open_brace)
return args_text[open_brace:end]
```
This preserves the original formatting and avoids the serialize/deserialize cycle.
Edge case — truncated tokens: What happens when the model output is cut off mid-token (e.g., partial ＜｜tool▁call▁begin｜ due to max_tokens)? The current code appends the unparsed text to remainder_parts via the end_idx == -1 branches, which seems correct — the partial token becomes remainder text. Worth adding a test for this scenario?
Thread safety of _JSON_DECODER: The module-level _JSON_DECODER is used in _extract_json_from_deepseek_args. json.JSONDecoder instances are generally thread-safe for read-only operations (raw_decode doesn't mutate state AFAIK), but worth double-checking since lite_llm.py may be used in async/threaded contexts.
Minor: test helper deduplication: _DS_BEGIN_CALLS etc. are redefined as module-level constants in the test file with the same values as in lite_llm.py. Consider importing them from the source module to avoid drift — though I understand this may be intentional to keep tests independent of implementation details.

Verdict

LGTM overall. The suggestions above are non-blocking — the core logic is solid and the test coverage is thorough. Happy to approve once the questions above are addressed (or dismissed).

fuchun1010 force-pushed the fix/deepseek-tool-call-parsing branch from c319bae to e91b1f6 Compare May 10, 2026 15:15

fuchun1010 force-pushed the fix/deepseek-tool-call-parsing branch from e91b1f6 to 08e864e Compare May 10, 2026 15:34

rohityan self-assigned this May 12, 2026

Merge branch 'main' into fix/deepseek-tool-call-parsing

91b4afd

rohityan added models [Component] Issues related to model support needs review [Status] The PR/issue is awaiting review from the maintainer labels May 12, 2026

rohityan requested a review from xuanyang15 May 12, 2026 03:48

xuanyang15 requested a review from GWeale May 12, 2026 18:06

xuanyang15 assigned GWeale May 12, 2026

Merge branch 'main' into fix/deepseek-tool-call-parsing

231c76c

fuchun1010 added 2 commits May 14, 2026 08:09

Merge branch 'main' into fix/deepseek-tool-call-parsing

bff38fc

Merge branch 'main' into fix/deepseek-tool-call-parsing

ce2da28

fuchun1010 commented May 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(litellm): parse DeepSeek-V3 proprietary inline tool-call tokens#5654

fix(litellm): parse DeepSeek-V3 proprietary inline tool-call tokens#5654
fuchun1010 wants to merge 5 commits into
google:mainfrom
fuchun1010:fix/deepseek-tool-call-parsing

fuchun1010 commented May 10, 2026

Uh oh!

google-cla Bot commented May 10, 2026

Uh oh!

rohityan commented May 12, 2026

Uh oh!

rohityan commented May 12, 2026

Uh oh!

xuanyang15 commented May 12, 2026

Uh oh!

fuchun1010 commented May 13, 2026

Uh oh!

fuchun1010 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

fuchun1010 commented May 10, 2026

Problem

Solution

Testing Plan

Files Changed

Uh oh!

google-cla Bot commented May 10, 2026

Uh oh!

rohityan commented May 12, 2026

Uh oh!

rohityan commented May 12, 2026

Uh oh!

xuanyang15 commented May 12, 2026

Uh oh!

fuchun1010 commented May 13, 2026

Uh oh!

fuchun1010 left a comment

Choose a reason for hiding this comment

Review Summary

What Works Well ✅

Suggestions / Questions

Verdict

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants