feat: add streaming with tools support and resilience #94
Conversation
Thank you for your work! @nazq
Pull request overview
This PR adds streaming support for LLM responses with tool calls, enabling real-time processing of both text and tool invocations. It implements the new `chat_stream_with_tools()` method across Anthropic and OpenAI-compatible providers (OpenAI, Mistral, Groq, Cohere, OpenRouter, HuggingFace), and extends resilience capabilities to streaming operations.
Key changes:
- Introduces a `StreamChunk` enum to represent different streaming events (text, tool use start/delta/complete, done); a hedged sketch of its likely shape follows this list
- Implements custom SSE parsers for Anthropic and OpenAI-compatible streaming formats that handle tool calls
- Adds retry logic with exponential backoff to `chat_stream_struct()` and `chat_stream_with_tools()` in `ResilientLLM`
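For reference in the review below, here is a hedged sketch of what `StreamChunk` might look like, with variant names and fields inferred from the event kinds listed above rather than copied from `src/chat/mod.rs`:

```rust
/// Sketch only: the crate's actual variants and fields may differ.
pub enum StreamChunk {
    /// An incremental piece of assistant text.
    Text(String),
    /// A tool invocation begins; its id and tool name arrive up front.
    ToolUseStart { id: String, name: String },
    /// A fragment of the tool call's JSON arguments.
    ToolUseDelta { id: String, partial_json: String },
    /// The tool call's arguments are complete.
    ToolUseComplete { id: String, arguments: String },
    /// The stream has finished.
    Done,
}
```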
Reviewed changes
Copilot reviewed 6 out of 6 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| `src/chat/mod.rs` | Defines the `StreamChunk` enum and adds the `chat_stream_with_tools()` trait method with comprehensive documentation |
| `src/backends/anthropic.rs` | Implements Anthropic-specific streaming with tools, including an SSE parser (see the sketch after this table) and 18 unit tests covering various streaming scenarios |
| `src/backends/openai.rs` | Delegates streaming with tools to the generic `OpenAICompatibleProvider` implementation |
| `src/providers/openai_compatible.rs` | Implements generic OpenAI-compatible streaming with tools, an SSE parser, and 12 unit tests covering OpenAI and vLLM formats |
| `src/resilient_llm.rs` | Adds retry logic with exponential backoff to both `chat_stream_struct()` and `chat_stream_with_tools()` |
| `tests/test_backends.rs` | Adds 4 integration tests covering streaming with tools, text-only responses, and resilient streaming for both tools and struct modes |
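Both providers deliver responses as server-sent events, so each parser first peels the JSON payload off `data:` lines before interpreting text and tool-call fragments; Anthropic additionally tags payloads with `event:` lines (e.g. `content_block_delta`). A minimal sketch of that first step, assuming OpenAI-compatible streams end with a `[DONE]` sentinel (the function name is illustrative, not the PR's actual parser):

```rust
use serde_json::Value;

/// Illustrative helper: extract the JSON payload from one SSE "data:" line,
/// treating the OpenAI-style "[DONE]" sentinel as end-of-stream.
fn parse_sse_data_line(line: &str) -> Option<Value> {
    let payload = line.strip_prefix("data:")?.trim();
    if payload == "[DONE]" {
        return None;
    }
    serde_json::from_str(payload).ok()
}
```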
`src/backends/anthropic.rs` (outdated)
```rust
let anthropic_messages: Vec<AnthropicMessage> = messages
    .iter()
    .map(|m| AnthropicMessage {
        role: match m.role {
            ChatRole::User => "user",
            ChatRole::Assistant => "assistant",
        },
        content: match &m.message_type {
            MessageType::Text => vec![MessageContent {
                message_type: Some("text"),
                text: Some(&m.content),
                image_url: None,
                source: None,
                tool_use_id: None,
                tool_input: None,
                tool_name: None,
                tool_result_id: None,
                tool_output: None,
            }],
            MessageType::Pdf(raw_bytes) => {
                vec![MessageContent {
                    message_type: Some("document"),
                    text: None,
                    image_url: None,
                    source: Some(ImageSource {
                        source_type: "base64",
                        media_type: "application/pdf",
                        data: BASE64.encode(raw_bytes),
                    }),
                    tool_use_id: None,
                    tool_input: None,
                    tool_name: None,
                    tool_result_id: None,
                    tool_output: None,
                }]
            }
            MessageType::Image((image_mime, raw_bytes)) => {
                vec![MessageContent {
                    message_type: Some("image"),
                    text: None,
                    image_url: None,
                    source: Some(ImageSource {
                        source_type: "base64",
                        media_type: image_mime.mime_type(),
                        data: BASE64.encode(raw_bytes),
                    }),
                    tool_use_id: None,
                    tool_input: None,
                    tool_name: None,
                    tool_result_id: None,
                    tool_output: None,
                }]
            }
            MessageType::ImageURL(ref url) => vec![MessageContent {
                message_type: Some("image_url"),
                text: None,
                image_url: Some(ImageUrlContent { url }),
                source: None,
                tool_use_id: None,
                tool_input: None,
                tool_name: None,
                tool_result_id: None,
                tool_output: None,
            }],
            MessageType::ToolUse(calls) => calls
                .iter()
                .map(|c| MessageContent {
                    message_type: Some("tool_use"),
                    text: None,
                    image_url: None,
                    source: None,
                    tool_use_id: Some(c.id.clone()),
                    tool_input: Some(
                        serde_json::from_str(&c.function.arguments)
                            .unwrap_or(c.function.arguments.clone().into()),
                    ),
                    tool_name: Some(c.function.name.clone()),
                    tool_result_id: None,
                    tool_output: None,
                })
                .collect(),
            MessageType::ToolResult(responses) => responses
                .iter()
                .map(|r| MessageContent {
                    message_type: Some("tool_result"),
                    text: None,
                    image_url: None,
                    source: None,
                    tool_use_id: None,
                    tool_input: None,
                    tool_name: None,
                    tool_result_id: Some(r.id.clone()),
                    tool_output: Some(r.function.arguments.clone()),
                })
                .collect(),
        },
    })
    .collect();
```
Copilot AI (Dec 14, 2025):
The message preparation logic (converting `ChatMessage` to `AnthropicMessage`) is duplicated between `chat_with_tools()` (lines 380-477) and `chat_stream_with_tools()` (lines 709-806). This is approximately 100 lines of identical code that should be extracted into a private helper method to improve maintainability and reduce the risk of inconsistencies.
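A later commit in this PR adopts this suggestion and names the helper `convert_messages_to_anthropic()`. A minimal, self-contained sketch of the shape of that extraction, using simplified stand-in types (the crate's real `ChatMessage` and `MessageContent` carry many more fields, as the diff above shows):

```rust
// Stand-in types for illustration; the real ones live in the crate.
enum ChatRole { User, Assistant }
struct ChatMessage { role: ChatRole, content: String }
struct AnthropicMessage<'a> { role: &'static str, content: &'a str }

/// The mapping duplicated between chat_with_tools() and
/// chat_stream_with_tools(), hoisted into one private helper so both
/// call sites stay consistent.
fn convert_messages_to_anthropic(messages: &[ChatMessage]) -> Vec<AnthropicMessage<'_>> {
    messages
        .iter()
        .map(|m| AnthropicMessage {
            role: match m.role {
                ChatRole::User => "user",
                ChatRole::Assistant => "assistant",
            },
            content: &m.content,
        })
        .collect()
}
```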
`src/backends/anthropic.rs` (outdated)
```rust
let maybe_tool_slice: Option<&[Tool]> = tools.or(self.tools.as_deref());
let anthropic_tools = maybe_tool_slice.map(|slice| {
    slice
        .iter()
        .map(|tool| AnthropicTool {
            name: &tool.function.name,
            description: &tool.function.description,
            schema: &tool.function.parameters,
        })
        .collect::<Vec<_>>()
});

let tool_choice = match self.tool_choice {
    Some(ToolChoice::Auto) => {
        Some(HashMap::from([("type".to_string(), "auto".to_string())]))
    }
    Some(ToolChoice::Any) => Some(HashMap::from([("type".to_string(), "any".to_string())])),
    Some(ToolChoice::Tool(ref tool_name)) => Some(HashMap::from([
        ("type".to_string(), "tool".to_string()),
        ("name".to_string(), tool_name.clone()),
    ])),
    Some(ToolChoice::None) => {
        Some(HashMap::from([("type".to_string(), "none".to_string())]))
    }
    None => None,
};

let final_tool_choice = if anthropic_tools.is_some() {
    tool_choice.clone()
} else {
    None
};
```
Copilot AI (Dec 14, 2025):
The tool preparation and `tool_choice` mapping logic is duplicated between `chat_with_tools()` (starting around line 479) and `chat_stream_with_tools()` (lines 808-839). Extract this into a private helper method to avoid duplication and ensure consistency.
Suggested change:

```diff
-let maybe_tool_slice: Option<&[Tool]> = tools.or(self.tools.as_deref());
-let anthropic_tools = maybe_tool_slice.map(|slice| {
-    slice
-        .iter()
-        .map(|tool| AnthropicTool {
-            name: &tool.function.name,
-            description: &tool.function.description,
-            schema: &tool.function.parameters,
-        })
-        .collect::<Vec<_>>()
-});
-let tool_choice = match self.tool_choice {
-    Some(ToolChoice::Auto) => {
-        Some(HashMap::from([("type".to_string(), "auto".to_string())]))
-    }
-    Some(ToolChoice::Any) => Some(HashMap::from([("type".to_string(), "any".to_string())])),
-    Some(ToolChoice::Tool(ref tool_name)) => Some(HashMap::from([
-        ("type".to_string(), "tool".to_string()),
-        ("name".to_string(), tool_name.clone()),
-    ])),
-    Some(ToolChoice::None) => {
-        Some(HashMap::from([("type".to_string(), "none".to_string())]))
-    }
-    None => None,
-};
-let final_tool_choice = if anthropic_tools.is_some() {
-    tool_choice.clone()
-} else {
-    None
-};
+let (anthropic_tools, final_tool_choice) = Self::prepare_anthropic_tools_and_choice(
+    tools,
+    self.tools.as_deref(),
+    &self.tool_choice,
+);
```
Fair review, want an update to refactor?
Add chat_stream_with_tools method for streaming responses while handling tool calls. Implemented for Anthropic and OpenAI-compatible providers with full SSE parsing support. Also adds resilience (retry with exponential backoff) to all streaming methods: chat_stream_struct and chat_stream_with_tools. Includes integration tests for resilient streaming and vLLM-compatible SSE parsing tests.
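The retry described here presumably wraps establishing the stream, since chunks already delivered to the caller cannot be replayed. A hedged sketch of the exponential-backoff pattern; the function and parameter names are illustrative, not `ResilientLLM`'s actual API:

```rust
use std::time::Duration;

/// Retry an async operation, doubling the delay after each failure.
/// Sketch only: the real ResilientLLM also decides which errors are retryable.
async fn with_backoff<T, E, F, Fut>(
    max_retries: u32,
    base_delay_ms: u64,
    mut op: F,
) -> Result<T, E>
where
    F: FnMut() -> Fut,
    Fut: std::future::Future<Output = Result<T, E>>,
{
    let mut attempt = 0;
    loop {
        match op().await {
            Ok(value) => return Ok(value),
            Err(err) if attempt >= max_retries => return Err(err),
            Err(_) => {
                // Sleep base, 2*base, 4*base, ... between attempts.
                tokio::time::sleep(Duration::from_millis(base_delay_ms << attempt)).await;
                attempt += 1;
            }
        }
    }
}
```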
Address PR review feedback by extracting duplicated code into reusable helper methods:
- convert_messages_to_anthropic(): Converts a ChatMessage slice to the Anthropic message format, handling all message types (text, images, PDFs, tool use, tool results)
- prepare_tools_and_choice(): Prepares the Anthropic tools and tool_choice configuration from the provided tools and instance settings

This removes ~100 lines of duplicated code between chat_with_tools() and chat_stream_with_tools(), improving maintainability and reducing the risk of inconsistencies.
Addressed the Copilot review above.
Summary
- `StreamChunk` enum and `chat_stream_with_tools()` method for streaming responses while handling tool calls in real time
- Retry with exponential backoff for `chat_stream_struct()` and `chat_stream_with_tools()` in `ResilientLLM`

Test plan
- Integration tests (`test_anthropic_resilient_chat_stream_with_tools`, `test_anthropic_resilient_chat_stream_struct`)
- `cargo clippy --lib --features full` passes
- `cargo fmt` applied

API Usage
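The usage section was cut off in this capture. As a stand-in, a hypothetical example of consuming the stream, assuming `chat_stream_with_tools()` yields `Result<StreamChunk, LLMError>` items and using the variant shapes sketched earlier on this page (the trait, error type, and variant names are assumptions, not confirmed by the diffs shown here):

```rust
use futures::StreamExt;

// Hypothetical consumer; trait, error, and variant shapes are assumptions.
async fn stream_with_tools(
    llm: &impl ChatProvider,
    messages: &[ChatMessage],
    tools: &[Tool],
) -> Result<(), LLMError> {
    let mut stream = llm.chat_stream_with_tools(messages, Some(tools)).await?;
    while let Some(chunk) = stream.next().await {
        match chunk? {
            StreamChunk::Text(t) => print!("{t}"),
            StreamChunk::ToolUseStart { name, .. } => eprintln!("[tool start: {name}]"),
            StreamChunk::ToolUseDelta { partial_json, .. } => eprintln!("[args += {partial_json}]"),
            StreamChunk::ToolUseComplete { .. } => eprintln!("[tool call complete]"),
            StreamChunk::Done => break,
        }
    }
    Ok(())
}
```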