feat: improve message content chunks handling #1845
Conversation
**Note:** this PR improves how images are handled when sent as separate content chunks to the chat endpoint. Example request:

```python
from huggingface_hub import InferenceClient

client = InferenceClient("http://127.0.0.1:3000")
chat = client.chat_completion(
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Whats in this image?"},
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/rabbit.png"
                    },
                },
            ],
        },
    ],
    seed=42,
    max_tokens=100,
)
```
Can you add some tests too for the deserialization?
Update: this PR now also parses markdown images into typed chunks. For example, both the plain-string and the structured JSON inputs are deserialized into the same representation; see text-generation-inference/router/src/lib.rs Lines 1277 to 1288 in f8be8d5 and text-generation-inference/router/src/lib.rs Lines 1300 to 1319 in f8be8d5.
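To make the normalization concrete, here is a minimal Python sketch of the rule described above: a message's `content` may be either a plain string or a list of typed chunks, and both forms reduce to the same list of chunks. The function name `to_chunks` and the dict shapes are illustrative assumptions, not the actual Rust API in this PR.

```python
def to_chunks(content):
    """Normalize a message's "content" field into a list of chunk dicts.

    Hypothetical sketch: a bare string becomes a single text chunk, while a
    list of typed entries is mapped chunk-by-chunk.
    """
    if isinstance(content, str):
        return [{"type": "text", "text": content}]
    chunks = []
    for item in content:
        if item.get("type") == "text":
            chunks.append({"type": "text", "text": item["text"]})
        elif item.get("type") == "image_url":
            # The OpenAI-style payload nests the URL under "image_url".
            chunks.append({"type": "image_url", "url": item["image_url"]["url"]})
        else:
            raise ValueError(f"unknown chunk type: {item.get('type')}")
    return chunks
```

With this shape, `to_chunks("hello")` and `to_chunks([{"type": "text", "text": "hello"}])` produce the same result, which is the equivalence the PR's deserialization tests would want to cover.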
```rust
fn parse_markdown_to_chunks(s: &str) -> Result<Vec<ContentChunk>, serde_json::Error> {
    let mut chunks = Vec::new();
    let re = Regex::new(r"!\[([^\]]*)\]\(([^)]+)\)").unwrap();
```
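The diff excerpt above is truncated, so here is a Python sketch of what a function like this would do with the same regex: scan a string for markdown images `![alt](url)` and split it into alternating text and image chunks. The tuple representation is an assumption for illustration.

```python
import re

# Same pattern as the Rust snippet: "![alt](url)", capturing alt text and URL.
IMAGE_RE = re.compile(r"!\[([^\]]*)\]\(([^)]+)\)")

def parse_markdown_to_chunks(s):
    """Split a string into ("text", ...) and ("image_url", ...) chunks."""
    chunks = []
    last = 0
    for m in IMAGE_RE.finditer(s):
        # Any text before the image becomes a text chunk.
        if m.start() > last:
            chunks.append(("text", s[last:m.start()]))
        # The captured URL (group 2) becomes an image chunk.
        chunks.append(("image_url", m.group(2)))
        last = m.end()
    # Trailing text after the last image.
    if last < len(s):
        chunks.append(("text", s[last:]))
    return chunks
```

For example, `parse_markdown_to_chunks("look ![r](http://x/r.png) now")` yields a text chunk, an image chunk, and another text chunk.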
This is exactly what we do not want.
Closing in favor of #1906.
This PR improves the parsing of content in messages by deserializing all content into `ContentChunks` (a vector of `ContentChunk`). Each `ContentChunk` is an enum representing either raw text or an image URL. For minimal change impact, `ContentChunks` can be serialized directly into a flattened string.
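The "serialized directly into a flattened string" behavior can be sketched as follows. This is an assumption about the output format (text emitted as-is, images as markdown image syntax); the actual serialization in the PR may differ.

```python
def flatten_chunks(chunks):
    """Flatten a list of chunk dicts back into a single string.

    Hypothetical sketch: text chunks pass through unchanged, and image
    chunks are rendered as markdown image syntax "![](url)".
    """
    parts = []
    for chunk in chunks:
        if chunk["type"] == "text":
            parts.append(chunk["text"])
        elif chunk["type"] == "image_url":
            parts.append(f"![]({chunk['url']})")
    return "".join(parts)
```

Round-tripping through a flattener like this is what lets the change stay minimal: downstream code that expects a plain string still receives one.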