This directory contains integration scripts to bridge our local HuggingFace multimodal models with the Model Context Protocol (MCP) ecosystem. It allows our models to both consume external tools (as an MCP Client) and provide their capabilities to external MCP-compatible agents (as an MCP Server).
The Model Context Protocol standardizes how AI models interact with data sources and tools. This directory includes two separate standalone implementations:
- `mcp_client.py`: A client that loads a local LLM/multimodal model and connects it to any standard MCP server. This enables the model to ask the server for available tools and execute them dynamically in an interactive loop.
- `mcp_server.py`: A host application that exposes our local multimodal model as an MCP server (using the `FastMCP` framework). This allows MCP hosts (like Claude Desktop) or other agents to call our model as a standard tool.
This script loads a HuggingFace causal LM, connects to an external MCP server over stdio, retrieves the available tools, and prompts the model to invoke them via structured JSON.
```bash
python mcp_client.py \
    --model-path "path/to/local/hf/model" \
    --server "npx" \
    --server-args "-y" "@modelcontextprotocol/server-filesystem" "/Users/path/to/expose" \
    --prompt "List the files in the exposed directory."
```

- Connects over standard I/O streams using `mcp.client.stdio`.
- Dynamically converts the MCP server's JSON tool schemas into prompt descriptions.
- Parses the model's output for structured JSON, automatically executes the requested tool, and feeds the result back to the model.
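The schema-to-prompt conversion and output-parsing steps above can be sketched as follows. `tools_to_prompt` and `extract_tool_call` are illustrative names for this sketch, not the actual functions in `mcp_client.py`, and the brace-counting parser is a simplification (it ignores braces inside JSON strings):

```python
import json

def tools_to_prompt(tools: list[dict]) -> str:
    """Render MCP tool definitions (name/description/inputSchema) as prompt text."""
    lines = []
    for t in tools:
        params = ", ".join((t.get("inputSchema") or {}).get("properties", {}))
        lines.append(f"- {t['name']}({params}): {t.get('description', '')}")
    return "Available tools:\n" + "\n".join(lines)

def extract_tool_call(text: str):
    """Find the first balanced {...} span in model output and parse it as JSON."""
    start = text.find("{")
    while start != -1:
        depth = 0
        for i, ch in enumerate(text[start:], start):
            if ch == "{":
                depth += 1
            elif ch == "}":
                depth -= 1
                if depth == 0:
                    try:
                        return json.loads(text[start:i + 1])
                    except json.JSONDecodeError:
                        break  # not valid JSON; try the next '{'
        start = text.find("{", start + 1)
    return None  # no tool call found in the model output
```

A client loop would append `tools_to_prompt(...)` to the system prompt, then call `extract_tool_call(...)` on each model response to decide whether to execute a tool or return the text to the user.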
This script wraps a HuggingFace multimodal model behind the MCP boundary using the FastMCP framework. It serves as an execution endpoint for multimodal inferences.
- `analyze_image(image_path, prompt)`: Opens a local image, runs it through the vision processor, and returns the model's inference.
- `generate_text(prompt)`: Standard text-only inference.
```bash
python mcp_server.py --model-path "path/to/local/multimodal/model"
```

Once started, the server runs an event loop over stdio. You can connect to it with any compliant MCP client and invoke the `analyze_image` or `generate_text` tools.
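For example, a stdio MCP host such as Claude Desktop can launch the server from its `claude_desktop_config.json`; the `local-multimodal` key and the paths below are placeholders to adapt to your setup:

```json
{
  "mcpServers": {
    "local-multimodal": {
      "command": "python",
      "args": ["mcp_server.py", "--model-path", "path/to/local/multimodal/model"]
    }
  }
}
```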
You need the `mcp` SDK installed to run these integrations:

```bash
pip install mcp transformers torch pillow
```