An information access platform that automatically downloads, converts, vectorizes, and provides search/chat APIs for NVIDIA DeepStream documentation.
DeepstreamMCP automatically downloads the NVIDIA DeepStream SDK documentation, converts it to text, indexes it into a vector database, and provides natural language search/chat capabilities. The goal is to enable fast and intelligent querying of technical documentation.
- Automatically downloads and updates DeepStream documentation
- Converts HTML documents to readable plain text
- Indexes all texts into a vector database (ChromaDB)
- Natural language search and sample document retrieval
- Smart chatbot interface integrated with Gemini LLM
- Tool-based API support via the MCP protocol
- download_docs.py: Downloads DeepStream documentation from the web (HTML).
- html2txt.py: Converts downloaded HTML files to readable plain text.
- vectorize_docs.py: Vectorizes all text files and adds them to ChromaDB.
- mcp_server.py: Provides search and sample document APIs over the vector database (via MCP protocol).
- client.py: Interactive client to connect to the MCP server and test tools.
- gemini_chatbot.py: Smart chatbot interface integrated with Gemini LLM and document search.
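As a rough sketch of how mcp_server.py might expose a search tool (the tool name, the stubbed keyword search, and the tiny in-memory corpus are illustrative; the real server queries the ChromaDB index):

```python
def search_docs(query: str, n_results: int = 3) -> list[str]:
    """Return the documents most relevant to the query.

    Stubbed with keyword overlap here; the real server queries ChromaDB.
    """
    corpus = [
        "DeepStream pipelines are built from GStreamer elements.",
        "nvinfer runs TensorRT inference inside a DeepStream pipeline.",
        "Gst-nvstreammux batches frames from multiple input sources.",
    ]
    scored = sorted(
        corpus,
        key=lambda doc: len(set(query.lower().split()) & set(doc.lower().split())),
        reverse=True,
    )
    return scored[:n_results]


def main() -> None:
    # MCP wiring, assuming the official Python SDK's FastMCP helper.
    from mcp.server.fastmcp import FastMCP

    server = FastMCP("deepstream_docs")
    server.tool()(search_docs)  # register search_docs as an MCP tool
    server.run()  # stdio transport, matching the mcp.json config below


if __name__ == "__main__":
    main()
```

Registering a plain function this way keeps the search logic testable without a running MCP session.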
docs (downloaded HTML) → docs_txt (plain text) → chroma_db (vector DB)
- Install the required dependencies:
uv sync
uv pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
uv pip install sentence_transformers
- Download the documentation:
uv run python download_docs.py
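A minimal sketch of what the download step does (the filename scheme and the `download` helper are illustrative assumptions; see download_docs.py for the actual logic):

```python
from pathlib import Path
from urllib.parse import urlparse


def url_to_filename(url: str) -> str:
    """Derive a flat local filename from a documentation URL."""
    path = urlparse(url).path.strip("/")
    return path.replace("/", "_") or "index.html"


def download(url: str, out_dir: str = "docs") -> Path:
    """Fetch one page and save it under out_dir (requires requests)."""
    import requests  # listed in the project dependencies

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    target = Path(out_dir) / url_to_filename(url)
    resp = requests.get(url, timeout=30)
    resp.raise_for_status()
    target.write_bytes(resp.content)
    return target
```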
- Convert HTML to text:
uv run python html2txt.py
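Roughly what the conversion step does; this sketch uses only the standard library, whereas html2txt.py relies on beautifulsoup4 and readability-lxml per the dependency list:

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping script and style contents."""

    SKIP = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self.parts: list[str] = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())


def html_to_text(html: str) -> str:
    parser = TextExtractor()
    parser.feed(html)
    return "\n".join(parser.parts)
```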
- Build the vector database:
uv run python vectorize_docs.py
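A rough sketch of what vectorize_docs.py might do: split each text file into overlapping chunks, embed them, and add them to a ChromaDB collection. The chunk sizes, embedding model, and collection name are assumptions, not taken from the script:

```python
from pathlib import Path


def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into fixed-size chunks with some overlap for context."""
    chunks = []
    step = size - overlap
    for start in range(0, max(len(text), 1), step):
        chunk = text[start:start + size]
        if chunk:
            chunks.append(chunk)
    return chunks


def build_index(txt_dir: str = "docs_txt", db_dir: str = "chroma_db") -> None:
    """Embed every chunk and store it (requires chromadb + sentence_transformers)."""
    import chromadb
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed model choice
    client = chromadb.PersistentClient(path=db_dir)
    collection = client.get_or_create_collection("deepstream_docs")
    for path in Path(txt_dir).glob("*.txt"):
        chunks = chunk_text(path.read_text(encoding="utf-8"))
        collection.add(
            ids=[f"{path.stem}-{i}" for i in range(len(chunks))],
            documents=chunks,
            embeddings=model.encode(chunks).tolist(),
        )
```

Overlapping chunks help queries match passages that would otherwise be cut in half at a chunk boundary.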
- Run the MCP server:
uv run python mcp_server.py

To use this project as an MCP server in GitHub Copilot Chat:
- Open Copilot Chat and go to Configure Tools.
- Scroll down and click Add More Tools → Add MCP Server.
- For Command (stdio), enter:
uv run --directory C:/Users/mehmu/OneDrive/Masaüstü/DeepstreamMCP mcp_server.py
⚠️ Note: Adjust the path after --directory to match your own workspace location.
- Set the Server ID to:
deepstream_docs_http
- For Workspace, select Global.
- When prompted, your mcp.json should look like this (with your correct path):
{
  "servers": {
    "deepstream_docs_http": {
      "command": "uv",
      "args": [
        "run",
        "--directory",
        "C:/Users/mehmu/OneDrive/Masaüstü/DeepstreamMCP",
        "mcp_server.py"
      ],
      "type": "stdio"
    }
    // ...other servers...
  },
  "inputs": []
}
or with the interactive client:
uv run python client.py mcp_server.py

or with the Gemini chatbot:
uv run python gemini_chatbot.py mcp_server.py

Requirements:
- Python 3.12+
- torch, torchvision, torchaudio
- sentence_transformers
- chromadb
- beautifulsoup4, readability-lxml
- requests
- mcp, mcp-cli
- google-generativeai, python-dotenv
See requirements.txt and pyproject.toml for the full list of dependencies.