A Model Context Protocol (MCP) server that provides AI assistants with access to Grok 4's capabilities, including chat completions, live search, and model management.
- Chat Completions: Interact with Grok 4 for conversational AI tasks
- Live Search: Real-time web search with structured results
- Multi-Model Support: Supports grok-4-1-fast-reasoning, grok-4-1-fast-non-reasoning, and grok-code-fast-1
- Rate Limiting: Built-in request throttling and circuit breaker patterns
- Caching: Intelligent response caching for improved performance
- Metrics: Prometheus metrics for monitoring and observability
- Security: Input validation, sanitization, and secure configuration
| Model | Context Window (tokens) | TPM (tokens/min) | RPM (requests/min) | Input Price | Output Price | Use Case |
|---|---|---|---|---|---|---|
| grok-4-1-fast-reasoning | 2,000,000 | 4M | 480 | $0.20/M tokens | $0.50/M tokens | General reasoning tasks |
| grok-4-1-fast-non-reasoning | 2,000,000 | 4M | 480 | $0.20/M tokens | $0.50/M tokens | Fast, direct responses |
| grok-code-fast-1 | N/A | N/A | N/A | N/A | N/A | Code generation and analysis |
- Node.js >= 18.0.0
- npm or yarn
- xAI API key
- Clone the repository:

git clone <repository-url>
cd grok-4-mcp-server

- Install dependencies:

npm install

- Configure environment variables:

cp .env.example .envrc

Edit .envrc with your configuration:
export XAI_API_KEY="your-xai-api-key-here"
export GROK_MODEL="grok-4-1-fast-reasoning" # or grok-4-1-fast-non-reasoning, grok-code-fast-1
export GROK_BASE_URL="https://api.x.ai/v1" # Optional, defaults provided
export GROK_TEMPERATURE="0.7" # Optional, 0.0-1.0
export GROK_MAX_TOKENS="4000" # Optional
export MCP_SERVER_NAME="grok-4-mcp-server" # Optional
export MCP_SERVER_VERSION="1.0.0" # Optional

- Load environment variables:
direnv allow # If using direnv
# or
source .envrc

- Build the project:

npm run build

- Start the server:

npm start

For development:

npm run dev

The server implements the Model Context Protocol and can be integrated with any MCP-compatible client. It exposes the following tools:
- grok_ask: Ask Grok a question with optional context and search
- grok_chat: Multi-turn conversations with Grok
- grok_search: Live web search functionality
- grok_models: List available Grok models
- grok_test_connection: Test API connectivity
- grok_health: Server health check and metrics
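For example, an MCP-compatible client can launch the server over stdio and invoke these tools programmatically. The sketch below uses the @modelcontextprotocol/sdk TypeScript client; the grok_ask argument names (question, search) are illustrative assumptions and should be checked against the tool schemas the server actually advertises.

```typescript
// Minimal sketch of an MCP client calling this server's grok_ask tool.
// The argument names ("question", "search") are assumptions; confirm them
// against the schemas returned by listTools().
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StdioClientTransport } from "@modelcontextprotocol/sdk/client/stdio.js";

async function main(): Promise<void> {
  // Launch the built server as a child process and talk to it over stdio.
  const transport = new StdioClientTransport({
    command: "node",
    args: ["dist/index.js"],
    env: { XAI_API_KEY: process.env.XAI_API_KEY ?? "" },
  });

  const client = new Client(
    { name: "example-client", version: "1.0.0" },
    { capabilities: {} },
  );
  await client.connect(transport);

  // List the tools the server exposes (grok_ask, grok_chat, grok_search, ...).
  const { tools } = await client.listTools();
  console.log(tools.map((tool) => tool.name));

  // Ask Grok a question with live search enabled.
  const result = await client.callTool({
    name: "grok_ask",
    arguments: { question: "Summarize the Model Context Protocol.", search: true },
  });
  console.log(JSON.stringify(result, null, 2));

  await client.close();
}

main().catch(console.error);
```

Alternatively, the server can be registered declaratively in an MCP client's configuration file: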
{
"mcpServers": {
"grok": {
"command": "node",
"args": ["dist/index.js"],
"env": {
"XAI_API_KEY": "your-key-here"
}
}
}
}

| Variable | Default | Description |
|---|---|---|
| XAI_API_KEY | Required | Your xAI API key |
| GROK_MODEL | grok-4-1-fast-reasoning | Default model to use |
| GROK_BASE_URL | https://api.x.ai/v1 | API endpoint URL |
| GROK_TEMPERATURE | 0.7 | Response creativity (0.0-1.0) |
| GROK_MAX_TOKENS | 4000 | Maximum response tokens |
| LOG_LEVEL | info | Logging verbosity |
| NODE_ENV | production | Environment mode |
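As a rough illustration of how these variables fit together, the following sketch reads them with the defaults from the table above; the loadConfig helper is illustrative and is not the server's actual code.

```typescript
// Illustrative config loader for the environment variables above.
// Names and defaults mirror the table; this is not the server's actual code.
interface GrokConfig {
  apiKey: string;
  model: string;
  baseUrl: string;
  temperature: number;
  maxTokens: number;
  logLevel: string;
}

function loadConfig(env: NodeJS.ProcessEnv = process.env): GrokConfig {
  const apiKey = env.XAI_API_KEY;
  if (!apiKey) {
    throw new Error("XAI_API_KEY is required"); // fail fast on a missing key
  }

  const temperature = Number(env.GROK_TEMPERATURE ?? "0.7");
  if (Number.isNaN(temperature) || temperature < 0 || temperature > 1) {
    throw new Error("GROK_TEMPERATURE must be between 0.0 and 1.0");
  }

  return {
    apiKey,
    model: env.GROK_MODEL ?? "grok-4-1-fast-reasoning",
    baseUrl: env.GROK_BASE_URL ?? "https://api.x.ai/v1",
    temperature,
    maxTokens: Number(env.GROK_MAX_TOKENS ?? "4000"),
    logLevel: env.LOG_LEVEL ?? "info",
  };
}
```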
The server includes built-in resilience features:
- Rate Limiting: 2 concurrent requests, 500ms minimum interval
- Circuit Breaker: Automatic failure handling with fallback
- Caching: 5-minute TTL LRU cache for responses
- Connection Pooling: HTTP agent with keep-alive connections
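To illustrate the rate-limiting policy above (2 concurrent requests, 500 ms minimum interval), here is a simplified, self-contained sketch of how such a throttle can be implemented; it is a stand-in for, not a copy of, the server's internal logic.

```typescript
// Simplified illustration of the rate-limiting policy described above:
// at most 2 requests in flight and at least 500 ms between request starts.
// This is a stand-in, not the server's actual implementation.
class Throttle {
  private active = 0;
  private lastStart = 0;
  private queue: Array<() => void> = [];

  constructor(
    private readonly maxConcurrent = 2,
    private readonly minIntervalMs = 500,
  ) {}

  async run<T>(task: () => Promise<T>): Promise<T> {
    await this.acquire();
    try {
      return await task();
    } finally {
      this.active--;
      this.queue.shift()?.(); // wake the next waiter, if any
    }
  }

  private async acquire(): Promise<void> {
    for (;;) {
      if (this.active >= this.maxConcurrent) {
        // Wait for a slot to free up, then re-check.
        await new Promise<void>((resolve) => this.queue.push(resolve));
        continue;
      }
      // Enforce the minimum spacing between request starts.
      const wait = this.lastStart + this.minIntervalMs - Date.now();
      if (wait > 0) {
        await new Promise((resolve) => setTimeout(resolve, wait));
        continue; // re-check both conditions after sleeping
      }
      this.active++;
      this.lastStart = Date.now();
      return;
    }
  }
}

// Usage (callGrokApi is a placeholder for the actual outgoing API call):
// const throttle = new Throttle();
// const reply = await throttle.run(() => callGrokApi(prompt));
```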
npm test
npm run test:watch

npm run lint
npm run lint:fix
npm run type-check

npm run build
npm run clean

- API Key Protection: Never commit API keys to version control
- Input Validation: All inputs are validated and sanitized
- Rate Limiting: Prevents abuse and ensures fair usage
- Error Handling: Sensitive information is not exposed in error messages
- Logging: Configurable log levels prevent sensitive data leakage
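As a sketch of what the input validation and sanitization above might look like in practice, the function below rejects non-string or oversized prompts and strips control characters; the limits and function name are illustrative only, not the server's actual code.

```typescript
// Illustrative input validation/sanitization for an incoming tool argument.
// The length limit and control-character stripping are examples only.
const MAX_PROMPT_LENGTH = 32_000;

function sanitizePrompt(raw: unknown): string {
  if (typeof raw !== "string" || raw.trim().length === 0) {
    throw new Error("Prompt must be a non-empty string");
  }
  if (raw.length > MAX_PROMPT_LENGTH) {
    throw new Error(`Prompt exceeds ${MAX_PROMPT_LENGTH} characters`);
  }
  // Strip control characters that have no place in a chat prompt.
  return raw.replace(/[\u0000-\u0008\u000B\u000C\u000E-\u001F]/g, "").trim();
}
```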
The server exposes Prometheus metrics at /metrics (when the health endpoint is called):
- Request latency histograms
- Request counters by tool
- Error counters
- Cache hit/miss ratios
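A minimal sketch of how such metrics can be registered with prom-client; the metric and label names below are examples and may differ from what the server actually exports.

```typescript
// Illustrative Prometheus instrumentation using prom-client.
// Metric names are examples and may not match the server's exported names.
import { Counter, Histogram, Registry } from "prom-client";

const registry = new Registry();

const requestDuration = new Histogram({
  name: "grok_request_duration_seconds",
  help: "Latency of tool requests in seconds",
  labelNames: ["tool"],
  registers: [registry],
});

const requestErrors = new Counter({
  name: "grok_request_errors_total",
  help: "Number of failed tool requests",
  labelNames: ["tool"],
  registers: [registry],
});

// Wrap a tool handler to record latency and errors per tool.
async function instrumented<T>(tool: string, handler: () => Promise<T>): Promise<T> {
  const stopTimer = requestDuration.startTimer({ tool });
  try {
    return await handler();
  } catch (err) {
    requestErrors.inc({ tool });
    throw err;
  } finally {
    stopTimer();
  }
}

// registry.metrics() returns the text exposition format served at /metrics.
```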
- "API key not found": Ensure
XAI_API_KEYis set in your environment - "Connection timeout": Check network connectivity and API endpoint URL
- "Rate limit exceeded": Implement client-side rate limiting or increase intervals
- "Model not available": Verify the model name is correct and supported
Enable verbose logging:
export LOG_LEVEL=debug
export NODE_ENV=development

Test server connectivity:

curl -X POST localhost:3000/health

- Fork the repository
- Create a feature branch
- Make your changes
- Add tests for new functionality
- Ensure all tests pass
- Submit a pull request
- Follow TypeScript best practices
- Add comprehensive tests
- Update documentation for API changes
- Use conventional commit messages
- Ensure compatibility with Node.js >= 18
MIT License - see LICENSE file for details.
- Issues: GitHub Issues
- Documentation: Full Docs
- Community: [Discord/Slack]
- Initial release with Grok 4 support
- MCP protocol implementation
- Multi-model support
- Comprehensive error handling and monitoring
Built with ❤️ for the AI community