Refact.ai is an open-source, local-first AI coding assistant and autonomous software-engineering agent built by Small Magellanic Cloud Ai Ltd. ("SmallCloud"). The product combines an IDE-integrated chat experience (Ask / Explore / Debug / Review / Plan / Agent modes), accurate code completion powered by Qwen2.5-Coder with RAG over the workspace, and the Refact Agent — an autonomous mode that plans, executes, and iterates on engineering tasks end-to-end, integrating with Git hosts, databases, shells, browsers, and MCP servers. The full agent stack — a Rust HTTP/LSP engine (refact-lsp), a React/Vite chat GUI, and VS Code + JetBrains plugins — is open source under BSD-3-Clause at github.com/smallcloudai/refact and is currently ranked #1 open-source agent on SWE-bench Lite (60.0%) and 93.3% on Aider's Polyglot benchmark with thinking mode.
URL: Visit APIs.json
Run: Capabilities Using Naftiko
- AI, Artificial Intelligence, Coding Assistant, AI Agent, Autonomous Agents, Code Completion, Code Generation, Developer Tools, IDE, VS Code, JetBrains, Self-Hosting, On-Premise, Open Source, LSP, MCP, Model Context Protocol, Fine-Tuning, SWE-Bench, RAG
- Created: 2026-05-25
- Modified: 2026-05-25
Local HTTP/LSP API exposed by the Rust refact-lsp engine that runs inside the user's IDE or as a standalone server. Implements the agent runtime: provider/model capabilities, chat command queueing, SSE event streams, code completion, tool registry and confirmation rules, AST indexing, RAG/vector search over the workspace, integrations configuration, knowledge graph operations, task board management, and workspace checkpoint/rollback. The VS Code and JetBrains plugins and the React GUI all call this contract over http://127.0.0.1:<port>/v1.
Documented route groups under /v1 include:
/ping,/caps— health and capabilities discovery/chats/{id}/commands,/chats/subscribe— chat command queueing and SSE event streams/code-completion— completion requests/tools,/tools-check-if-confirmation-needed— tool registry and confirmation/ast-status,/ast-file-symbols— AST indexing/rag-status,/vecdb-search— semantic search over the workspace/integrations— provider, MCP, GitHub/GitLab, DB, Docker integration config/knowledge/*,/knowledge-graph— long-term memory and knowledge ops/tasks/*— agent task board/checkpoints-preview,/checkpoints-restore— workspace rollback
Human URL: https://docs.refact.ai
Refact Agent acts as an MCP (Model Context Protocol) client, attaching local or remote MCP servers (npx, Python -m, docker run, or remote SSE) into the agent's tool surface with per-tool confirmation rules. This makes any MCP server an extension of Refact's capability set inside the IDE.
Human URL: https://docs.refact.ai/features/autonomous-agent/integrations/mcp/
| Plan | Price | Notes |
|---|---|---|
| Free | $0 | Limited daily agent usage, 32k context chat, unlimited completions |
| Pro | $10 / month | 40 agent requests/day, 64k context, premium models, Think Mode |
| Enterprise | Contact | On-premise, fine-tuning on customer code, SSO, dedicated support |
| BYOK | Your provider's rates | Bring your own Anthropic / OpenAI / Gemini / xAI / DeepSeek / Groq / OpenRouter / Copilot key |
| Self-Hosted | Free (BSD-3) | Run the open-source stack on your own infra; pay only for the models you choose |
Note: Refact Cloud is being wound down — BYOK and self-hosting are now the recommended paths.
Cloud agent / chat: GPT-4.1 (default), Claude 3.7 Sonnet, Claude 3.5 Sonnet, GPT-4o, GPT-4o-mini, o3-mini.
Code completion: Qwen2.5-Coder-1.5B.
BYOK providers: Anthropic, OpenAI, Google Gemini, xAI Grok, DeepSeek, Groq, OpenRouter, GitHub Copilot, plus any OpenAI-compatible endpoint.
Local/self-hosted providers: Ollama, LM Studio, vLLM, custom OpenAI-compatible endpoints.
Self-hosted fine-tunable models: Refact, StarCoder, DeepSeek-Coder, CodeLlama variants (20+ options across completion and chat).
| Repo | Language | Description |
|---|---|---|
| smallcloudai/refact | Rust / TypeScript / Kotlin | Monorepo — engine, GUI, VS Code & JetBrains plugins, docs (BSD-3-Clause) |
| smallcloudai/refact-bench | Dockerfile | SWE-Bench benchmarking harness for coding agents |
| smallcloudai/rust-sdk | Rust | MCP Rust SDK (fork of modelcontextprotocol/rust-sdk) |
| smallcloudai/litellm | Python | LiteLLM proxy fork |
| smallcloudai/refact-vscode | TypeScript | VS Code plugin (archived — now in monorepo) |
| smallcloudai/refact-intellij | Kotlin | JetBrains plugin (archived — now in monorepo) |
- Refact for VS Code (Marketplace)
- Refact for JetBrains (Plugin Marketplace)
- Visual Studio, Neovim, and Sublime Text plugins documented at docs.refact.ai
- AWS Marketplace listing for enterprise EC2 deployment
- Website
- Documentation
- Quickstart
- Enterprise
- Pricing
- Blog
- Contact / Demo
- Sign Up / Sign In
- GitHub Organization
- Discord
- Twitter / X
- YouTube
- Kin Lane — kin@apievangelist.com — @apievangelist — apievangelist.com