Replit2Api

Self-hosted AI proxy gateway · 自托管 AI 代理网关

One Replit project. All major AI providers. OpenAI-compatible API.

一个 Replit 项目，统一接入主流 AI 服务商，提供 OpenAI 兼容接口。

目录 · Table of Contents

简介 · Overview
功能特性 · Features
快速开始 · Quick Start
API 端点 · Endpoints
认证方式 · Authentication
模型路由 · Model Routing
工具调用 · Tool Calling
扩展思考 · Extended Thinking
客户端接入 · Client Integration
Fleet Manager 多节点管理
自动更新 · Auto Update
更新日志 · Changelog

简介 · Overview

中文

Replit2Api 是一个运行在 Replit 上的 AI 代理网关。它将 OpenAI、Anthropic Claude、Google Gemini、OpenRouter 四大服务商统一暴露为单一的 OpenAI 兼容接口，让任何支持 OpenAI 格式的客户端（CherryStudio、SillyTavern、OpenWebUI 等）无需改动配置即可同时使用所有模型。

通过 Replit Remix（混搭） 功能一键部署，所有 API 密钥由 Replit Secrets 安全存储，支持热更新无需重新部署。

English

Replit2Api is an AI proxy gateway running on Replit. It unifies OpenAI, Anthropic Claude, Google Gemini, and OpenRouter behind a single OpenAI-compatible API endpoint, so any OpenAI-compatible client (CherryStudio, SillyTavern, OpenWebUI, etc.) can access all models without reconfiguration.

Deploy in one click via Replit Remix. All API keys are stored securely in Replit Secrets. Supports hot-update without redeployment.

功能特性 · Features

功能	说明	Feature	Description
🔀 多后端路由	按模型名自动路由到对应服务商	Multi-backend routing	Auto-routes by model name
📐 格式自动转换	请求/响应在 OpenAI ↔ Claude ↔ Gemini 间自动互转	Format conversion	Auto-converts between OpenAI / Claude / Gemini formats
🔧 工具调用	完整支持 OpenAI tools + tool_calls，自动转换到各后端	Tool calling	Full OpenAI tools/tool_calls, auto-converted per backend
🧠 扩展思考	`-thinking` / `-thinking-visible` 后缀别名，一键开启	Extended thinking	`-thinking` / `-thinking-visible` suffix aliases
⚡ 流式输出	所有端点均支持 SSE 流式，含 Claude 原生端点	Streaming (SSE)	All endpoints support SSE streaming
🔑 多种认证	Bearer Token / x-goog-api-key / ?key= URL 参数	Multi-auth	Bearer / x-goog-api-key / ?key= URL param
🎭 SillyTavern 兼容	自动修复 Claude 角色顺序，追加空 user 消息	SillyTavern compat	Auto-fixes Claude role ordering
🖥️ 管理面板	三栏 Web UI：首页/统计节点/端点文档	Admin panel	3-tab Web UI: Home / Stats & Nodes / API Docs
🛥️ Fleet Manager	批量管理子节点版本检测与一键更新	Fleet Manager	Batch version check & one-click update for sub-nodes
📦 热更新	无需 GitHub，通过文件包跨实例推送更新	Hot update	File-bundle push, no GitHub required

快速开始 · Quick Start

中文

点击下方链接，进入 Replit 项目页面，点击右上角 Remix 按钮，将完整项目复制到你的账户：

👉 https://replit.com/@Akatsukis036s/Replit-Api-Public
Remix 完成后，点击 Run 启动项目，等待服务器启动。
打开门户页面，按照内置的配置助手完成初始化（无需手动填写任何第三方 API Key）。
点击 Publish 发布到生产环境，将生产域名（https://your-app.replit.app）填入客户端的 Base URL。

English

Click the link below to open the Replit project, then click Remix in the top-right corner to copy it to your account:

👉 https://replit.com/@Akatsukis036s/Replit-Api-Public
After Remix, click Run to start the project and wait for the server to come up.
Open the portal page and follow the built-in Setup Wizard to complete initialization — no third-party API keys required.
Click Publish to go live. Set the production domain (https://your-app.replit.app) as the Base URL in your client.

API 端点 · Endpoints

所有端点均需认证（见下节）。All endpoints require authentication (see below).

端点	方法	说明	Endpoint	Method	Description
`/v1/chat/completions`	POST	OpenAI 聊天补全（主端点）	`/v1/chat/completions`	POST	OpenAI Chat Completions (main)
`/v1/models`	GET	返回所有可用模型列表	`/v1/models`	GET	List all available models
`/v1/messages`	POST	Anthropic Claude Messages 原生格式	`/v1/messages`	POST	Anthropic Claude Messages native format
`/v1beta/models/{model}:generateContent`	POST	Gemini 原生格式	`/v1beta/models/{model}:generateContent`	POST	Gemini native format
`/api/healthz`	GET	健康检查	`/api/healthz`	GET	Health check
`/api/update/version`	GET	查询当前版本	`/api/update/version`	GET	Query current version
`/api/update/apply`	POST	触发热更新	`/api/update/apply`	POST	Trigger hot update

认证方式 · Authentication

支持三种方式，任选其一 / Three methods supported, pick any:

# 1. Bearer Token（推荐 / Recommended）
curl https://your-app.replit.app/v1/models \
  -H "Authorization: Bearer YOUR_PROXY_API_KEY"

# 2. x-api-key 请求头 / header
curl https://your-app.replit.app/v1/models \
  -H "x-api-key: YOUR_PROXY_API_KEY"

# 3. URL 查询参数 / Query param（Gemini 客户端常用）
curl "https://your-app.replit.app/v1/models?key=YOUR_PROXY_API_KEY"

模型路由 · Model Routing

路由规则按模型名称自动判断，无需手动切换。 Routing is automatic based on model name — no manual switching needed.

模型名前缀/特征	路由目标	Model name prefix/pattern	Routes to
`gpt-`, `o1`, `o3`, `o4-`, `text-*`	OpenAI	`gpt-`, `o1`, `o3`, `o4-`	OpenAI
`claude-*`	Anthropic	`claude-*`	Anthropic
`gemini-*`	Google Gemini	`gemini-*`	Google Gemini
`provider/model`（含 `/`）	OpenRouter	Any name containing `/`	OpenRouter

扩展思考别名 · Extended Thinking Aliases

在任意支持思考的模型名后追加后缀即可： Append a suffix to any thinking-capable model:

claude-opus-4-5-thinking           → 思考模式，隐藏过程 / thinking, hidden
claude-opus-4-5-thinking-visible   → 思考模式，展示过程 / thinking, visible
gemini-2.5-pro-thinking            → Gemini 思考模式 / Gemini thinking

工具调用 · Tool Calling

网关完整支持 OpenAI tools + tool_calls 格式，自动转换到各后端原生格式： The gateway fully supports OpenAI tools + tool_calls, auto-converted per backend:

后端 / Backend	转换规则 / Conversion
Anthropic	`tools` → `input_schema`；`tool_choice: "required"` → `{type:"any"}`；`tool_calls` → `tool_use` blocks
Gemini	`tools` → `functionDeclarations`；`tool_calls` → `functionCall` parts
OpenAI / OpenRouter	原样透传 / Pass-through

流式工具调用完整支持 input_json_delta 逐块转发，finish_reason 正确映射为 "tool_calls"。 Streaming tool calls fully support input_json_delta chunk-by-chunk forwarding with correct "tool_calls" finish_reason.

// 示例请求 / Example request
{
  "model": "claude-opus-4-5",
  "tools": [{
    "type": "function",
    "function": {
      "name": "get_weather",
      "description": "Get current weather",
      "parameters": {
        "type": "object",
        "properties": { "city": { "type": "string" } },
        "required": ["city"]
      }
    }
  }],
  "messages": [{ "role": "user", "content": "北京今天天气怎么样？" }]
}

扩展思考 · Extended Thinking

// 开启思考（隐藏过程）/ Enable thinking (hidden)
{ "model": "claude-opus-4-5-thinking", ... }

// 开启思考（展示过程）/ Enable thinking (visible)
{ "model": "claude-opus-4-5-thinking-visible", ... }

// 或直接在请求体中传参 / Or pass in body directly
{
  "model": "claude-opus-4-5",
  "thinking": { "type": "enabled", "budget_tokens": 8000 }
}

客户端接入 · Client Integration

CherryStudio

设置 → 模型服务商 → 添加服务商，类型选 OpenAI Compatible
Base URL 填入生产域名；API Key 填入 PROXY_API_KEY
点击「检测」，自动加载全部模型
Settings → Model Providers → Add Provider → type: OpenAI Compatible
Base URL = your production domain; API Key = PROXY_API_KEY
Click "Check" — all models load automatically

SillyTavern

在管理面板首页开启 SillyTavern 兼容模式
API 类型选 OpenAI，Base URL 填入生产域名
模型名直接填 claude-opus-4-5 等
Enable SillyTavern Compatibility Mode on the admin panel home tab
API type: OpenAI, Base URL = production domain
Model name: e.g. claude-opus-4-5

OpenWebUI / LobeChat / 任意 OpenAI 兼容客户端

Base URL: https://your-app.replit.app
API Key: YOUR_PROXY_API_KEY
无需其他改动 / No other changes needed

Fleet Manager 多节点管理

在管理面板 统计 & 节点 页可管理多个 Replit2Api 实例： Manage multiple Replit2Api instances from the Stats & Nodes tab:

批量版本检测 — 一键查询所有子节点的当前版本号
一键更新 — 向子节点推送最新文件包，无需重新部署
导入/导出 — JSON 格式备份和恢复节点列表
节点数据存储在浏览器 localStorage，上游地址不暴露在 UI
Batch version check — query all sub-node versions in one click
One-click update — push the latest bundle to sub-nodes without redeployment
Import/Export — backup and restore node list in JSON format
Node data stored in browser localStorage; upstream URL never exposed in UI

自动更新 · Auto Update

门户更新（当前实例）

点击门户右上角版本徽标 → 复制提示词 → 粘贴到 Replit Agent 对话框，Agent 将自动从 GitHub 拉取最新代码并重启服务器。

To update this instance: Click the version badge in the portal → Copy prompt → Paste into Replit Agent chat. The Agent will pull the latest code from GitHub and restart automatically.

子节点批量更新（Fleet Manager）

在管理面板 统计 & 节点 页的 Fleet Manager 中填入子节点地址，点击「全部更新」可向所有子节点推送最新文件包：

本实例 (This instance)          子节点 (Sub-node)
      │                               │
      ├── GET /api/update/version ────→ 版本号比对
      │                               │
      └── POST /api/update/apply ─────→ 下载文件包
                                       解压覆盖
                                       自动重启

To update sub-nodes: Add sub-node URLs in Fleet Manager, then click "Update All" to push the latest bundle to all nodes at once.

更新日志 · Changelog

v1.1.1 — 2026-04-07

流式超时 120s → 600s：彻底解决长回复（≥5000 token）在传输途中被截断的问题
SSE 保活间隔 3s → 15s：减少不必要带宽消耗，仍可防止代理 60 秒空闲超时
ENV 节点配置复制框：统计页「添加节点」区块现在直接展示 ENV 提示词内容，点击一键复制
日志页只展示最新一条：其余历史版本折叠进滚动栏，首页核心功能无需滚动即可看全
Streaming timeout 120 s → 600 s: fixes truncation of long responses (5 000+ tokens)
SSE keepalive interval 3 s → 15 s: less overhead while still preventing proxy idle timeouts
ENV node copyable block: prompt text is now directly visible in the Stats page with a one-click copy button
Changelog shows only latest release: all older entries folded into a scrollable history section

v1.1.0 — 2026-04-06

项目正式更名为 Replit2Api（UI、标题、提示词全部同步更新）
子节点自动重试：请求失败后自动换节点重试，最多 3 次，精确区分 5xx 与网络错误；客户端无感知
流式 SSE 头延迟提交：首个数据包到达后才锁定连接，保留重试窗口
Token 用量 fallback 估算：子节点不返回 usage 时按字符数自动估算，统计页不再显示 0
ENV 节点提示词：统计页「添加节点」新增「复制提示词」按钮，一键发给 Replit Agent 配置永久 ENV 子节点
Project renamed to Replit2Api — UI, title, and prompts all updated
Friend proxy auto-retry: up to 3 attempts, distinguishes 5xx from network errors; client sees clean response
Streaming SSE header delay: headers committed only after first chunk, preserving retry window
Token fallback estimation: estimates from char count when sub-node omits usage field
ENV node prompt: "Copy prompt" button in Add Node section to configure permanent ENV sub-nodes via Replit Agent

v1.0.9 — 2026-04-06

配置助手弹窗逻辑修复：查询服务器状态，仅在 PROXY_API_KEY 未配置时自动弹出；配置完成后不再重复弹出
更新方式改为 Agent 提示词：点击版本徽标→复制提示词→粘贴到 Replit Agent，由 Agent 自动拉取最新代码并重启
统计刷新按钮修复：去除 marginTop: -16px，不再被遮挡
统计错误提示细化：区分 500（PROXY_API_KEY 未配置）与 401（API Key 不匹配）
Setup wizard logic fix: queries server status; auto-pops only when PROXY_API_KEY is not yet configured
Update flow changed to Agent prompt: click version badge → copy prompt → paste in Replit Agent chat
Stats refresh button fix: removed negative margin that caused overlap
Stats error differentiation: 500 = key not configured; 401 = key mismatch

v1.0.8 — 2026-04-06

批量节点管理：全选/多选现在覆盖所有子节点（含 ENV 节点），批量启用/禁用/移除
更新日志可滚动：默认展示最新 2 条，历史记录可滚动查看
配置助手第 3 步：开通 App Storage（GCS），子节点配置 Publish 后不再丢失
Batch node management: select-all now covers all sub-nodes including ENV nodes; bulk enable/disable/remove
Changelog scrollable: 2 latest entries shown by default; scroll to view history
Setup wizard step 3: provision App Storage so sub-node configs survive redeploys

v1.0.7 — 2026-04-06

云端持久化修复：动态子节点和禁用模型数据改用 Replit GCS 存储，重新部署不再清空
「重新检测」按钮修复：错误状态下也可点击，点击后显示加载动画和完成提示
Cloud persistence fix: dynamic backends and disabled models now use Replit GCS; survive redeploys
Re-check button fix: always clickable even after error; shows spinner and completion feedback

v1.0.6 — 2026-04-06

模型禁用管理：支持从管理面板禁用/启用单个模型，禁用后该模型不出现在 /v1/models 列表中
统计页包含全部节点：/v1/stats 现在返回所有后端（含禁用节点），禁用节点以红色边框标注
Model disable management: enable/disable individual models from admin panel; disabled models excluded from /v1/models
Stats includes all backends: /v1/stats now returns all nodes including disabled ones; shown with red border

v1.0.5 — 2026-04-06

配置助手重写：单步模式，一条指令覆盖全部初始化步骤，明确禁止 Agent 索取第三方 API Key
版本比较修复：正确处理预发布后缀（a/b/rc1 等）
子节点 URL 自动规范化：服务端与前端均自动补全 /api 后缀
X-Proxy-Version header 修复：过滤非 ASCII 字符，根除 ERR_INVALID_CHAR 崩溃
后端批量管理：多选后端，批量启用/禁用/删除
SetupWizard rewrite: single-step unified prompt; forbids Agent from asking for third-party API keys
Version comparison fix: correctly handles pre-release suffixes
Sub-node URL normalization: auto-appends /api suffix server-side and frontend
X-Proxy-Version header fix: strips non-ASCII chars, eliminates ERR_INVALID_CHAR crash
Batch backend management: multi-select for bulk enable/disable/remove

v1.0.1 — 2026-04-06

完整 tool calling 支持 — Claude、Gemini 自动格式互转
Claude 流式工具调用：input_json_delta 逐块转发，finish_reason 正确映射为 tool_calls
前端三栏重构：首页 / 统计 & 节点 / 端点文档
新增 Fleet Manager — 子节点批量版本检测与一键更新
Full tool calling support — auto-conversion for Claude (tool_use) and Gemini (functionDeclarations)
Claude streaming tool calls with correct tool_calls finish_reason
Frontend redesigned into 3-tab layout: Home / Stats & Nodes / API Docs
New Fleet Manager — batch version check and one-click update for sub-nodes

v1.0.0 — 2026-04-06

正式版发布 — 统一接入 OpenAI / Anthropic / Gemini / OpenRouter
支持 SillyTavern 兼容模式、CherryStudio 接入、多种认证方式
Replit 文件包热更新机制（无需 GitHub，跨实例推送）
Initial release — unified gateway for all major AI providers
SillyTavern compatibility, CherryStudio integration, multiple auth methods
Replit file-bundle hot-update system (no GitHub required)

许可证 · License

MIT License — 自由使用、修改、分发 / Free to use, modify, and distribute.

_{Powered by Replit · OpenAI · Anthropic · Google Gemini · OpenRouter}

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
artifacts		artifacts
.npmrc		.npmrc
.replitignore		.replitignore
README.md		README.md
package.json		package.json
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.base.json		tsconfig.base.json
tsconfig.json		tsconfig.json
version.json		version.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Replit2Api

目录 · Table of Contents

简介 · Overview

功能特性 · Features

快速开始 · Quick Start

API 端点 · Endpoints

认证方式 · Authentication

模型路由 · Model Routing

扩展思考别名 · Extended Thinking Aliases

工具调用 · Tool Calling

扩展思考 · Extended Thinking

客户端接入 · Client Integration

CherryStudio

SillyTavern

OpenWebUI / LobeChat / 任意 OpenAI 兼容客户端

Fleet Manager 多节点管理

自动更新 · Auto Update

更新日志 · Changelog

v1.1.1 — 2026-04-07

v1.1.0 — 2026-04-06

v1.0.9 — 2026-04-06

v1.0.8 — 2026-04-06

v1.0.7 — 2026-04-06

v1.0.6 — 2026-04-06

v1.0.5 — 2026-04-06

v1.0.1 — 2026-04-06

v1.0.0 — 2026-04-06

许可证 · License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Replit2Api

目录 · Table of Contents

简介 · Overview

功能特性 · Features

快速开始 · Quick Start

API 端点 · Endpoints

认证方式 · Authentication

模型路由 · Model Routing

扩展思考别名 · Extended Thinking Aliases

工具调用 · Tool Calling

扩展思考 · Extended Thinking

客户端接入 · Client Integration

CherryStudio

SillyTavern

OpenWebUI / LobeChat / 任意 OpenAI 兼容客户端

Fleet Manager 多节点管理

自动更新 · Auto Update

更新日志 · Changelog

v1.1.1 — 2026-04-07

v1.1.0 — 2026-04-06

v1.0.9 — 2026-04-06

v1.0.8 — 2026-04-06

v1.0.7 — 2026-04-06

v1.0.6 — 2026-04-06

v1.0.5 — 2026-04-06

v1.0.1 — 2026-04-06

v1.0.0 — 2026-04-06

许可证 · License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages