Highlights
Model sharing: securely share your local models with friends over a free tunnel. Works with any OpenAI Responses API client (Codex, opencode, …), not only Neko Route. Each token is scoped to a set of allowed models and carries a spend quota plus concurrency and RPM limits; keys (sk-…) and downstream model IDs are customizable. Image generation is supported and metered.
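As a rough illustration of what "any OpenAI Responses API client" means in practice, the sketch below builds a plain Responses API request using only the Python standard library. The base URL, the key, and the model ID are hypothetical placeholders (the real values come from whoever shares the model), and the assumption that the shared endpoint is served under `/v1/responses` is ours, not something these notes state.

```python
import json
import urllib.request

# Hypothetical placeholders: substitute the tunnel URL and sk-… key you were given.
BASE_URL = "https://example-tunnel.invalid/v1"
API_KEY = "sk-example"


def build_responses_request(model: str, prompt: str,
                            max_output_tokens: int = 256) -> urllib.request.Request:
    """Build (but do not send) a POST request in OpenAI Responses API shape."""
    body = {
        "model": model,                          # downstream model ID exposed by the sharer
        "input": prompt,                         # Responses API uses "input", not "messages"
        "max_output_tokens": max_output_tokens,  # standard Responses API parameter
    }
    return urllib.request.Request(
        f"{BASE_URL}/responses",                 # assumed path on the shared endpoint
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# "my-shared-model" is an illustrative model ID, not a real one.
req = build_responses_request("my-shared-model", "Say hello.")
# To actually send it: urllib.request.urlopen(req)
```

The request is only constructed here, not sent, so the sketch runs without network access; an SDK-based client would typically need just the base URL and key swapped in the same way.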
Fixes
- Streaming requests are now billed, and OpenAI cached input tokens are no longer double-counted.
- Shared requests route to the requested model regardless of Codex default / fallback / auxiliary / memory settings.
- Standard clients sending max_output_tokens no longer fail against official accounts.
- Request logs separate shared from local traffic and can be filtered by token.
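To make the cached-token fix concrete: in OpenAI usage objects, `input_tokens_details.cached_tokens` is a subset of `input_tokens`, so charging both at face value counts the cached portion twice. The sketch below shows one way to meter a request without that overlap. It is a minimal illustration, not the project's actual billing code; the `Prices` type, `request_cost` function, and all price values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prices:
    """Hypothetical per-token prices in integer micro-units (avoids float rounding)."""
    input_per_token: int
    cached_input_per_token: int
    output_per_token: int


def request_cost(usage: dict, prices: Prices) -> int:
    """Cost of one request, charging cached input tokens exactly once."""
    input_tokens = usage.get("input_tokens", 0)
    cached = usage.get("input_tokens_details", {}).get("cached_tokens", 0)
    output_tokens = usage.get("output_tokens", 0)

    # cached_tokens is already included in input_tokens, so subtract it
    # before applying the full input price.
    uncached = max(input_tokens - cached, 0)
    return (
        uncached * prices.input_per_token
        + cached * prices.cached_input_per_token
        + output_tokens * prices.output_per_token
    )


# Illustrative numbers only.
prices = Prices(input_per_token=2, cached_input_per_token=1, output_per_token=8)
usage = {
    "input_tokens": 1000,
    "input_tokens_details": {"cached_tokens": 600},
    "output_tokens": 200,
}
cost = request_cost(usage, prices)  # 400*2 + 600*1 + 200*8 = 3000
```

With these example numbers, the double-counting variant (full price on all 1000 input tokens plus the cached charge) would come to 4200 instead of 3000.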