Skip to content

0.2.2

Latest

Choose a tag to compare

@zoefix zoefix released this 28 Jun 05:53

Highlights

Model sharing — securely share your local models with friends over a free tunnel. Works with any OpenAI Responses API client (Codex, opencode, …), not only Neko Route. Each token scopes its allowed models with a spend quota and concurrency / RPM limits; keys (sk-…) and downstream model IDs are customizable. Image generation is supported and metered.

Fixes

  • Streaming requests are now billed, and OpenAI cached input tokens are no longer double-counted.
  • Shared requests route to the requested model regardless of Codex default / fallback / auxiliary / memory settings.
  • Standard clients sending max_output_tokens no longer fail against official accounts.
  • Request logs separate shared from local traffic and can be filtered by token.