Skip to content

v0.6.7

Choose a tag to compare

@Oaklight Oaklight released this 04 Jun 05:35
· 74 commits to master since this release

Fixed

  • Embedding endpoint upstream_model alias: The /v1/embeddings passthrough handler now substitutes the upstream_model name into the request body before forwarding. Previously model aliases (e.g. bge-m3BAAI/bge-m3) were ignored, causing upstream model-not-found errors.
  • Admin test timer leak: The elapsed-time counter is now tracked globally and cleared when a new test starts, preventing multiple timers from writing alternating values to the same display element.
  • Admin test timeout auto-cancel: When the browser-side 120s timeout fires, the server-side task is now explicitly cancelled. Added asyncio.wait_for() with 120s timeout server-side so hung upstream calls don't linger until the 300s cleanup window.