v0.0.8

@xiaguan released this 22 Jan 09:12
· 216 commits to master since this release
9e2c41f

What's Changed

  • test: add E2E correctness canary test for vLLM integration by @xiaguan in #44
  • fix(connector): use correct torch device for CUDA sync in TP mode by @xiaguan in #46
  • feat: add vllm-async-loading-debug skill by @xiaguan in #47
  • Feat/connector preemption and logging by @xiaguan in #48
  • chore: bump version to 0.0.8 by @xiaguan in #49

Full Changelog: v0.0.7...v0.0.8