v1.7.0-1
Pre-release
What's Changed
- Installables: allow custom ones by @podkidyshev in #885
- NIXL EP: add single sbatch support by @podkidyshev in #889
- Append trajectory row on cache hits by @rutayan-nv in #888
- Ipod/custom srun bash by @podkidyshev in #896
- [Configurator] Make select_action observation-aware by @rutayan-nv in #892
- feat(dynamo_mocker): add GPU-free LLM inference simulation workload by @saivishal1999 in #895
- Bump idna from 3.11 to 3.15 by @dependabot[bot] in #897
- Bump python-dotenv from 1.2.1 to 1.2.2 by @dependabot[bot] in #878
- Bump urllib3 from 2.6.3 to 2.7.0 by @dependabot[bot] in #887
- vLLM/SGLang: add semantic degradation support by @podkidyshev in #890
- feat(ai_dynamo): add aiperf workload support by @saivishal1999 in #898
- AIDynamo: add semantic degradation evaluation support by @podkidyshev in #903
- AIDynamo: enable LMCache by @podkidyshev in #906
- AIDynamo: enable multiple AIPerf runs during a single test run by @podkidyshev in #907
- AIDynamo: Optional restart of DynamoRouter between AIPerf re-runs by @podkidyshev in #908
- AIDynamo: shared node disagg inference by @podkidyshev in #909
- vLLM/SGLang: comparison report by @podkidyshev in #904
- NIXL EP: comparison report by @podkidyshev in #911
New Contributors
- @saivishal1999 made their first contribution in #895
Full Changelog: v1.6.1...v1.7.0-1