1.2.0.dev20260602003545
·
120 commits
to main
since this release
Installation
Via PyPI
pip install pjrt-plugin-tt==1.2.0.dev20260602003545 --extra-index-url https://pypi.eng.aws.tenstorrent.com/
pip install vllm-tt==1.2.0.dev20260602003545 --extra-index-url https://pypi.eng.aws.tenstorrent.com/Via Docker
docker pull ghcr.io/tenstorrent/tt-xla-slim:1.2.0.dev20260602003545What's Changed
- [Composite] Add nn.RMSNorm module-form support by @kamalrajkannan78 in #4985
- [Playground v2.5] Add end-to-end pipeline example by @kamalrajkannan78 in #4992
- Reduce vLLM decode graphs from 5 to 2 by @alinakhanTT in #4789
- Add Gemma-4-31B-it to vLLM benchmarks on QB2 Blackhole by @kmabeeTT in #5012
- Lower gemma 1.1_7B_IT inference pcc threshold by @vzeljkovicTT in #5026
- [Tests] Xfail training PCC failures for phi1, phi1_lora, gemma_lora by @vzeljkovicTT in #5027
- Lower sdxl clip threshold nightly by @vzeljkovicTT in #5023
- Set targetModule path as a default for emitPy testing by @amilovanovicTT in #4819
- Add Qwen3-32B vLLM perf benchmark for QuietBox2 (batch 1) by @ssaliceTT in #5030
- [Benchmark] Fix multichip arch, perf regression check and qb2 transformers pin failures by @vkovacevicTT in #5020
Full Changelog: 1.2.0.dev20260601003137...1.2.0.dev20260602003545