Release v2.7.0 · tenstorrent/tt-studio

The official release of v2.7.0 for TT Studio is here! Please pull the latest updates, explore the new features, and continue to use the in-app bug reporting tool to submit feedback.

What's Changed

Full Release v2.7.0 in 7d19c0d via PR #884

Changelog v2.7.0

🚀 New Features

Wake Mode & VAD for Voice Agent: Added wake-word activation alongside voice-activity detection and push-to-talk, letting the Voice Agent listen and respond hands-free or on demand. (#806)
Image Pull & Download Progress: Introduced a live progress bar during Docker image pulls, with an accurate hand-off into container-start tracking so deployment feedback stays continuous. (#758, #866)
Media-Model Progress Bar: Extended deployment progress visibility to TTS and Whisper media models. (#812)
Startup Freshness Check: run.py now detects stale local state on bring-up and surfaces a warning or prompt before proceeding. (#724)
--cleanup vs --cleanup-all: Reworked teardown so --cleanup preserves deployment containers and saved secrets, while --cleanup-all fully resets TT Studio to a fresh-clone state. The full reset wipes generated logs, preferences, virtualenvs, named model-weight volumes, deployment containers, and locally built images behind a single confirmation prompt (pass --yes for non-interactive use). (#799, #833)
Quicker Parallel Delete & Reset: Faster delete and reset for individual deployments within multi-slot configurations. (#841)
Enhanced Device Management & Reset: Improved device allocation and reset handling within the Docker control service. (#788)
Public Landing Page: Added a GitHub Pages landing site served from the docs/ root. (#839, #844)
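
To make the --cleanup vs --cleanup-all distinction above concrete, here is a minimal sketch of how the two flags and --yes could interact. It is an illustration only, not the actual run.py code: the function names (`build_parser`, `teardown_plan`), the returned dictionary shape, and the mutual exclusivity of the two flags are all assumptions. Only the flag names and the listed categories come from the changelog entry.

```python
import argparse

# Categories taken from the release note. How the real run.py groups or
# names them internally is not stated, so these lists are illustrative.
PRESERVED_BY_CLEANUP = ["deployment containers", "saved secrets"]
WIPED_BY_CLEANUP_ALL = [
    "generated logs",
    "preferences",
    "virtualenvs",
    "named model-weight volumes",
    "deployment containers",
    "locally built images",
]


def build_parser():
    """Hypothetical parser exposing the three flags named in the changelog."""
    parser = argparse.ArgumentParser(prog="run.py")
    # Assumption: the two teardown modes cannot be combined.
    mode = parser.add_mutually_exclusive_group()
    mode.add_argument("--cleanup", action="store_true")
    mode.add_argument("--cleanup-all", action="store_true")
    parser.add_argument("--yes", action="store_true",
                        help="skip the confirmation prompt")
    return parser


def teardown_plan(argv, confirm=input):
    """Return a description of what a teardown would do, without doing it."""
    args = build_parser().parse_args(argv)
    if args.cleanup_all:
        # A single confirmation guards the full reset unless --yes is given.
        if not args.yes:
            answer = confirm("Reset TT Studio to a fresh-clone state? [y/N] ")
            if answer.strip().lower() not in ("y", "yes"):
                return {"mode": "aborted"}
        return {"mode": "cleanup-all", "wipe": list(WIPED_BY_CLEANUP_ALL)}
    if args.cleanup:
        # The note only states what --cleanup keeps, so only that is modeled.
        return {"mode": "cleanup", "preserve": list(PRESERVED_BY_CLEANUP)}
    return {"mode": "none"}
```

In this sketch, `teardown_plan(["--cleanup-all", "--yes"])` skips the prompt entirely, which is the behavior a CI job or other non-interactive caller would want.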

🐛 Bug Fixes

Deployment History Status: Stopped models now report a real end time and a proper stopped status instead of showing "N/A" or "Still running". (#868)
API Examples for Non-LLM Models: Resolved 404s when fetching example snippets for TTS and Whisper models. (#876)
TPS for Base Models: Corrected tokens-per-second reporting for base (non-instruct) models. (#822)
Qwen3-32B Device IDs: Fixed incorrect device IDs displayed on the Models Deployed page. (#837)
HuggingFace Integration: Updated docker-compose.yml to ensure correct first-run HuggingFace integration. (#842)
Terms Acceptance Persistence: User terms acceptance is now persisted across restarts, removing the repeat prompt. (#847)
QB2 Preloaded Weights: Preloaded weights under the pinned release are now reused instead of being re-downloaded. (#848)

🔧 Technical Improvements

Unified Streaming Views: Stop and reset actions now share unified streaming views for consistent live feedback. (#872)
Single Source of Truth for Read State: Consolidated all read paths onto one authoritative state source. (#814)
Model Catalog Cleanup: Removed non-working and deprecated models, including Qwen3-8B. (#869, #878)
P300 Naming & Model Visibility: Bumped the inference artifact to v0.15.0 and corrected P300 labels and model visibility. (#815)
tt-smi Upgrade: Updated tt-smi from v4.0.0 to v4.1.2. (#824)
RAG Data Source: Refreshed the RAG knowledge base to the latest Tenstorrent documentation. (#870)
Progress-Bar Color Scheme: Polished the progress-bar color scheme. (#834)
Deprecated Legacy Startup Script: Removed the legacy startup.sh in favor of run.py. (#805)

📄 Docs & Chores

Slimmed the root README and linked a dedicated setup guide. (#823)
Added license files and a compliance-check script. (#875)
Cleaned up stray test files and refreshed the footer. (#845)

🧪 Testing

Validated against the v2.7.0-rc test plan on a p300x2 (QB2) host.

As always, thank you for your help in testing this release!
