Release 1.3.0 · tenstorrent/tt-forge

Installation

Via PyPI

pip install tt-forge==1.3.0 --extra-index-url https://pypi.eng.aws.tenstorrent.com/

Via Docker

docker pull ghcr.io/tenstorrent/tt-forge-slim:1.3.0

Dependency commits

tt-xla commit: 93ac846371fd1a118265a3226b5a4ce0bdf939fe
tt-mlir commit: 70ff200c7d2fa8d8401f316a0a0b35ee88cbfb72
tt-metal commit: 7f6364a11dafadf141b6c87358073d9e3d1dd22f

What's Changed

Add workflow to test installations steps using AI by @vmilosevic in #921
Uplift third_party/tt_forge_models to 37332bc2fa41c6d8ff7ec994ad19b84ab266429f 2026-03-28 by @vvukomanTT in #922
Improve claude instructions testing by @vmilosevic in #923
Bump release version to 1.1.0 by @vvukomanTT in #924
Add a number of skills for the tenstorrent ecosystem by @zoecarver in #925
Update claude workflow by @vmilosevic in #926
Uplift third_party/tt_forge_models to e7ba96f22de7ced01b151393f4d4e1bc312fac23 2026-04-04 by @vvukomanTT in #930
Remove benchmark infrastructure from tt-forge by @chandrasekaranpradeep in #928
Upgrade checkout from v4 to v5, Add venv activate by @nsumrakTT in #929
Uplift third_party/tt_forge_models to 752832c839d59bb6c3fde43d7b9c03ebc820fa57 2026-04-11 by @vvukomanTT in #931
Add tt-xla llama demo by @ddilbazTT in #932
Add tiny llama demo by @ddilbazTT in #933
Uplift third_party/tt_forge_models to 534f4821827f4e92af6a1ad58e8c836c8ccd38fd 2026-04-18 by @vvukomanTT in #935
Stop tt-forge-onnx nightly releases by @vvukomanTT in #936
Uplift third_party/tt_forge_models to d11fe6671e44d286c2e8c6dcd9bd46a3b84f9ea8 2026-04-25 by @vvukomanTT in #937
Uplift third_party/tt_forge_models to cae9ccbc67a318736f656bee9a9ea776eb73e69c 2026-05-02 by @vvukomanTT in #940
Remove depreciated files by @nsumrakTT in #941
Add qwen3 demo by @abrown in #934
Add filter test matrix file by @vvukomanTT in #942
Refactor into call claude by @vmilosevic in #946
Update requirements to uplift PyTorch by @mmanzoorTT in #944
Uplift third_party/tt_forge_models to f224af305a10d38acb9fbd72c0c3514b26ec4544 2026-05-09 by @vvukomanTT in #980
Add ai model bringup workflow by @vmilosevic in #986
Ai bringup update by @vmilosevic in #987
Update ai-bringup workflow by @vmilosevic in #988
Update ai workflow docker image by @vmilosevic in #989
Update ai workflow by @vmilosevic in #990
Update ai workflow by @vmilosevic in #992
Uplift third_party/tt_forge_models to a64a98131c35b010895198f489355d0e6306934f 2026-05-16 by @vvukomanTT in #993
Run cpu bringup on builder by @vmilosevic in #994
Add master ai bringup by @vmilosevic in #1002
Fix push problem in ai workflows by @vmilosevic in #1003
Uplift third_party/tt_forge_models to 7201811e7020d0e35e908df47a9e57926ba0aa1c 2026-05-23 by @vvukomanTT in #1005
Uplift third_party/tt_forge_models to b845dfd5fefc65d5b7aff0515243b8c520758ba5 2026-05-30 by @vvukomanTT in #1006
Uplift third_party/tt_forge_models to 156da1137c7fde3afeea2801cf7e8f25f0d3000d 2026-06-03 by @vvukomanTT in #1009
Uplift third_party/tt_forge_models to 6b4b47a7c419cdc2713ddfc6e3179f61012c12f4 2026-06-06 by @vvukomanTT in #1011
Uplift third_party/tt_forge_models to 09239ae98eb4f0b03abe5240aca9418eb3131717 2026-06-13 by @vvukomanTT in #1013
Move ai bringup scripts to tt-forge-ai-bringup repo by @vmilosevic in #1004
Uplift third_party/tt_forge_models to 32d5c2e4a8cfd55b0f2ec99b3ec8d1b217fcb742 2026-06-20 by @vvukomanTT in #1014
Uplift third_party/tt_forge_models to 9af516c8daaa0634c10f00b1cd745042883845ea 2026-06-27 by @vvukomanTT in #1015

New Contributors

@zoecarver made their first contribution in #925
@abrown made their first contribution in #934
@mmanzoorTT made their first contribution in #944

Full Changelog: 1.0.0...1.3.0

LLM Performance

Model	Token/sec/user	Batch	Token/sec	ttft (ms)	Hardware
pytorch_DeepSeek-V3.1_deepseek_v3_1_modified_nlp_causal_lm_custom	3.0	64	192.0	4159.47	n150
pytorch_Falcon_3_1B_Base_nlp_causal_lm_huggingface	57.0	32	1824.0	663.53	n150
pytorch_Falcon_3_3B_Base_nlp_causal_lm_huggingface	38.0	32	1216.0	865.52	n150
pytorch_Falcon_3_7B_Base_nlp_causal_lm_huggingface	19.0	32	608.0	1186.81	n150
pytorch_GLM_4.7_nlp_causal_lm_huggingface	7.0	64	448.0	1590.81	n150
pytorch_Gemma_1.1_2B_IT_nlp_causal_lm_huggingface	40.0	32	1280.0	638.1	n150
pytorch_Kimi-K2.5_kimi_k2_5_modified_nlp_causal_lm_custom	3.0	64	192.0	4587.63	n150
pytorch_Kimi-K2_kimi_k2_instruct_modified_nlp_causal_lm_custom	3.0	64	192.0	4808.17	n150
pytorch_Llama_3.1_8B_Instruct_nlp_causal_lm_huggingface	23.0	32	736.0	2230.65	n150
pytorch_Llama_3.2_1B_Instruct_nlp_causal_lm_huggingface	67.0	32	2144.0	575.95	n150
pytorch_Llama_3.2_3B_Instruct_nlp_causal_lm_huggingface	31.0	32	992.0	605.5	n150
pytorch_Mistral_7B_INSTRUCT_v03_nlp_causal_lm_huggingface	20.0	32	640.0	1252.08	n150
pytorch_Mistral_Ministral_8B_Instruct_nlp_causal_lm_huggingface	12.0	32	384.0	550.28	n150
pytorch_Mistral_Small_24B_INSTRUCT_2501_nlp_causal_lm_huggingface	16.0	32	512.0	1790.87	n150
pytorch_Phi-1.5_Phi_1_5_nlp_causal_lm_huggingface	22.0	32	704.0	617.58	n150
pytorch_Phi-1_Phi_1_nlp_causal_lm_huggingface	22.0	32	704.0	632.51	n150
pytorch_Qwen 2.5_0.5B_Instruct_nlp_causal_lm_huggingface	78.0	32	2496.0	408.36	n150
pytorch_Qwen 2.5_1.5B_Instruct_nlp_causal_lm_huggingface	38.0	32	1216.0	461.04	n150
pytorch_Qwen 2.5_3B_Instruct_nlp_causal_lm_huggingface	31.0	32	992.0	697.85	n150
pytorch_Qwen 2.5_7B_Instruct_nlp_causal_lm_huggingface	16.0	32	512.0	859.7	n150
pytorch_Qwen 3_0_6B_nlp_causal_lm_huggingface	36.0	32	1152.0	1163.37	n150
pytorch_Qwen 3_1_7B_nlp_causal_lm_huggingface	30.0	32	960.0	746.03	n150
pytorch_Qwen 3_4B_nlp_causal_lm_huggingface	17.0	32	544.0	976.23	n150
pytorch_Qwen 3_8B_nlp_causal_lm_huggingface	13.0	32	416.0	1673.32	n150

Non-LLM Performance

Model	Batch	Sample/sec	Hardware
pytorch_BERT_emrecan/bert-base-turkish-cased-mean-nli-stsb-tr_nlp_embed_gen_huggingface	8	44.0	n150
pytorch_BGE-M3_Base_nlp_embed_gen_custom	4	9.0	n150
pytorch_BGE-M3_Base_nlp_embed_gen_custom	4	19.0	p150
pytorch_EfficientNet_Timm_B0_cv_image_cls_timm	8	349.0	n150
pytorch_MNIST_Cnn_Dropout_cv_image_cls_custom	32	14643.0	n150
pytorch_MobileNetV2_Mobilenet_v2_cv_image_cls_torch_hub	12	1237.0	n150
pytorch_Qwen 3_Embedding_4B_nlp_embed_gen_huggingface	32	46.0	n150
pytorch_ResNet_ResNet50_HuggingFace_cv_image_cls_huggingface	8	1339.0	n150
pytorch_SegFormer_B0_Finetuned_Ade_512_512_cv_image_seg_huggingface	1	39.0	n150
pytorch_Swin_S_cv_image_cls_torchvision	1	10.0	n150
pytorch_U-Net for Conditional Generation_Base_conditional_generation_huggingface	1	5.0	n150
pytorch_Ultra-Fast Lane Detection v2_TuSimple_ResNet34_Backbone_cv_image_seg_github	1	136.0	n150
pytorch_VGG19-UNet_base_cv_image_seg_custom	1	147.0	n150
pytorch_ViT_Base_cv_image_cls_huggingface	8	229.0	n150
pytorch_VoVNet_Ese_Vovnet19b_Dw.ra_In1k_cv_image_cls_timm	8	667.0	n150

Model coverage

Info: Full list of supported models is available in the assets section.

Model task	Model architecture	Model variant	Model framework	Inference	Training	n150	n300	p150	Single device	Data parallel	Tensor parallel	Model source
conditional generation	U-Net for Conditional Generation	Base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	AlexNet	Custom 1x2	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
cv image cls	DINOv2	Small	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	EfficientNet	B0	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	MNIST	Cnn Batchnorm	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Cnn Dropout	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Cnn Dropout	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Cnn Nodropout	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Mlp Custom	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Mlp Custom	jax	❌	✅	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Mlp Custom 1x2	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
cv image cls	MobileNetV1	Mobilenet v1	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MobileNetV2	Mobilenet v2	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	ResNet	ResNet50 HuggingFace High Resolution	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	SegFormer	Mit B0	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	Swin	S	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	VGG	HF Vgg19	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	ViT	Base	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	VoVNet	Ese Vovnet19b Dw.ra In1k	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image seg	Ultra-Fast Lane Detection	TuSimple ResNet18 Backbone	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image seg	VGG19-UNet	base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv img to img	Autoencoder	linear	pytorch	❌	✅	✅	❌	✅	✅	❌	❌	View Source
cv object det	Attention DenseUNet	Base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	DETR	ResNet50 Backbone	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	OWL-ViT	Base Patch32	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	PointPillars	pointpillars	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOP	Default	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOS Small	Small	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOv4	Base	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv object det	YOLOv7	Default	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOv9	T	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	ssd512	ssd512	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
mm action prediction	OpenVLA-OFT	Finetuned Libero 10	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
mm action prediction	pi_0	pi0 base	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
mm image text similarity	CLIP	Base Patch16	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
mm image text similarity	SigLIP	Base Patch16 224	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
mm visual qa	Llama	3.2 11B Vision Instruct	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
mm visual qa	Mistral	base	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	ALLaM	7B Instruct	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Command_A_Reasoning	command-a-reasoning-08-2025	pytorch	✅	❌	❌	❌	❌	❌	❌	✅	View Source
nlp causal lm	Falcon	3 10B Base	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Falcon	3 1B Base	pytorch	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Falcon	3 3B Base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Falcon	3 7B Base	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	GPT-2	Base	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	GPT-2	Xl	jax	❌	✅	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	GPT-OSS	20B	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Gemma	1.1 2B IT	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Gemma	1.1 7B IT	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Gemma	2 27B IT	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Gemma	2 2B IT	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Gemma	2 9B IT	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Llama	3.1 70B	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Llama	3.1 8B Instruct	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Llama	3.2 1B	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Llama	3.2 3B	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Llama	3.3 70B Instruct	pytorch	✅	❌	❌	❌	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	7B INSTRUCT v03	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Mistral	Devstral Small 2505	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Large INSTRUCT 2411	pytorch	✅	❌	❌	❌	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Magistral Small 2506	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Ministral 8B Instruct	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Mistral	Nemo INSTRUCT 2407	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Small 24B INSTRUCT 2501	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Phi-1	Phi 1	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Phi-1	Phi 1	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-1	Phi 1	pytorch	❌	✅	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-1 LoRA	Phi 1	pytorch	❌	✅	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-1.5	Phi 1 5	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Phi-1.5	Phi 1 5	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-2	Phi 2	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Phi-2	Phi 2	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-3	Mini 128K Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-3	Mini 4K Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-3	Mini Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-4	Phi 4	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Qwen 2	Qwq 32B	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Qwen 2.5	0.5B	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Qwen 2.5	0.5B Instruct	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Qwen 2.5	0.5B Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

1.3.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Installation

Via PyPI

Via Docker

Dependency commits

What's Changed

New Contributors

LLM Performance

Non-LLM Performance

Model coverage

Contributors

Uh oh!