Release 1.3.0.dev20260614003409 · tenstorrent/tt-forge

Installation

Via PyPI

pip install tt-forge==1.3.0.dev20260614003409 --extra-index-url https://pypi.eng.aws.tenstorrent.com/

Via Docker

docker pull ghcr.io/tenstorrent/tt-forge-slim:1.3.0.dev20260614003409
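
After either install path, it can be useful to confirm that the environment actually carries this dev build. The following is a minimal sketch using only the Python standard library. It assumes the installed distribution is registered under the same name used in the pip command (`tt-forge`) and that an exact string comparison against the release version is sufficient. The helper names `installed_version` and `matches_release` are hypothetical and not part of tt-forge.

```python
from importlib.metadata import PackageNotFoundError, version

# Version string taken from this release's pip command.
EXPECTED = "1.3.0.dev20260614003409"


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def matches_release(dist_name="tt-forge", expected=EXPECTED):
    """True only when the distribution is installed at exactly `expected`.

    Assumption: exact string equality is good enough for a pinned dev build.
    """
    return installed_version(dist_name) == expected


if __name__ == "__main__":
    found = installed_version("tt-forge")
    print(f"tt-forge installed: {found!r}; matches release: {found == EXPECTED}")
```

Run inside the Docker image or the virtual environment used for the pip install; a `None` result means the package is not visible to that interpreter.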

What's Changed

Uplift third_party/tt_forge_models to 09239ae98eb4f0b03abe5240aca9418eb3131717 2026-06-13 by @vvukomanTT in #1013

Full Changelog: 1.3.0.dev20260613003624...1.3.0.dev20260614003409

LLM Performance

Model	Tokens/sec/user	Batch	Tokens/sec	TTFT (ms)
Qwen/Qwen2.5-0.5B-Instruct	1251.0	32	40032.0	920.43
Qwen/Qwen2.5-1.5B-Instruct	687.0	32	21984.0	1783.34
Qwen/Qwen2.5-3B-Instruct	550.0	32	17600.0	3072.82
Qwen/Qwen2.5-7B-Instruct	364.0	32	11648.0	5838.96
Qwen/Qwen3-0.6B	961.0	32	30752.0	1755.58
Qwen/Qwen3-1.7B	717.0	32	22944.0	2218.45
Qwen/Qwen3-4B	445.0	32	14240.0	4705.05
Qwen/Qwen3-8B	286.0	32	9152.0	6066.36
meta-llama/Llama-3.1-8B-Instruct	334.0	32	10688.0	5694.18
meta-llama/Llama-3.2-1B-Instruct	1067.0	32	34144.0	1336.13
meta-llama/Llama-3.2-3B-Instruct	432.0	32	13824.0	3257.63
microsoft/phi-1	733.0	32	23456.0	2482.63
microsoft/phi-1_5	741.0	32	23712.0	2485.56
microsoft/phi-2	431.0	32	13792.0	4825.94
mistralai/Ministral-8B-Instruct-2410	289.0	32	9248.0	5986.28
mistralai/Mistral-7B-Instruct-v0.3	363.0	32	11616.0	5704.14
pytorch_DeepSeek-V3.2_deepseek_v3_2_exp_modified_nlp_causal_lm_custom	2.0	128	256.0	7314.22
pytorch_Falcon_3_10B_Base_nlp_causal_lm_huggingface	24.0	32	768.0	1679.56
pytorch_Falcon_3_1B_Base_nlp_causal_lm_huggingface	58.0	32	1856.0	668.33
pytorch_Falcon_3_3B_Base_nlp_causal_lm_huggingface	38.0	32	1216.0	825.5
pytorch_Falcon_3_7B_Base_nlp_causal_lm_huggingface	32.0	32	1024.0	1152.64
pytorch_Gemma_1.1_2B_IT_nlp_causal_lm_huggingface	39.0	32	1248.0	672.77
pytorch_Llama_3.1_8B_Instruct_nlp_causal_lm_huggingface	23.0	32	736.0	2136.12
pytorch_Llama_3.2_1B_Instruct_nlp_causal_lm_huggingface	67.0	32	2144.0	548.99
pytorch_Llama_3.2_3B_Instruct_nlp_causal_lm_huggingface	31.0	32	992.0	587.21
pytorch_Mistral_7B_INSTRUCT_v03_nlp_causal_lm_huggingface	20.0	32	640.0	1228.28
pytorch_Mistral_Ministral_8B_Instruct_nlp_causal_lm_huggingface	8.0	32	256.0	612.36
pytorch_Mistral_Nemo_INSTRUCT_2407_nlp_causal_lm_huggingface	17.0	32	544.0	1687.0
pytorch_Mistral_Small_24B_INSTRUCT_2501_nlp_causal_lm_huggingface	16.0	32	512.0	1806.4
pytorch_Phi-1.5_Phi_1_5_nlp_causal_lm_huggingface	22.0	32	704.0	615.48
pytorch_Phi-1_Phi_1_nlp_causal_lm_huggingface	22.0	32	704.0	614.76
pytorch_Qwen 2.5_0.5B_Instruct_nlp_causal_lm_huggingface	76.0	32	2432.0	387.57
pytorch_Qwen 2.5_1.5B_Instruct_nlp_causal_lm_huggingface	38.0	32	1216.0	453.66
pytorch_Qwen 2.5_14B_Instruct_nlp_causal_lm_huggingface	12.0	32	384.0	2051.1
pytorch_Qwen 2.5_3B_Instruct_nlp_causal_lm_huggingface	32.0	32	1024.0	664.11
pytorch_Qwen 2.5_7B_Instruct_nlp_causal_lm_huggingface	16.0	32	512.0	848.01
pytorch_Qwen 3_0_6B_nlp_causal_lm_huggingface	36.0	32	1152.0	1142.82
pytorch_Qwen 3_14B_nlp_causal_lm_huggingface	10.0	32	320.0	1788.43
pytorch_Qwen 3_1_7B_nlp_causal_lm_huggingface	30.0	32	960.0	725.71
pytorch_Qwen 3_32B_nlp_causal_lm_huggingface	9.0	32	288.0	4656.78
pytorch_Qwen 3_4B_nlp_causal_lm_huggingface	17.0	32	544.0	1003.49
pytorch_Qwen 3_8B_nlp_causal_lm_huggingface	11.0	32	352.0	1595.74
tiiuae/Falcon3-1B-Base	868.0	32	27776.0	1492.71
tiiuae/Falcon3-3B-Base	447.0	32	14304.0	2577.07
tiiuae/Falcon3-7B-Base	217.0	32	6944.0	6448.05
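
In the rows above, the aggregate token rate appears to equal the per-user token rate multiplied by the batch size (for example, 1251.0 × 32 = 40032.0). The release notes do not state this relation; it is inferred from the numbers. The sketch below checks it on three rows copied from the table. The function names are hypothetical.

```python
# (model, tokens/sec/user, batch, tokens/sec) copied from rows of the table above.
ROWS = [
    ("Qwen/Qwen2.5-0.5B-Instruct", 1251.0, 32, 40032.0),
    ("meta-llama/Llama-3.1-8B-Instruct", 334.0, 32, 10688.0),
    ("pytorch_DeepSeek-V3.2_deepseek_v3_2_exp_modified_nlp_causal_lm_custom", 2.0, 128, 256.0),
]


def aggregate_throughput(per_user, batch):
    """Aggregate tokens/sec, assuming every user in the batch decodes at the same rate."""
    return per_user * batch


def inconsistent_rows(rows, tol=1e-6):
    """Return the models whose reported aggregate differs from per_user * batch."""
    return [
        model
        for model, per_user, batch, total in rows
        if abs(aggregate_throughput(per_user, batch) - total) > tol
    ]


if __name__ == "__main__":
    bad = inconsistent_rows(ROWS)
    print("all sampled rows consistent" if not bad else f"mismatch in: {bad}")
```

When comparing models, the per-user column is the fairer figure across different batch sizes: the DeepSeek row reports a batch of 128, while the others use 32.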

Non-LLM Performance

Model	Batch	Samples/sec
BAAI/bge-m3	1	34.0
Qwen/Qwen3-Embedding-4B	1	7.0
pytorch_BERT_emrecan/bert-base-turkish-cased-mean-nli-stsb-tr_nlp_embed_gen_huggingface	8	44.0
pytorch_BGE-M3_Base_nlp_embed_gen_custom	4	9.0
pytorch_EfficientNet_Timm_B0_cv_image_cls_timm	8	350.0
pytorch_MNIST_Cnn_Dropout_cv_image_cls_custom	32	13940.0
pytorch_MobileNetV2_Mobilenet_v2_cv_image_cls_torch_hub	12	1248.0
pytorch_Qwen 3_Embedding_4B_nlp_embed_gen_huggingface	32	46.0
pytorch_ResNet_ResNet50_HuggingFace_cv_image_cls_huggingface	8	1350.0
pytorch_SegFormer_B0_Finetuned_Ade_512_512_cv_image_seg_huggingface	1	38.0
pytorch_Swin_S_cv_image_cls_torchvision	1	10.0
pytorch_U-Net for Conditional Generation_Base_conditional_generation_huggingface	1	5.0
pytorch_Ultra-Fast Lane Detection v2_TuSimple_ResNet34_Backbone_cv_image_seg_github	1	136.0
pytorch_VGG19-UNet_base_cv_image_seg_custom	1	139.0
pytorch_ViT_Base_cv_image_cls_huggingface	8	230.0
pytorch_VoVNet_Ese_Vovnet19b_Dw.ra_In1k_cv_image_cls_timm	8	673.0

Model coverage

Info: The full list of supported models is available in the assets section.

Model task	Model architecture	Model variant	Model framework	Inference	Training	n150	n300	p150	Single device	Data parallel	Tensor parallel	Model source
conditional generation	U-Net for Conditional Generation	Base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	AlexNet	Custom 1x2	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
cv image cls	DINOv2	Small	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	EfficientNet	B0	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	MNIST	Cnn Batchnorm	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Cnn Dropout	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Cnn Dropout	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Cnn Nodropout	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Mlp Custom	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Mlp Custom	jax	❌	✅	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MNIST	Mlp Custom 1x2	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
cv image cls	MobileNetV1	Mobilenet v1	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	MobileNetV2	Mobilenet v2	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	ResNet	ResNet50 HuggingFace High Resolution	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	SegFormer	Mit B0	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	Swin	S	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	VGG	HF Vgg19	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image cls	ViT	Base	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image cls	VoVNet	Ese Vovnet19b Dw.ra In1k	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv image seg	Ultra-Fast Lane Detection	TuSimple ResNet18 Backbone	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv image seg	VGG19-UNet	base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv img to img	Autoencoder	linear	pytorch	❌	✅	✅	❌	✅	✅	❌	❌	View Source
cv object det	Attention DenseUNet	Base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	DETR	ResNet50 Backbone	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	OWL-ViT	Base Patch32	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	PointPillars	pointpillars	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOP	Default	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOS Small	Small	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOv4	Base	pytorch	✅	❌	✅	✅	✅	✅	✅	❌	View Source
cv object det	YOLOv7	Default	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	YOLOv9	T	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
cv object det	ssd512	ssd512	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
mm action prediction	OpenVLA-OFT	Finetuned Libero 10	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
mm action prediction	pi_0	pi0 base	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
mm image text similarity	CLIP	Base Patch16	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
mm image text similarity	SigLIP	Base Patch16 224	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
mm visual qa	Mistral	base	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	ALLaM	7B Instruct	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Command_A_Reasoning	command-a-reasoning-08-2025	pytorch	✅	❌	❌	❌	❌	❌	❌	✅	View Source
nlp causal lm	Falcon	3 10B Base	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Falcon	3 1B Base	pytorch	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Falcon	3 3B Base	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Falcon	3 7B Base	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	GPT-2	Base	jax	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	GPT-2	Xl	jax	❌	✅	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Gemma	1.1 2B IT	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Gemma	1.1 7B IT	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Gemma	2 27B IT	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Gemma	2 2B IT	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Gemma	2 9B IT	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Llama	3.1 8B Instruct	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Llama	3.2 1B	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Llama	3.2 3B	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Mistral	7B INSTRUCT v03	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Mistral	Devstral Small 2505	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Magistral Small 2506	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Ministral 8B Instruct	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Nemo INSTRUCT 2407	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Mistral	Small 24B INSTRUCT 2501	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Phi-1	Phi 1	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Phi-1	Phi 1	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-1 LoRA	Phi 1	pytorch	❌	✅	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-1.5	Phi 1 5	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Phi-1.5	Phi 1 5	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-2	Phi 2	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Phi-2	Phi 2	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-3	Mini 128K Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-3	Mini 4K Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-3	Mini Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Phi-4	Phi 4	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Qwen 2	Qwq 32B	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Qwen 2.5	0.5B	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Qwen 2.5	0.5B Instruct	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Qwen 2.5	0.5B Instruct	pytorch	✅	❌	✅	❌	✅	✅	❌	❌	View Source
nlp causal lm	Qwen 2.5	1.5B Instruct	jax	✅	❌	✅	✅	✅	✅	❌	✅	View Source
nlp causal lm	Qwen 2.5	1.5B Instruct	pytorch	✅	❌	❌	❌	✅	✅	❌	❌	View Source
nlp causal lm	Qwen 2.5	14B Instruct	pytorch	✅	❌	❌	✅	✅	✅	❌	✅	View Source
nlp causal lm	Qwen 2.5	32B Instruct	pytorch	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Qwen 2.5	3B	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
nlp causal lm	Qwen 2.5	3B Instruct	jax	✅	❌	❌	✅	❌	❌	❌	✅	View Source
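
To answer questions such as "which models run on n150?", the coverage rows can be loaded into dictionaries and filtered. This is a minimal sketch, assuming the table is tab-separated with the thirteen columns listed in its header and that ✅ / ❌ are the only flag values. The column keys and helper names are hypothetical, and the three sample rows are copied from the table above.

```python
COLUMNS = [
    "task", "architecture", "variant", "framework",
    "inference", "training", "n150", "n300", "p150",
    "single_device", "data_parallel", "tensor_parallel", "source",
]
# Columns holding ✅ / ❌ flags (everything between "framework" and "source").
FLAG_COLUMNS = COLUMNS[4:12]

SAMPLE = "\n".join([
    "cv image cls\tEfficientNet\tB0\tpytorch\t✅\t❌\t✅\t✅\t✅\t✅\t✅\t❌\tView Source",
    "nlp causal lm\tGemma\t2 27B IT\tpytorch\t✅\t❌\t❌\t✅\t❌\t❌\t❌\t✅\tView Source",
    "nlp causal lm\tLlama\t3.2 1B\tpytorch\t✅\t❌\t✅\t❌\t✅\t✅\t❌\t❌\tView Source",
])


def parse_rows(text):
    """Parse tab-separated coverage rows; skip lines with an unexpected column count."""
    rows = []
    for line in text.splitlines():
        cells = line.split("\t")
        if len(cells) != len(COLUMNS):
            continue
        row = dict(zip(COLUMNS, cells))
        for key in FLAG_COLUMNS:
            row[key] = row[key] == "✅"  # convert the flag to a boolean
        rows.append(row)
    return rows


def supported_on(rows, flag):
    """Return 'architecture variant' labels for rows where the given flag is set."""
    return [f"{r['architecture']} {r['variant']}" for r in rows if r[flag]]


if __name__ == "__main__":
    rows = parse_rows(SAMPLE)
    print("n150:", supported_on(rows, "n150"))
    print("tensor parallel:", supported_on(rows, "tensor_parallel"))
```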
