Skip to content

1.3.0.dev20260615003539

Choose a tag to compare

@tenstorrent-github-bot tenstorrent-github-bot released this 15 Jun 01:20
· 1 commit to main since this release
6f2d964

Installation

Via PyPI

pip install tt-forge==1.3.0.dev20260615003539 --extra-index-url https://pypi.eng.aws.tenstorrent.com/

Via Docker

docker pull ghcr.io/tenstorrent/tt-forge-slim:1.3.0.dev20260615003539

What's Changed

  • Uplift third_party/tt_forge_models to 09239ae98eb4f0b03abe5240aca9418eb3131717 2026-06-13 by @vvukomanTT in #1013

Full Changelog: 1.3.0.dev20260613003624...1.3.0.dev20260615003539


LLM Performance

Model Token/sec/user Batch Token/sec ttft (ms)
Qwen/Qwen2.5-0.5B-Instruct 1251.0 32 40032.0 920.43
Qwen/Qwen2.5-1.5B-Instruct 687.0 32 21984.0 1783.34
Qwen/Qwen2.5-3B-Instruct 550.0 32 17600.0 3072.82
Qwen/Qwen2.5-7B-Instruct 364.0 32 11648.0 5838.96
Qwen/Qwen3-0.6B 961.0 32 30752.0 1755.58
Qwen/Qwen3-1.7B 717.0 32 22944.0 2218.45
Qwen/Qwen3-4B 445.0 32 14240.0 4705.05
Qwen/Qwen3-8B 286.0 32 9152.0 6066.36
meta-llama/Llama-3.1-8B-Instruct 334.0 32 10688.0 5694.18
meta-llama/Llama-3.2-1B-Instruct 1067.0 32 34144.0 1336.13
meta-llama/Llama-3.2-3B-Instruct 432.0 32 13824.0 3257.63
microsoft/phi-1 733.0 32 23456.0 2482.63
microsoft/phi-1_5 741.0 32 23712.0 2485.56
microsoft/phi-2 431.0 32 13792.0 4825.94
mistralai/Ministral-8B-Instruct-2410 289.0 32 9248.0 5986.28
mistralai/Mistral-7B-Instruct-v0.3 363.0 32 11616.0 5704.14
pytorch_DeepSeek-V3.2_deepseek_v3_2_exp_modified_nlp_causal_lm_custom 2.0 128 256.0 7314.22
pytorch_Falcon_3_10B_Base_nlp_causal_lm_huggingface 24.0 32 768.0 1679.56
pytorch_Falcon_3_1B_Base_nlp_causal_lm_huggingface 58.0 32 1856.0 668.33
pytorch_Falcon_3_3B_Base_nlp_causal_lm_huggingface 38.0 32 1216.0 825.5
pytorch_Falcon_3_7B_Base_nlp_causal_lm_huggingface 32.0 32 1024.0 1152.64
pytorch_Gemma_1.1_2B_IT_nlp_causal_lm_huggingface 39.0 32 1248.0 672.77
pytorch_Llama_3.1_8B_Instruct_nlp_causal_lm_huggingface 23.0 32 736.0 2136.12
pytorch_Llama_3.2_1B_Instruct_nlp_causal_lm_huggingface 67.0 32 2144.0 548.99
pytorch_Llama_3.2_3B_Instruct_nlp_causal_lm_huggingface 31.0 32 992.0 587.21
pytorch_Mistral_7B_INSTRUCT_v03_nlp_causal_lm_huggingface 20.0 32 640.0 1228.28
pytorch_Mistral_Ministral_8B_Instruct_nlp_causal_lm_huggingface 8.0 32 256.0 612.36
pytorch_Mistral_Nemo_INSTRUCT_2407_nlp_causal_lm_huggingface 17.0 32 544.0 1687.0
pytorch_Mistral_Small_24B_INSTRUCT_2501_nlp_causal_lm_huggingface 16.0 32 512.0 1806.4
pytorch_Phi-1.5_Phi_1_5_nlp_causal_lm_huggingface 22.0 32 704.0 615.48
pytorch_Phi-1_Phi_1_nlp_causal_lm_huggingface 22.0 32 704.0 614.76
pytorch_Qwen 2.5_0.5B_Instruct_nlp_causal_lm_huggingface 76.0 32 2432.0 387.57
pytorch_Qwen 2.5_1.5B_Instruct_nlp_causal_lm_huggingface 38.0 32 1216.0 453.66
pytorch_Qwen 2.5_14B_Instruct_nlp_causal_lm_huggingface 12.0 32 384.0 2051.1
pytorch_Qwen 2.5_3B_Instruct_nlp_causal_lm_huggingface 32.0 32 1024.0 664.11
pytorch_Qwen 2.5_7B_Instruct_nlp_causal_lm_huggingface 16.0 32 512.0 848.01
pytorch_Qwen 3_0_6B_nlp_causal_lm_huggingface 36.0 32 1152.0 1142.82
pytorch_Qwen 3_14B_nlp_causal_lm_huggingface 10.0 32 320.0 1788.43
pytorch_Qwen 3_1_7B_nlp_causal_lm_huggingface 30.0 32 960.0 725.71
pytorch_Qwen 3_32B_nlp_causal_lm_huggingface 9.0 32 288.0 4656.78
pytorch_Qwen 3_4B_nlp_causal_lm_huggingface 17.0 32 544.0 1003.49
pytorch_Qwen 3_8B_nlp_causal_lm_huggingface 11.0 32 352.0 1595.74
tiiuae/Falcon3-1B-Base 868.0 32 27776.0 1492.71
tiiuae/Falcon3-3B-Base 447.0 32 14304.0 2577.07
tiiuae/Falcon3-7B-Base 217.0 32 6944.0 6448.05

Non-LLM Performance

Model Batch Sample/sec
BAAI/bge-m3 1 34.0
Qwen/Qwen3-Embedding-4B 1 7.0
pytorch_BERT_emrecan/bert-base-turkish-cased-mean-nli-stsb-tr_nlp_embed_gen_huggingface 8 44.0
pytorch_BGE-M3_Base_nlp_embed_gen_custom 4 9.0
pytorch_EfficientNet_Timm_B0_cv_image_cls_timm 8 350.0
pytorch_MNIST_Cnn_Dropout_cv_image_cls_custom 32 13940.0
pytorch_MobileNetV2_Mobilenet_v2_cv_image_cls_torch_hub 12 1248.0
pytorch_Qwen 3_Embedding_4B_nlp_embed_gen_huggingface 32 46.0
pytorch_ResNet_ResNet50_HuggingFace_cv_image_cls_huggingface 8 1350.0
pytorch_SegFormer_B0_Finetuned_Ade_512_512_cv_image_seg_huggingface 1 38.0
pytorch_Swin_S_cv_image_cls_torchvision 1 10.0
pytorch_U-Net for Conditional Generation_Base_conditional_generation_huggingface 1 5.0
pytorch_Ultra-Fast Lane Detection v2_TuSimple_ResNet34_Backbone_cv_image_seg_github 1 136.0
pytorch_VGG19-UNet_base_cv_image_seg_custom 1 139.0
pytorch_ViT_Base_cv_image_cls_huggingface 8 230.0
pytorch_VoVNet_Ese_Vovnet19b_Dw.ra_In1k_cv_image_cls_timm 8 673.0

Model coverage

Info: Full list of supported models is available in the assets section.

Model task Model architecture Model variant Model framework Inference Training n150 n300 p150 Single device Data parallel Tensor parallel Model source
conditional generation U-Net for Conditional Generation Base pytorch View Source
cv image cls AlexNet Custom 1x2 jax View Source
cv image cls DINOv2 Small pytorch View Source
cv image cls EfficientNet B0 pytorch View Source
cv image cls MNIST Cnn Batchnorm jax View Source
cv image cls MNIST Cnn Dropout jax View Source
cv image cls MNIST Cnn Dropout pytorch View Source
cv image cls MNIST Cnn Nodropout pytorch View Source
cv image cls MNIST Mlp Custom jax View Source
cv image cls MNIST Mlp Custom jax View Source
cv image cls MNIST Mlp Custom 1x2 jax View Source
cv image cls MobileNetV1 Mobilenet v1 pytorch View Source
cv image cls MobileNetV2 Mobilenet v2 pytorch View Source
cv image cls ResNet ResNet50 HuggingFace High Resolution pytorch View Source
cv image cls SegFormer Mit B0 pytorch View Source
cv image cls Swin S pytorch View Source
cv image cls VGG HF Vgg19 pytorch View Source
cv image cls ViT Base pytorch View Source
cv image cls VoVNet Ese Vovnet19b Dw.ra In1k pytorch View Source
cv image seg Ultra-Fast Lane Detection TuSimple ResNet18 Backbone pytorch View Source
cv image seg VGG19-UNet base pytorch View Source
cv img to img Autoencoder linear pytorch View Source
cv object det Attention DenseUNet Base pytorch View Source
cv object det DETR ResNet50 Backbone pytorch View Source
cv object det OWL-ViT Base Patch32 pytorch View Source
cv object det PointPillars pointpillars pytorch View Source
cv object det YOLOP Default pytorch View Source
cv object det YOLOS Small Small pytorch View Source
cv object det YOLOv4 Base pytorch View Source
cv object det YOLOv7 Default pytorch View Source
cv object det YOLOv9 T pytorch View Source
cv object det ssd512 ssd512 pytorch View Source
mm action prediction OpenVLA-OFT Finetuned Libero 10 pytorch View Source
mm action prediction pi_0 pi0 base pytorch View Source
mm image text similarity CLIP Base Patch16 pytorch View Source
mm image text similarity SigLIP Base Patch16 224 pytorch View Source
mm visual qa Mistral base pytorch View Source
nlp causal lm ALLaM 7B Instruct pytorch View Source
nlp causal lm Command_A_Reasoning command-a-reasoning-08-2025 pytorch View Source
nlp causal lm Falcon 3 10B Base pytorch View Source
nlp causal lm Falcon 3 1B Base pytorch View Source
nlp causal lm Falcon 3 3B Base pytorch View Source
nlp causal lm Falcon 3 7B Base pytorch View Source
nlp causal lm GPT-2 Base jax View Source
nlp causal lm GPT-2 Xl jax View Source
nlp causal lm Gemma 1.1 2B IT pytorch View Source
nlp causal lm Gemma 1.1 7B IT pytorch View Source
nlp causal lm Gemma 2 27B IT pytorch View Source
nlp causal lm Gemma 2 2B IT pytorch View Source
nlp causal lm Gemma 2 9B IT pytorch View Source
nlp causal lm Llama 3.1 8B Instruct pytorch View Source
nlp causal lm Llama 3.2 1B pytorch View Source
nlp causal lm Llama 3.2 3B pytorch View Source
nlp causal lm Mistral 7B INSTRUCT v03 pytorch View Source
nlp causal lm Mistral Devstral Small 2505 pytorch View Source
nlp causal lm Mistral Magistral Small 2506 pytorch View Source
nlp causal lm Mistral Ministral 8B Instruct pytorch View Source
nlp causal lm Mistral Nemo INSTRUCT 2407 pytorch View Source
nlp causal lm Mistral Small 24B INSTRUCT 2501 pytorch View Source
nlp causal lm Phi-1 Phi 1 jax View Source
nlp causal lm Phi-1 Phi 1 pytorch View Source
nlp causal lm Phi-1 LoRA Phi 1 pytorch View Source
nlp causal lm Phi-1.5 Phi 1 5 jax View Source
nlp causal lm Phi-1.5 Phi 1 5 pytorch View Source
nlp causal lm Phi-2 Phi 2 jax View Source
nlp causal lm Phi-2 Phi 2 pytorch View Source
nlp causal lm Phi-3 Mini 128K Instruct pytorch View Source
nlp causal lm Phi-3 Mini 4K Instruct pytorch View Source
nlp causal lm Phi-3 Mini Instruct pytorch View Source
nlp causal lm Phi-4 Phi 4 pytorch View Source
nlp causal lm Qwen 2 Qwq 32B pytorch View Source
nlp causal lm Qwen 2.5 0.5B jax View Source
nlp causal lm Qwen 2.5 0.5B Instruct jax View Source
nlp causal lm Qwen 2.5 0.5B Instruct pytorch View Source
nlp causal lm Qwen 2.5 1.5B Instruct jax View Source
nlp causal lm Qwen 2.5 1.5B Instruct pytorch View Source
nlp causal lm Qwen 2.5 14B Instruct pytorch View Source
nlp causal lm Qwen 2.5 32B Instruct pytorch View Source
nlp causal lm Qwen 2.5 3B jax View Source
nlp causal lm Qwen 2.5 3B Instruct jax View Source