Skip to content

1.3.0

Latest

Choose a tag to compare

@tenstorrent-github-bot tenstorrent-github-bot released this 29 Jun 11:18
f100c7a

Installation

Via PyPI

pip install tt-forge==1.3.0 --extra-index-url https://pypi.eng.aws.tenstorrent.com/

Via Docker

docker pull ghcr.io/tenstorrent/tt-forge-slim:1.3.0

Dependency commits

tt-xla commit: 93ac846371fd1a118265a3226b5a4ce0bdf939fe
tt-mlir commit: 70ff200c7d2fa8d8401f316a0a0b35ee88cbfb72
tt-metal commit: 7f6364a11dafadf141b6c87358073d9e3d1dd22f

What's Changed

  • Add workflow to test installations steps using AI by @vmilosevic in #921
  • Uplift third_party/tt_forge_models to 37332bc2fa41c6d8ff7ec994ad19b84ab266429f 2026-03-28 by @vvukomanTT in #922
  • Improve claude instructions testing by @vmilosevic in #923
  • Bump release version to 1.1.0 by @vvukomanTT in #924
  • Add a number of skills for the tenstorrent ecosystem by @zoecarver in #925
  • Update claude workflow by @vmilosevic in #926
  • Uplift third_party/tt_forge_models to e7ba96f22de7ced01b151393f4d4e1bc312fac23 2026-04-04 by @vvukomanTT in #930
  • Remove benchmark infrastructure from tt-forge by @chandrasekaranpradeep in #928
  • Upgrade checkout from v4 to v5, Add venv activate by @nsumrakTT in #929
  • Uplift third_party/tt_forge_models to 752832c839d59bb6c3fde43d7b9c03ebc820fa57 2026-04-11 by @vvukomanTT in #931
  • Add tt-xla llama demo by @ddilbazTT in #932
  • Add tiny llama demo by @ddilbazTT in #933
  • Uplift third_party/tt_forge_models to 534f4821827f4e92af6a1ad58e8c836c8ccd38fd 2026-04-18 by @vvukomanTT in #935
  • Stop tt-forge-onnx nightly releases by @vvukomanTT in #936
  • Uplift third_party/tt_forge_models to d11fe6671e44d286c2e8c6dcd9bd46a3b84f9ea8 2026-04-25 by @vvukomanTT in #937
  • Uplift third_party/tt_forge_models to cae9ccbc67a318736f656bee9a9ea776eb73e69c 2026-05-02 by @vvukomanTT in #940
  • Remove depreciated files by @nsumrakTT in #941
  • Add qwen3 demo by @abrown in #934
  • Add filter test matrix file by @vvukomanTT in #942
  • Refactor into call claude by @vmilosevic in #946
  • Update requirements to uplift PyTorch by @mmanzoorTT in #944
  • Uplift third_party/tt_forge_models to f224af305a10d38acb9fbd72c0c3514b26ec4544 2026-05-09 by @vvukomanTT in #980
  • Add ai model bringup workflow by @vmilosevic in #986
  • Ai bringup update by @vmilosevic in #987
  • Update ai-bringup workflow by @vmilosevic in #988
  • Update ai workflow docker image by @vmilosevic in #989
  • Update ai workflow by @vmilosevic in #990
  • Update ai workflow by @vmilosevic in #992
  • Uplift third_party/tt_forge_models to a64a98131c35b010895198f489355d0e6306934f 2026-05-16 by @vvukomanTT in #993
  • Run cpu bringup on builder by @vmilosevic in #994
  • Add master ai bringup by @vmilosevic in #1002
  • Fix push problem in ai workflows by @vmilosevic in #1003
  • Uplift third_party/tt_forge_models to 7201811e7020d0e35e908df47a9e57926ba0aa1c 2026-05-23 by @vvukomanTT in #1005
  • Uplift third_party/tt_forge_models to b845dfd5fefc65d5b7aff0515243b8c520758ba5 2026-05-30 by @vvukomanTT in #1006
  • Uplift third_party/tt_forge_models to 156da1137c7fde3afeea2801cf7e8f25f0d3000d 2026-06-03 by @vvukomanTT in #1009
  • Uplift third_party/tt_forge_models to 6b4b47a7c419cdc2713ddfc6e3179f61012c12f4 2026-06-06 by @vvukomanTT in #1011
  • Uplift third_party/tt_forge_models to 09239ae98eb4f0b03abe5240aca9418eb3131717 2026-06-13 by @vvukomanTT in #1013
  • Move ai bringup scripts to tt-forge-ai-bringup repo by @vmilosevic in #1004
  • Uplift third_party/tt_forge_models to 32d5c2e4a8cfd55b0f2ec99b3ec8d1b217fcb742 2026-06-20 by @vvukomanTT in #1014
  • Uplift third_party/tt_forge_models to 9af516c8daaa0634c10f00b1cd745042883845ea 2026-06-27 by @vvukomanTT in #1015

New Contributors

Full Changelog: 1.0.0...1.3.0


LLM Performance

Model Token/sec/user Batch Token/sec ttft (ms) Hardware
pytorch_DeepSeek-V3.1_deepseek_v3_1_modified_nlp_causal_lm_custom 3.0 64 192.0 4159.47 n150
pytorch_Falcon_3_1B_Base_nlp_causal_lm_huggingface 57.0 32 1824.0 663.53 n150
pytorch_Falcon_3_3B_Base_nlp_causal_lm_huggingface 38.0 32 1216.0 865.52 n150
pytorch_Falcon_3_7B_Base_nlp_causal_lm_huggingface 19.0 32 608.0 1186.81 n150
pytorch_GLM_4.7_nlp_causal_lm_huggingface 7.0 64 448.0 1590.81 n150
pytorch_Gemma_1.1_2B_IT_nlp_causal_lm_huggingface 40.0 32 1280.0 638.1 n150
pytorch_Kimi-K2.5_kimi_k2_5_modified_nlp_causal_lm_custom 3.0 64 192.0 4587.63 n150
pytorch_Kimi-K2_kimi_k2_instruct_modified_nlp_causal_lm_custom 3.0 64 192.0 4808.17 n150
pytorch_Llama_3.1_8B_Instruct_nlp_causal_lm_huggingface 23.0 32 736.0 2230.65 n150
pytorch_Llama_3.2_1B_Instruct_nlp_causal_lm_huggingface 67.0 32 2144.0 575.95 n150
pytorch_Llama_3.2_3B_Instruct_nlp_causal_lm_huggingface 31.0 32 992.0 605.5 n150
pytorch_Mistral_7B_INSTRUCT_v03_nlp_causal_lm_huggingface 20.0 32 640.0 1252.08 n150
pytorch_Mistral_Ministral_8B_Instruct_nlp_causal_lm_huggingface 12.0 32 384.0 550.28 n150
pytorch_Mistral_Small_24B_INSTRUCT_2501_nlp_causal_lm_huggingface 16.0 32 512.0 1790.87 n150
pytorch_Phi-1.5_Phi_1_5_nlp_causal_lm_huggingface 22.0 32 704.0 617.58 n150
pytorch_Phi-1_Phi_1_nlp_causal_lm_huggingface 22.0 32 704.0 632.51 n150
pytorch_Qwen 2.5_0.5B_Instruct_nlp_causal_lm_huggingface 78.0 32 2496.0 408.36 n150
pytorch_Qwen 2.5_1.5B_Instruct_nlp_causal_lm_huggingface 38.0 32 1216.0 461.04 n150
pytorch_Qwen 2.5_3B_Instruct_nlp_causal_lm_huggingface 31.0 32 992.0 697.85 n150
pytorch_Qwen 2.5_7B_Instruct_nlp_causal_lm_huggingface 16.0 32 512.0 859.7 n150
pytorch_Qwen 3_0_6B_nlp_causal_lm_huggingface 36.0 32 1152.0 1163.37 n150
pytorch_Qwen 3_1_7B_nlp_causal_lm_huggingface 30.0 32 960.0 746.03 n150
pytorch_Qwen 3_4B_nlp_causal_lm_huggingface 17.0 32 544.0 976.23 n150
pytorch_Qwen 3_8B_nlp_causal_lm_huggingface 13.0 32 416.0 1673.32 n150

Non-LLM Performance

Model Batch Sample/sec Hardware
pytorch_BERT_emrecan/bert-base-turkish-cased-mean-nli-stsb-tr_nlp_embed_gen_huggingface 8 44.0 n150
pytorch_BGE-M3_Base_nlp_embed_gen_custom 4 9.0 n150
pytorch_BGE-M3_Base_nlp_embed_gen_custom 4 19.0 p150
pytorch_EfficientNet_Timm_B0_cv_image_cls_timm 8 349.0 n150
pytorch_MNIST_Cnn_Dropout_cv_image_cls_custom 32 14643.0 n150
pytorch_MobileNetV2_Mobilenet_v2_cv_image_cls_torch_hub 12 1237.0 n150
pytorch_Qwen 3_Embedding_4B_nlp_embed_gen_huggingface 32 46.0 n150
pytorch_ResNet_ResNet50_HuggingFace_cv_image_cls_huggingface 8 1339.0 n150
pytorch_SegFormer_B0_Finetuned_Ade_512_512_cv_image_seg_huggingface 1 39.0 n150
pytorch_Swin_S_cv_image_cls_torchvision 1 10.0 n150
pytorch_U-Net for Conditional Generation_Base_conditional_generation_huggingface 1 5.0 n150
pytorch_Ultra-Fast Lane Detection v2_TuSimple_ResNet34_Backbone_cv_image_seg_github 1 136.0 n150
pytorch_VGG19-UNet_base_cv_image_seg_custom 1 147.0 n150
pytorch_ViT_Base_cv_image_cls_huggingface 8 229.0 n150
pytorch_VoVNet_Ese_Vovnet19b_Dw.ra_In1k_cv_image_cls_timm 8 667.0 n150

Model coverage

Info: Full list of supported models is available in the assets section.

Model task Model architecture Model variant Model framework Inference Training n150 n300 p150 Single device Data parallel Tensor parallel Model source
conditional generation U-Net for Conditional Generation Base pytorch View Source
cv image cls AlexNet Custom 1x2 jax View Source
cv image cls DINOv2 Small pytorch View Source
cv image cls EfficientNet B0 pytorch View Source
cv image cls MNIST Cnn Batchnorm jax View Source
cv image cls MNIST Cnn Dropout jax View Source
cv image cls MNIST Cnn Dropout pytorch View Source
cv image cls MNIST Cnn Nodropout pytorch View Source
cv image cls MNIST Mlp Custom jax View Source
cv image cls MNIST Mlp Custom jax View Source
cv image cls MNIST Mlp Custom 1x2 jax View Source
cv image cls MobileNetV1 Mobilenet v1 pytorch View Source
cv image cls MobileNetV2 Mobilenet v2 pytorch View Source
cv image cls ResNet ResNet50 HuggingFace High Resolution pytorch View Source
cv image cls SegFormer Mit B0 pytorch View Source
cv image cls Swin S pytorch View Source
cv image cls VGG HF Vgg19 pytorch View Source
cv image cls ViT Base pytorch View Source
cv image cls VoVNet Ese Vovnet19b Dw.ra In1k pytorch View Source
cv image seg Ultra-Fast Lane Detection TuSimple ResNet18 Backbone pytorch View Source
cv image seg VGG19-UNet base pytorch View Source
cv img to img Autoencoder linear pytorch View Source
cv object det Attention DenseUNet Base pytorch View Source
cv object det DETR ResNet50 Backbone pytorch View Source
cv object det OWL-ViT Base Patch32 pytorch View Source
cv object det PointPillars pointpillars pytorch View Source
cv object det YOLOP Default pytorch View Source
cv object det YOLOS Small Small pytorch View Source
cv object det YOLOv4 Base pytorch View Source
cv object det YOLOv7 Default pytorch View Source
cv object det YOLOv9 T pytorch View Source
cv object det ssd512 ssd512 pytorch View Source
mm action prediction OpenVLA-OFT Finetuned Libero 10 pytorch View Source
mm action prediction pi_0 pi0 base pytorch View Source
mm image text similarity CLIP Base Patch16 pytorch View Source
mm image text similarity SigLIP Base Patch16 224 pytorch View Source
mm visual qa Llama 3.2 11B Vision Instruct pytorch View Source
mm visual qa Mistral base pytorch View Source
nlp causal lm ALLaM 7B Instruct pytorch View Source
nlp causal lm Command_A_Reasoning command-a-reasoning-08-2025 pytorch View Source
nlp causal lm Falcon 3 10B Base pytorch View Source
nlp causal lm Falcon 3 1B Base pytorch View Source
nlp causal lm Falcon 3 3B Base pytorch View Source
nlp causal lm Falcon 3 7B Base pytorch View Source
nlp causal lm GPT-2 Base jax View Source
nlp causal lm GPT-2 Xl jax View Source
nlp causal lm GPT-OSS 20B pytorch View Source
nlp causal lm Gemma 1.1 2B IT pytorch View Source
nlp causal lm Gemma 1.1 7B IT pytorch View Source
nlp causal lm Gemma 2 27B IT pytorch View Source
nlp causal lm Gemma 2 2B IT pytorch View Source
nlp causal lm Gemma 2 9B IT pytorch View Source
nlp causal lm Llama 3.1 70B pytorch View Source
nlp causal lm Llama 3.1 8B Instruct pytorch View Source
nlp causal lm Llama 3.2 1B pytorch View Source
nlp causal lm Llama 3.2 3B pytorch View Source
nlp causal lm Llama 3.3 70B Instruct pytorch View Source
nlp causal lm Mistral 7B INSTRUCT v03 pytorch View Source
nlp causal lm Mistral Devstral Small 2505 pytorch View Source
nlp causal lm Mistral Large INSTRUCT 2411 pytorch View Source
nlp causal lm Mistral Magistral Small 2506 pytorch View Source
nlp causal lm Mistral Ministral 8B Instruct pytorch View Source
nlp causal lm Mistral Nemo INSTRUCT 2407 pytorch View Source
nlp causal lm Mistral Small 24B INSTRUCT 2501 pytorch View Source
nlp causal lm Phi-1 Phi 1 jax View Source
nlp causal lm Phi-1 Phi 1 pytorch View Source
nlp causal lm Phi-1 Phi 1 pytorch View Source
nlp causal lm Phi-1 LoRA Phi 1 pytorch View Source
nlp causal lm Phi-1.5 Phi 1 5 jax View Source
nlp causal lm Phi-1.5 Phi 1 5 pytorch View Source
nlp causal lm Phi-2 Phi 2 jax View Source
nlp causal lm Phi-2 Phi 2 pytorch View Source
nlp causal lm Phi-3 Mini 128K Instruct pytorch View Source
nlp causal lm Phi-3 Mini 4K Instruct pytorch View Source
nlp causal lm Phi-3 Mini Instruct pytorch View Source
nlp causal lm Phi-4 Phi 4 pytorch View Source
nlp causal lm Qwen 2 Qwq 32B pytorch View Source
nlp causal lm Qwen 2.5 0.5B jax View Source
nlp causal lm Qwen 2.5 0.5B Instruct jax View Source
nlp causal lm Qwen 2.5 0.5B Instruct pytorch View Source