Pulse · pytorch/TensorRT · GitHub

May 13, 2025 – June 13, 2025

Overview

42 Active pull requests

16 Active issues

Could not load contribution data

Please try again later

1 Release published by 1 person

v2.6.1 Torch-TensorRT v2.6.1
published Jun 3, 2025

30 Pull requests merged by 8 people

Add fp4 support
#3532 merged Jun 13, 2025
add --no-deps for tests/py/requirements.txt
#3569 merged Jun 12, 2025
fix: Fix constant folding failure due to modelopt
#3565 merged Jun 11, 2025
Cross compile guard
#3486 merged Jun 11, 2025
Upgrade to TensorRT 10.11
#3557 merged Jun 8, 2025
fix int8/fp8 constant folding issue
#3543 merged Jun 6, 2025
fix the jetson nightly build check bug
#3552 merged Jun 6, 2025
Changed the debug setting
#3551 merged Jun 6, 2025
fix: handle device in the same way as dtype in aten.full_like decomposition
#3538 merged Jun 5, 2025
Publish jetson wheel to pytorch nightly index
#3550 merged Jun 4, 2025
feat: TensorRT AOT Plugin
#3504 merged Jun 4, 2025
Add jetson build on CI
#3524 merged Jun 3, 2025
cherry pick 3505(windows driver upgrade) to release2.6.1
#3547 merged Jun 3, 2025
nvidia-modelopt dependency fix
#3544 merged Jun 2, 2025
update version to 2.6.1
#3545 merged Jun 2, 2025
fix the build issue for patch2.6.1
#3542 merged May 30, 2025
remove breakpoint
#3540 merged May 30, 2025
feat: Implement SDPA op converter / lowering pass as extensions
#3534 merged May 28, 2025
fix: wrong dtype and device in aten.full_like decomposition
#3535 merged May 28, 2025
Minor fix - check for DTensor on igpu platform
#3531 merged May 27, 2025
chore(deps): bump transformers from 4.48.0 to 4.50.0 in /examples/dynamo
#3497 merged May 26, 2025
add feature gate for tensorrt plugin
#3518 merged May 24, 2025
chore(deps): bump setuptools from 70.2.0 to 78.1.1 in /toolchains/jp_workspaces
#3523 merged May 23, 2025
Simplify pass manager debug system
#3530 merged May 23, 2025
Added CPU offloading
#3452 merged May 20, 2025
fix docker build failure: add allow_empty to true
#3526 merged May 20, 2025
chore(deps): bump undici from 5.28.5 to 5.29.0 in /.github/actions/assigner
#3520 merged May 15, 2025
chore: update the docstring for llama2 rmsnorm automatic plugin example
#3512 merged May 14, 2025
Add Linux CI build for aarch64
#3516 merged May 13, 2025
Update mutable_torchtrt_module_example.py
#3519 merged May 13, 2025

12 Pull requests opened by 9 people

feat: Refactor LLM model zoo and add KV cache support
#3527 opened May 20, 2025
FX graph visualization
#3528 opened May 21, 2025
AOT plugin: examples with RMSNORM
#3529 opened May 21, 2025
chore(deps): bump transformers from 4.48.0 to 4.50.0 in /tests/modules
#3533 opened May 26, 2025
flux fp4 example(WIP)
#3537 opened May 28, 2025
feat: Hierarchical Partitioner to support multi-backends
#3539 opened May 29, 2025
add a quick fail fast feed-back actions before the full tests is triggered
#3541 opened May 30, 2025
Test only(Do not review): one main workflow, if fast feed back loop is success then trigger all other workflows
#3558 opened Jun 6, 2025
fix: replace add_identity by add_cast for type cast
#3563 opened Jun 9, 2025
feat: Saving modules using the AOTI format
#3567 opened Jun 11, 2025
fix: Fix a perf regression due to weights being ITensors
#3568 opened Jun 11, 2025
adding rotary embedding example, with graph rewrite for complex subgraph [WIP]
#3570 opened Jun 13, 2025

9 Issues closed by 6 people

Constant folding crashes if model opt is not installed
#3564 closed Jun 11, 2025
Update the lock file to reflect the torch nightly
#3553 closed Jun 8, 2025
🐛 [Bug] RuntimeError: Unhandled FakeTensor Device Propagation for torchvision.deform_conv2d.default, found two different devices cuda:0, cpu
#3556 closed Jun 7, 2025
Please Promote torch_tensorrt 2.6.1 release artifacts
#3546 closed Jun 3, 2025
✨[Feature] Add `pickle_protocol` argument for `torch_tensorrt.save`
#3294 closed May 30, 2025
✨[Feature] Warn users who are building Torch-TensorRT with CUDA 12.6 that they need to use CXX11 ABI
#3339 closed May 30, 2025
❓ [Question] Do you have any plan to release v2.6.1 ?
#3536 closed May 29, 2025
Remove this breakpoint
#3386 closed May 22, 2025
🐛 [Bug] cannot load quantize_fp8 even though the modelopt[all] installed
#3232 closed May 20, 2025

7 Issues opened by 4 people

🐛 [Bug] Using `torch_executed_ops` causes the metadata packing to fail
#3566 opened Jun 11, 2025
🐛 [Bug] Severe performance regression with 2.7 and 2.8.dev
#3561 opened Jun 8, 2025
🐛 [Bug] AttributeError: '_OpNamespace' 'tensorrt' object has no attribute 'quantize_op'
#3560 opened Jun 8, 2025
Is it possible to enable Github Merge Queues for our Repos?
#3559 opened Jun 6, 2025
Remove the distributed extra
#3554 opened Jun 6, 2025
❓ [Question] How to save the compiled while using torch.compile
#3525 opened May 20, 2025
❓ [Question] Manually Annotate Quantization Parameters in FX Graph
#3522 opened May 16, 2025

10 Unresolved conversations

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

Added flux demo
#3418 commented on Jun 13, 2025 • 15 new comments
slice scatter support for dynamic cases
#3513 commented on May 16, 2025 • 1 new comment
✨[Feature] Is there a plan to support to convert quantized PT2 to trt ?
#3471 commented on May 26, 2025 • 0 new comments
🐛 [Bug] Difficulties Quantizing FP16 Models to INT8 Using torch_tensorrt (MLP, CNN, Attention, LSTM, Transformer)
#3494 commented on May 30, 2025 • 0 new comments
❓ [Question] Is SAM2 supported when compiling with the Dynamo backend on JetPack 6.1 or 6.2?
#3478 commented on Jun 3, 2025 • 0 new comments
✨[Feature] Add FX tests to CI
#3492 commented on Jun 6, 2025 • 0 new comments
🐛 [Bug] AssertionError: end must be an integer
#3448 commented on Jun 12, 2025 • 0 new comments
TRT-LLM loading mechanism tool
#3398 commented on Jun 11, 2025 • 0 new comments
Add support for JetPack 6.2 build
#3453 commented on May 15, 2025 • 0 new comments
Debugging guide for TensorRT
#3489 commented on Jun 9, 2025 • 0 new comments