-
Notifications
You must be signed in to change notification settings - Fork 363
Insights: pytorch/TensorRT
Overview
Could not load contribution data
Please try again later
1 Release published by 1 person
-
v2.6.1 Torch-TensorRT v2.6.1
published
Jun 3, 2025
30 Pull requests merged by 8 people
-
Add fp4 support
#3532 merged
Jun 13, 2025 -
add --no-deps for tests/py/requirements.txt
#3569 merged
Jun 12, 2025 -
fix: Fix constant folding failure due to modelopt
#3565 merged
Jun 11, 2025 -
Cross compile guard
#3486 merged
Jun 11, 2025 -
Upgrade to TensorRT 10.11
#3557 merged
Jun 8, 2025 -
fix int8/fp8 constant folding issue
#3543 merged
Jun 6, 2025 -
fix the jetson nightly build check bug
#3552 merged
Jun 6, 2025 -
Changed the debug setting
#3551 merged
Jun 6, 2025 -
fix: handle device in the same way as dtype in
aten.full_like
decomposition#3538 merged
Jun 5, 2025 -
Publish jetson wheel to pytorch nightly index
#3550 merged
Jun 4, 2025 -
feat: TensorRT AOT Plugin
#3504 merged
Jun 4, 2025 -
Add jetson build on CI
#3524 merged
Jun 3, 2025 -
cherry pick 3505(windows driver upgrade) to release2.6.1
#3547 merged
Jun 3, 2025 -
nvidia-modelopt dependency fix
#3544 merged
Jun 2, 2025 -
update version to 2.6.1
#3545 merged
Jun 2, 2025 -
fix the build issue for patch2.6.1
#3542 merged
May 30, 2025 -
remove breakpoint
#3540 merged
May 30, 2025 -
feat: Implement SDPA op converter / lowering pass as extensions
#3534 merged
May 28, 2025 -
fix: wrong dtype and device in
aten.full_like
decomposition#3535 merged
May 28, 2025 -
Minor fix - check for DTensor on igpu platform
#3531 merged
May 27, 2025 -
chore(deps): bump transformers from 4.48.0 to 4.50.0 in /examples/dynamo
#3497 merged
May 26, 2025 -
add feature gate for tensorrt plugin
#3518 merged
May 24, 2025 -
chore(deps): bump setuptools from 70.2.0 to 78.1.1 in /toolchains/jp_workspaces
#3523 merged
May 23, 2025 -
Simplify pass manager debug system
#3530 merged
May 23, 2025 -
Added CPU offloading
#3452 merged
May 20, 2025 -
fix docker build failure: add allow_empty to true
#3526 merged
May 20, 2025 -
chore(deps): bump undici from 5.28.5 to 5.29.0 in /.github/actions/assigner
#3520 merged
May 15, 2025 -
chore: update the docstring for llama2 rmsnorm automatic plugin example
#3512 merged
May 14, 2025 -
Add Linux CI build for aarch64
#3516 merged
May 13, 2025 -
Update mutable_torchtrt_module_example.py
#3519 merged
May 13, 2025
12 Pull requests opened by 9 people
-
feat: Refactor LLM model zoo and add KV cache support
#3527 opened
May 20, 2025 -
FX graph visualization
#3528 opened
May 21, 2025 -
AOT plugin: examples with RMSNORM
#3529 opened
May 21, 2025 -
chore(deps): bump transformers from 4.48.0 to 4.50.0 in /tests/modules
#3533 opened
May 26, 2025 -
flux fp4 example(WIP)
#3537 opened
May 28, 2025 -
feat: Hierarchical Partitioner to support multi-backends
#3539 opened
May 29, 2025 -
add a quick fail fast feed-back actions before the full tests is triggered
#3541 opened
May 30, 2025 -
fix: replace add_identity by add_cast for type cast
#3563 opened
Jun 9, 2025 -
feat: Saving modules using the AOTI format
#3567 opened
Jun 11, 2025 -
fix: Fix a perf regression due to weights being ITensors
#3568 opened
Jun 11, 2025 -
adding rotary embedding example, with graph rewrite for complex subgraph [WIP]
#3570 opened
Jun 13, 2025
9 Issues closed by 6 people
-
Constant folding crashes if model opt is not installed
#3564 closed
Jun 11, 2025 -
Update the lock file to reflect the torch nightly
#3553 closed
Jun 8, 2025 -
Please Promote torch_tensorrt 2.6.1 release artifacts
#3546 closed
Jun 3, 2025 -
✨[Feature] Add `pickle_protocol` argument for `torch_tensorrt.save`
#3294 closed
May 30, 2025 -
✨[Feature] Warn users who are building Torch-TensorRT with CUDA 12.6 that they need to use CXX11 ABI
#3339 closed
May 30, 2025 -
❓ [Question] Do you have any plan to release v2.6.1 ?
#3536 closed
May 29, 2025 -
Remove this breakpoint
#3386 closed
May 22, 2025 -
🐛 [Bug] cannot load quantize_fp8 even though the modelopt[all] installed
#3232 closed
May 20, 2025
7 Issues opened by 4 people
-
🐛 [Bug] Using `torch_executed_ops` causes the metadata packing to fail
#3566 opened
Jun 11, 2025 -
🐛 [Bug] Severe performance regression with 2.7 and 2.8.dev
#3561 opened
Jun 8, 2025 -
🐛 [Bug] AttributeError: '_OpNamespace' 'tensorrt' object has no attribute 'quantize_op'
#3560 opened
Jun 8, 2025 -
Is it possible to enable Github Merge Queues for our Repos?
#3559 opened
Jun 6, 2025 -
Remove the distributed extra
#3554 opened
Jun 6, 2025 -
❓ [Question] How to save the compiled while using torch.compile
#3525 opened
May 20, 2025 -
❓ [Question] Manually Annotate Quantization Parameters in FX Graph
#3522 opened
May 16, 2025
10 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Added flux demo
#3418 commented on
Jun 13, 2025 • 15 new comments -
slice scatter support for dynamic cases
#3513 commented on
May 16, 2025 • 1 new comment -
✨[Feature] Is there a plan to support to convert quantized PT2 to trt ?
#3471 commented on
May 26, 2025 • 0 new comments -
🐛 [Bug] Difficulties Quantizing FP16 Models to INT8 Using torch_tensorrt (MLP, CNN, Attention, LSTM, Transformer)
#3494 commented on
May 30, 2025 • 0 new comments -
❓ [Question] Is SAM2 supported when compiling with the Dynamo backend on JetPack 6.1 or 6.2?
#3478 commented on
Jun 3, 2025 • 0 new comments -
✨[Feature] Add FX tests to CI
#3492 commented on
Jun 6, 2025 • 0 new comments -
🐛 [Bug] AssertionError: end must be an integer
#3448 commented on
Jun 12, 2025 • 0 new comments -
TRT-LLM loading mechanism tool
#3398 commented on
Jun 11, 2025 • 0 new comments -
Add support for JetPack 6.2 build
#3453 commented on
May 15, 2025 • 0 new comments -
Debugging guide for TensorRT
#3489 commented on
Jun 9, 2025 • 0 new comments