mlir-tensorrt-v0.4.4.dev202512190
Pre-release
Major Features
- Open-source the kernel generation backend
- Open-source the PJRT (XLA/JAX) integration
- Various optimizations targeting StableHLO while loops and their raising to for-style loops
- Improve C++ host code generation pipeline; fill in some missing EmitC converters
Generated Release Notes
- Update CI workflows and devcontainer configs for CUDA 12.9 and Ubuntu 24.04 by @christopherbate in #723
- migrate internal changes by @christopherbate in #722
- migrate internal changes by @christopherbate in #724
- [CI] Adding Test Pipeline for PRs by @lanluo-nvidia in #725
- [mlir-tensorrt] Migrate internal change by @christopherbate in #726
- Add conversion of JAX shape_assertion custom calls to cf.assert by @christopherbate in #728
- Add Math dialect to EmitC conversion support by @christopherbate in #729
- Refactor: Move StableHLO matchers to dialect utils and reorganize namespaces by @christopherbate in #730
- Fix zero-rank tensor handling in shape functions by @christopherbate in #731
- [compiler] Fix missing test file by @christopherbate in #732
- [compiler] Add `stablehlo.get_dimension_size` conversion in `stablehlo-to-linalg` by @christopherbate in #733
- [compiler] Add `cf.assert` support to `convert-host-to-emitc` by @christopherbate in #734
- NFC: make minor updates to pass naming by @christopherbate in #735
- [tensorrt] Make new transpose/reshape elimination pass as v2 pass. by @christopherbate in #736
- [compiler] Add `mtrt-scf-strength-reduction` pass by @christopherbate in #737
- integrate internal changes by @christopherbate in #738
- integrate internal changes by @christopherbate in #739
- [mlir-tensorrt] Integrate Kernel backend by @christopherbate in #740
- [mlir-tensorrt] Migrate PJRT integration by @christopherbate in #741
- [CMake] Fix TRT CUDA version selection and add tests by @christopherbate in #745
- [CI] Add release wheel pipeline by @lanluo-nvidia in #742
- [mlir-tensorrt] NFC: Integrate internal changes by @christopherbate in #746
- [compiler] Add `mtrt-scf-float-strength-reduce` pass to preprocessing pipeline by @christopherbate in #747
- NFC: Fix formatting in MLIR-TRT README.md by @christopherbate in #749
- [integrations/PJRT] Implement executable metadata queries by @christopherbate in #748
- [integrations/PJRT] Implement additional PJRT methods for executable serialization/deserialization by @christopherbate in #750
- NFC: Update MLIR-TRT build instructions for PJRT integration wheels by @christopherbate in #751
- Upgrade GitHub Actions for Node 24 compatibility by @salmanmkc in #743
- integrate internal changes by @christopherbate in #752
- [mlir-tensorrt] Update LLVM patch to add support for PTX ISA v9.0 by @christopherbate in #753
Full Changelog: mlir-tensorrt-v0.4.2dev20251112...mlir-tensorrt-v0.4.4.dev202512190