mlir-tensorrt-v0.4.5
·
117 commits
to main
since this release
What's Changed
- Integrate LLVM@d6e2143b064e by @christopherbate in #754
- [kernel] Add vector.from_elements unroll patterns to LowerToNVVM by @christopherbate in #755
- [compiler] Improve consistency of how 'target' attribute on grouping ops is handled by @christopherbate in #756
- [compiler] Move StableHLO partitioning attribute handling to StablehloToPlan pass by @christopherbate in #757
- [Plan] Rename cluster ops to shorter mnemonics by @christopherbate in #758
- [StablehloExt] Refactor simplification patterns into separate files by @christopherbate in #761
- integrate internal changes by @christopherbate in #762
- [CI] upgrade release base image from rockylinux8 to rocklinux9 by @lanluo-nvidia in #763
- [executor] Add SROA support by @christopherbate in #765
- [mlir-tensorrt] Integrate internal changes by @christopherbate in #768
- NFC: Add instructions for using custom llvm-project by @christopherbate in #767
- integrate internal changes by @christopherbate in #770
- [executor] Add
executor.ctpopoperation by @christopherbate in #769 - [executor] Handle type mismatches in getoffset lowering by @christopherbate in #771
- [CI] added ubuntu 22.04 container by @lanluo-nvidia in #764
- [compiler] NFC: consolidate Utils libraries from compiler to common by @christopherbate in #772
- [compiler] Add
phase-startandphase-endoptions to main pipeline by @christopherbate in #773 - [integrations/PJRT] Improve symbol visibility control and fix error message by @christopherbate in #774
- [CI] a few CI changes by @lanluo-nvidia in #760
- [executor] Introduce a pass to expand unsupproted Math operations by @christopherbate in #777
- [mlir-tensorrt] Add Stablehlo patch for various Stablehlo upstream pass issues by @christopherbate in #776
- [compiler][emitc] Add support for embedding and emitting runtime files by @christopherbate in #775
- [compiler] NFC: Add missing
memref-to-cudatest cases by @christopherbate in #779 - [integrations/PJRT] Fix CMake configuration for PJRT library by @christopherbate in #780
- NFC: Consolidate CUDA integration tests and simplify test commands by @christopherbate in #781
- integrate internal changes by @christopherbate in #782
- NFC: Optimize includes in header files to limit header size by @christopherbate in #783
- [CI] Add manylinux auditwheel repair by @lanluo-nvidia in #778
- integrate internal changes by @christopherbate in #785
Full Changelog: mlir-tensorrt-v0.4.4.dev202512190...mlir-tensorrt-v0.4.5