Triton Model Navigator v0.7.0
- new: Inplace Optimize feature - optimize models directly in Python code
- new: Support for non-tensor inputs and outputs
- new: Model warmup support in Triton model configuration
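Model warmup is declared in the Triton `config.pbtxt` through the `model_warmup` field; a minimal fragment might look like the following (the sample name, input name, and shape are illustrative):

```
model_warmup [
  {
    name: "sample_warmup"
    batch_size: 1
    inputs: {
      key: "INPUT__0"
      value: {
        data_type: TYPE_FP32
        dims: [ 3, 224, 224 ]
        zero_data: true
      }
    }
  }
]
```

Triton runs the declared samples against the model at load time, so the first client request does not pay lazy-initialization costs.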
- new: `nav.tensorrt.optimize` API for testing and measuring the performance of TensorRT models
- new: Extended custom configs to pass arguments directly to export and conversion operations such as `torch.onnx.export` or `polygraphy convert`
- new: Collect GPU clock during model profiling
- new: Option to configure minimal trials and stabilization windows for performance verification and profiling
- change: Navigator package version bumped to 0.2.3; custom configurations now use a `trt_profiles` list instead of a single value
- change: Separate reproduction scripts are now stored for the runners used during correctness checks and profiling
-
Versions of external components used during testing:
- PyTorch 2.1.0a0+b5021ba
- TensorFlow 2.12.0
- TensorRT 8.6.1
- ONNX Runtime 1.15.1
- Polygraphy 0.47.1
- GraphSurgeon: 0.3.27
- tf2onnx 1.14.0
- Other component versions depend on the versions of the framework containers used. See the framework containers' support matrix for a detailed summary.