Description
I converted an ONNX model to a TensorRT engine in FP16, and the inference results are abnormal compared with FP32.
fp16:

fp32:

Environment

TensorRT Version: 10.0
NVIDIA GPU:
NVIDIA Driver Version:
CUDA Version:
CUDNN Version:
Operating System:
Python Version (if applicable):
Tensorflow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if so, version):
Relevant Files
Model link:
onnx.zip
trt10_fp16.zip
trt10_fp32.zip
Steps To Reproduce
./trtexec --onnx=color_consistency_nafnet.onnx --saveEngine=nafnetcc75_t4_float16_v10.trtmodel --inputIOFormats=fp32:chw --outputIOFormats=fp32:chw --device=3 --minShapes=input:1x64x64x3 --optShapes=input:1x1024x1024x3 --maxShapes=input:1x1920x1920x3 --fp16
Commands or scripts:
Have you tried the latest release?:
Can this model run on other frameworks? For example run ONNX model with ONNXRuntime (polygraphy run <model.onnx> --onnxrt):
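One common cause of abnormal FP16 results (not confirmed for this model, just a general hypothesis) is the limited dynamic range of half precision: any intermediate activation above 65504 overflows to infinity, which then corrupts everything downstream. A minimal NumPy sketch of the effect, independent of TensorRT:

```python
import numpy as np

# FP16 tops out at 65504; larger magnitudes overflow to infinity.
fp16_max = float(np.finfo(np.float16).max)
print(fp16_max)  # 65504.0

# An intermediate value that is perfectly representable in FP32...
x32 = np.float32(1e5)

# ...overflows to inf when the layer runs in half precision.
x16 = np.float16(x32)
print(np.isinf(x16))  # True
```

If overflow is the cause, a typical workaround is to identify the offending layers (e.g. with Polygraphy's ONNX Runtime vs. TensorRT comparison) and pin them to FP32 via layer precision constraints while keeping the rest of the network in FP16.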