Skip to content

trtexec segfault on AGX 64 only #1995

@joihn

Description

@joihn

I'm converting a ONNX model to .engine
trtexec --fp16 --onnx=/home/maxime/model.onnx --saveEngine=out.engine

This commands used to work well on nvidia xavier AGX 32gb with jetpack 4.6.
However, I recently upgraded to nvidia xavier AGX 64gb and I have the following segfault (tested both on jetpack 4.6 and 4.6.1)

trtexec  --fp16  --onnx=/home/maxime/fp16-weights-291-0.89.onnx --saveEngine=OUTI.engine
&&&& RUNNING TensorRT.trtexec [TensorRT v8201] # /usr/src/tensorrt/bin/trtexec --fp16 --onnx=/home/maxime/model.onnx --saveEngine=OUTI.engine
[05/20/2022-15:18:47] [I] === Model Options ===
[05/20/2022-15:18:47] [I] Format: ONNX
[05/20/2022-15:18:47] [I] Model: /home/maxime/fp16-weights-291-0.89.onnx
[05/20/2022-15:18:47] [I] Output:
[05/20/2022-15:18:47] [I] === Build Options ===
[05/20/2022-15:18:47] [I] Max batch: explicit batch
[05/20/2022-15:18:47] [I] Workspace: 16 MiB
[05/20/2022-15:18:47] [I] minTiming: 1
[05/20/2022-15:18:47] [I] avgTiming: 8
[05/20/2022-15:18:47] [I] Precision: FP32+FP16
[05/20/2022-15:18:47] [I] Calibration: 
[05/20/2022-15:18:47] [I] Refit: Disabled
[05/20/2022-15:18:47] [I] Sparsity: Disabled
[05/20/2022-15:18:47] [I] Safe mode: Disabled
[05/20/2022-15:18:47] [I] DirectIO mode: Disabled
[05/20/2022-15:18:47] [I] Restricted mode: Disabled
[05/20/2022-15:18:47] [I] Save engine: OUTI.engine
[05/20/2022-15:18:47] [I] Load engine: 
[05/20/2022-15:18:47] [I] Profiling verbosity: 0
[05/20/2022-15:18:47] [I] Tactic sources: Using default tactic sources
[05/20/2022-15:18:47] [I] timingCacheMode: local
[05/20/2022-15:18:47] [I] timingCacheFile: 
[05/20/2022-15:18:47] [I] Input(s)s format: fp32:CHW
[05/20/2022-15:18:47] [I] Output(s)s format: fp32:CHW
[05/20/2022-15:18:47] [I] Input build shapes: model
[05/20/2022-15:18:47] [I] Input calibration shapes: model
[05/20/2022-15:18:47] [I] === System Options ===
[05/20/2022-15:18:47] [I] Device: 0
[05/20/2022-15:18:47] [I] DLACore: 
[05/20/2022-15:18:47] [I] Plugins:
[05/20/2022-15:18:47] [I] === Inference Options ===
[05/20/2022-15:18:47] [I] Batch: Explicit
[05/20/2022-15:18:47] [I] Input inference shapes: model
[05/20/2022-15:18:47] [I] Iterations: 10
[05/20/2022-15:18:47] [I] Duration: 3s (+ 200ms warm up)
[05/20/2022-15:18:47] [I] Sleep time: 0ms
[05/20/2022-15:18:47] [I] Idle time: 0ms
[05/20/2022-15:18:47] [I] Streams: 1
[05/20/2022-15:18:47] [I] ExposeDMA: Disabled
[05/20/2022-15:18:47] [I] Data transfers: Enabled
[05/20/2022-15:18:47] [I] Spin-wait: Disabled
[05/20/2022-15:18:47] [I] Multithreading: Disabled
[05/20/2022-15:18:47] [I] CUDA Graph: Disabled
[05/20/2022-15:18:47] [I] Separate profiling: Disabled
[05/20/2022-15:18:47] [I] Time Deserialize: Disabled
[05/20/2022-15:18:47] [I] Time Refit: Disabled
[05/20/2022-15:18:47] [I] Skip inference: Disabled
[05/20/2022-15:18:47] [I] Inputs:
[05/20/2022-15:18:47] [I] === Reporting Options ===
[05/20/2022-15:18:47] [I] Verbose: Disabled
[05/20/2022-15:18:47] [I] Averages: 10 inferences
[05/20/2022-15:18:47] [I] Percentile: 99
[05/20/2022-15:18:47] [I] Dump refittable layers:Disabled
[05/20/2022-15:18:47] [I] Dump output: Disabled
[05/20/2022-15:18:47] [I] Profile: Disabled
[05/20/2022-15:18:47] [I] Export timing to JSON file: 
[05/20/2022-15:18:47] [I] Export output to JSON file: 
[05/20/2022-15:18:47] [I] Export profile to JSON file: 
[05/20/2022-15:18:47] [I] 
Segmentation fault (core dumped)

Environment

TensorRT Version: TensorRT v8201

CUDA Version: cuda_10.2
Operating System: jetpack 4.6.1

Hardware

NVIDIA GPU: xavier AGX64 with a CTI carrier board (ref AGX111)

Metadata

Metadata

Assignees

Labels

triagedIssue has been triaged by maintainers

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions