I am using ONNX Runtime to generate a TensorRT engine from an ONNX model file.
I set trt_options.trt_engine_cache_enable = 1; and trt_options.trt_engine_cache_path = "./path"; so that the engine file is generated once and loaded on later runs, which works well (it speeds things up considerably).
However, I'm not sure what exactly that engine file is.
Does it include the model's weights?
Can it be used to load the full model without access to the .onnx model file?
My question arises because we encrypt our model files and load them from RAM at runtime, so that other people with access to the system cannot access our models. If the cache effectively gives full access to the model, we will need a different solution.