Not able to see performance gain for the Optimized model? #5710

mwangms · 2023-10-27T23:31:42Z

mwangms
Oct 27, 2023

After export a PyTorch model, run optimizer to get an optimized model (), then run it on RTX8000, was unable to see if any performance gain.

The optimizing code is as below:
optimizer = ORTOptimizer.from_pretrained(model)
optimization_config = AutoOptimizationConfig.O4(disable_shape_inference=True)
optimizer.optimize(save_dir=onnx_optimized_output_path, optimization_config=optimization_config)

The code to run inference:
ort_session = ort.InferenceSession(onnx_optimized_model_path, providers='CUDAExecutionProvider')
outputs = self._inference_session.run(["logits"], {"input_ids": input_ids, "attention_mask": attention_mask})

Anyone can share some insights about what's going wrong? Is that specific GPU card required for this optimization? Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Not able to see performance gain for the Optimized model? #5710

{{title}}

Replies: 0 comments

Select a reply

Not able to see performance gain for the Optimized model? #5710

mwangms Oct 27, 2023

Replies: 0 comments

mwangms
Oct 27, 2023