Replies: 2 comments
It depends on the source model representation. For a PyTorch model, we run torch.jit.trace under the hood using the example_input you provide as a convert_model parameter. After the model is traced, we walk over the trace and convert each operation from its Torch representation to OpenVINO operations (one or more for each source operation). Along this path we try to keep large weight tensors in their original memory without copying them, so when you convert a PyTorch model, part of the weights in the resulting ov.Model are shared with the original PyTorch model. This is quite handy for LLMs, for example. When you save the model or call compile_model, the weights are copied.

When converting a model from a file, for example TF or ONNX, we can sometimes avoid loading the original model into memory completely, depending on which type of model file is used. For example, for an ONNX model whose weights are stored in a set of separate files, we do not load them fully into memory during conversion. The same is true for the TF saved_model representation. But when you compile the model with compile_model, all of the weights will be required in memory.
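A minimal sketch of the two paths described above, assuming OpenVINO 2023.1+ (the `openvino` package with `ov.convert_model`) and PyTorch installed; the toy model and file paths are illustrative only:

```python
import torch
import openvino as ov

# 1) In-memory PyTorch path: convert_model traces the model (torch.jit.trace
#    under the hood) with example_input, producing an ov.Model whose large
#    constants may still share memory with the original PyTorch tensors.
torch_model = torch.nn.Linear(1024, 1024).eval()   # toy model for illustration
example_input = torch.randn(1, 1024)
ov_model = ov.convert_model(torch_model, example_input=example_input)

# Saving or compiling is the point where OpenVINO makes its own copy of the weights.
ov.save_model(ov_model, "linear.xml")                 # weights written out to linear.bin
compiled = ov.Core().compile_model(ov_model, "CPU")   # weights copied into the device plugin

# 2) File-based path: for an ONNX model (possibly with external weight files) or a
#    TF saved_model directory, conversion can avoid loading all original weights,
#    but compile_model still needs the full weights in memory.
# ov_model_from_file = ov.convert_model("model.onnx")        # illustrative path
# ov_model_from_tf   = ov.convert_model("saved_model_dir")   # illustrative path
# compiled_from_file = ov.Core().compile_model(ov_model_from_file, "CPU")
```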
Can someone please explain the flow of taking a PyTorch/TF/ONNX model through the convert_model API? What happens behind the scenes? I would like to understand whether there are any memory issues with keeping the converted model in memory.