
Export to ONNX #146

Closed
ml5ah opened this issue Aug 4, 2022 · 20 comments
Labels: enhancement (New feature or request), stale

ml5ah commented Aug 4, 2022

Is your feature request related to a problem? Please describe.
Is there a script to convert the Address Parser (.ckpt) model to ONNX (.onnx)?

Describe the solution you'd like
Has anyone successfully converted the address parser model to the ONNX format?

ml5ah added the enhancement (New feature or request) label Aug 4, 2022
ml5ah changed the title from "[FEATURE] Export to ONNX" to "Export to ONNX" Aug 4, 2022

github-actions bot commented Aug 4, 2022

Thank you for your interest in improving Deepparse.

davebulaval (Collaborator) commented Aug 4, 2022

Hi @ml5ah,

I only used ONNX once, and it was not a successful experience (it was my initial idea for handling Deepparse weights).

If I recall correctly, the bottleneck was that you had to fix the batch size to a specific value, which made it cumbersome to find an appropriate one. However, that was back in 2019 or so, and things might have evolved since. I will take a look at it.

So the best case would be an export method for an AddressParser to export itself into the ONNX format, right?

davebulaval self-assigned this Aug 4, 2022
davebulaval (Collaborator) commented:

I've looked at the PyTorch documentation, and it still seems like you need to provide a batch-like example input for the export.

If you come up with a method that can export the AddressParser.model attribute into the ONNX format, I will be more than happy to merge it as a PR. Otherwise, I don't find ONNX helpful and will only provide a new save_address_parser_weights method to save the model's weights in the next release.

ml5ah (Author) commented Aug 4, 2022

Thanks for the reply, @davebulaval!

Yes, that's correct; that would be the best way. I have been trying to work with the built-in export function in torch, but I keep running into issues. The export call works, but I'm having trouble initializing an inference session in onnxruntime.

Fixing the batch_size to 1 should be good as well, for starters!

Do share any insights/suggestions. Thanks!

FYI, my error (screenshot attached: Screen Shot 2022-08-04 at 11 14 26).
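For illustration, a minimal sketch of the kind of torch.onnx.export call being attempted here, with the batch size fixed to 1. The input shapes, the input/output names, and the model's forward signature are assumptions, not Deepparse's actual API:

```python
# Hedged sketch: export the underlying seq2seq model with a fixed batch size of 1.
# The forward signature (pre-embedded tensor + lengths) is an assumption.
import torch
from deepparse.parser import AddressParser

address_parser = AddressParser(model_type="fasttext", device="cpu")
model = address_parser.model  # the underlying PyTorch model

# One pre-embedded address of 6 words, 300-d fastText vectors.
dummy_input = torch.randn(1, 6, 300)
dummy_lengths = torch.tensor([6], dtype=torch.int64)

torch.onnx.export(
    model,
    (dummy_input, dummy_lengths),  # example inputs; the actual arguments may differ
    "address_parser.onnx",
    opset_version=11,
    input_names=["embedded_address", "lengths"],
    output_names=["tag_logits"],
)
```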

@ml5ah
Copy link
Author

ml5ah commented Aug 4, 2022

@davebulaval saw your updated reply - got it, that makes sense. Sure, I'll keep you posted if I have any success.

davebulaval (Collaborator) commented:

It seems like a float typing error (some tensors are converted into float and others into long). LSTM parameters are LongTensor, and the problem may be there.
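One quick, generic way to investigate that hypothesis is to list any non-float tensors in the model before exporting (reusing model from the export sketch above):

```python
# Print every parameter/buffer whose dtype is not float32; stray int64
# tensors ending up in the traced graph are a common source of this error.
for name, tensor in model.state_dict().items():
    if tensor.dtype != torch.float32:
        print(name, tensor.dtype)
```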

davebulaval (Collaborator) commented:

@ml5ah I've just added the save_model_weights method to the AddressParser class in dev. It saves the PyTorch state dictionary in a pickle format.

If you need to use the model in another ML framework or code base (e.g. Java), you can 'simply' load the weights matrices. Usually, this is convenient, but you might need some naming/format conversion.
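For illustration, a minimal sketch of loading such a pickled state dictionary outside Deepparse (the file name is an assumption):

```python
# Inspect the saved weights: each entry is a named tensor whose name and shape
# you can map onto another framework's naming scheme.
import torch

state_dict = torch.load("address_parser_weights.ckpt")
for name, tensor in state_dict.items():
    print(name, tuple(tensor.shape), tensor.dtype)
```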

ml5ah (Author) commented Aug 8, 2022

Thanks @davebulaval! That function helped, and I was able to move forward, albeit I faced some more roadblocks along the way.

I faced two problems:

  1. The size of the input tensor varies with the number of "words" in the input address text. This impacts the decomposition-lengths input as well. I solved this temporarily to unblock myself, but I'm not sure whether ONNX can handle it.

  2. While exporting, a resolve_conj operator is used, which does not appear to be supported in any opset version. Documented here: ONNX export fails with a resolve_conj op when a tensor slice is printed (pytorch/pytorch#73619). We might have to wait a bit for this to be supported.

davebulaval (Collaborator) commented:

@ml5ah I see. Yeah, it seems like it's not possible for now.

And what exactly is your objective? Into which language are you trying to import the model?

ml5ah (Author) commented Aug 8, 2022

@davebulaval the objective is to deploy the pre-trained address parser model for inference using onnxruntime (in either Python or Java). To do this, I've been trying to convert the model to ONNX using Python.
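For illustration, a minimal sketch of that deployment target: running the exported model with onnxruntime and no PyTorch dependency (the file name and input names carry over from the earlier export sketch and are assumptions):

```python
# Run inference with onnxruntime only; inputs are plain NumPy arrays.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("address_parser.onnx")
outputs = session.run(
    None,  # None fetches all model outputs
    {
        "embedded_address": np.random.randn(1, 6, 300).astype(np.float32),
        "lengths": np.array([6], dtype=np.int64),
    },
)
print(outputs[0].shape)  # e.g. (1, 6, num_tags)
```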

davebulaval (Collaborator) commented:
Ok, I got it. Do you want the address parser as an API-like service?

ml5ah (Author) commented Aug 9, 2022

Yep, exactly, with the constraint that inference uses onnxruntime with no dependency on PyTorch.

davebulaval (Collaborator) commented:
Keep us updated on your progress. I would love to have 1) the script for the ONNX conversion and 2) the script to bundle it into an API. It would be a great documentation improvement to have that.


github-actions bot commented Oct 9, 2022

This issue is stale because it has been open 60 days with no activity. Stale issues will automatically be closed 30 days after being marked stale.

github-actions bot added the stale label Oct 9, 2022
github-actions bot closed this as not planned (won't fix, can't repro, duplicate, stale) Jan 8, 2023
kleineroscar commented Apr 26, 2023

@ml5ah hey, I'm curious whether you managed to create an ONNX export? If yes, it would be great if you could share your insights.

davebulaval (Collaborator) commented:
The last time I checked, my bottleneck with ONNX was that batch size needed to be fixed beforehand.

ml5ah (Author) commented May 23, 2023

@kleineroscar @davebulaval apologies for the late reply. It's been a while since I actively looked at this issue. ONNX does provide support for dynamic batch sizes via the dynamic_axes feature, but as far as I remember, it did not work out of the box for Deepparse the last time I tried.

I will give it another shot - hopefully things have changed.
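For reference, a minimal sketch of what a dynamic_axes export would look like, reusing the assumed model, inputs, and input/output names from the earlier export sketch:

```python
# Mark the batch and sequence dimensions as dynamic so differently sized
# inputs do not require re-exporting (axis names are free-form labels).
torch.onnx.export(
    model,
    (dummy_input, dummy_lengths),
    "address_parser_dynamic.onnx",
    opset_version=11,
    input_names=["embedded_address", "lengths"],
    output_names=["tag_logits"],
    dynamic_axes={
        "embedded_address": {0: "batch", 1: "sequence"},
        "lengths": {0: "batch"},
        "tag_logits": {0: "batch", 1: "sequence"},
    },
)
```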

ml5ah (Author) commented May 24, 2023

Took another look; this doesn't seem to be a trivial problem. pytorch/pytorch#28423 is another issue that will need to be solved before the parser model can be exported to ONNX. I also tried dealing with the embedding, encoder, and decoder models separately; that is also non-trivial.

Please share your thoughts.

cc: @kleineroscar @davebulaval

davebulaval (Collaborator) commented:
The embedding conversion is done outside of the model; thus, the LSTM expects to receive a 300-d vector. For simplicity, I think we could fix a batch size, but it would require padding the examples whenever the number of addresses to parse is smaller than the batch size (see the sketch below).
I am not a fan of ONNX. I found it to be too rigid.
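A minimal sketch of that padding workaround, assuming a fixed export batch size of 4 and 300-d embeddings (the helper and its name are hypothetical):

```python
# Pad a short list of pre-embedded addresses up to a fixed batch size so the
# input matches the shape the exported graph was traced with.
import torch

def pad_batch(embedded_addresses, batch_size=4, dim=300):
    max_len = max(t.shape[0] for t in embedded_addresses)
    batch = torch.zeros(batch_size, max_len, dim)
    # Filler rows get length 1 (not 0) to avoid zero-length sequence errors.
    lengths = torch.ones(batch_size, dtype=torch.int64)
    for i, t in enumerate(embedded_addresses):
        batch[i, : t.shape[0]] = t
        lengths[i] = t.shape[0]
    return batch, lengths

# Usage: two addresses of 5 and 7 words, padded out to a batch of 4.
batch, lengths = pad_batch([torch.randn(5, 300), torch.randn(7, 300)])
```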

I've added functionality to allow someone to put the weights in an S3 bucket and process data in a cloud service. It could be a workaround for creating an API in Python.

I have a friend working on Burn, a Rust Torch-like framework, but LSTM is not yet implemented there. I would prioritize a Rust implementation of Deepparse rather than working on/with ONNX.
