Support exporting and loading ONNX models #556
Because ONNX requires a numeric vector input, the input will logically be either (a) audio samples or (b) a pre-processed 2D representation of the audio such as a spectrogram, potentially with multiple channels. The advantage of passing the audio sample vector (wav) is that all preprocessing parameters are included in the model; the only thing the user has to get right is the audio sampling rate. Option (b) gives more flexibility because pre-processing does not need to be packaged into the model, but it creates more opportunity for pre-processing operations and parameters to be lost or mis-implemented when the model changes hands.
PyTorch support for stft with ONNX is still a work in progress.
Apparently `torch.onnx.dynamo_export` will add some ONNX operators. We could also apparently do some custom handling to use already-implemented ONNX functions, but I don't fully understand how (see https://github.com/Alexey-Kamenev/tensorrt-dft-plugins/blob/main/tests/test_dft.py#L35 and pytorch/pytorch#81075 (comment)). Modulus has done something similar: https://github.com/NVIDIA/modulus/blob/main/modulus/models/afno/afno.py#L140. If/when we implement something like this, we will need all preprocessing steps (for inference in the exported ONNX model) to be part of the PyTorch model, i.e., layers with forward methods. This raises the question of whether we will end up entirely changing from the use of …
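To illustrate "preprocessing steps as layers with forward methods", here is a sketch of the pattern using an export-friendly toy step. `PerChannelNormalize` is hypothetical; a real spectrogram transform would slot in the same way once its underlying ops are ONNX-exportable:

```python
import torch
import torch.nn as nn

class PerChannelNormalize(nn.Module):
    """Hypothetical preprocessing layer: zero-mean, unit-variance per sample.
    Because it is an nn.Module with a forward method, it is traced and
    exported along with the network instead of living in external code."""
    def forward(self, x):
        mean = x.mean(dim=-1, keepdim=True)
        std = x.std(dim=-1, keepdim=True)
        return (x - mean) / (std + 1e-8)

# Preprocessing and network share one forward pass, so both travel together
# through ONNX export.
net = nn.Sequential(
    PerChannelNormalize(),
    nn.Linear(16000, 10),  # toy head operating on 1 s of 16 kHz audio
)
wav = torch.randn(4, 16000)
out = net(wav)
```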
Apparently this should now work!
There's a new torch issue to follow for fft export to ONNX: pytorch/pytorch#113067
stft + ONNX still doesn't seem to be ready (pytorch/pytorch#113067 (comment)).
We should be able to:

1. Export a model to ONNX so that someone could run predictions using the model without opensoundscape.
2. Load an ONNX model and generate predictions. If this is best done with a simple torch script rather than by implementing something in opensoundscape, that's fine; we can just add documentation of how to do this.
3. Load an ONNX model into opensoundscape such that we could re-train (e.g., warm-starting) within opensoundscape.
It may be necessary or at least logical to use torchaudio to incorporate preprocessing steps into the model, as mentioned in #337
Be aware of the numpy & built-in types caveats for the torch.onnx module
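The caveat is that Python numbers (e.g. from `.item()`) and numpy values are recorded as constants during tracing rather than as graph ops. A sketch with `torch.jit.trace`, which is the same tracing that underlies the classic `torch.onnx.export` path:

```python
import torch
import torch.nn as nn

class BakedConstant(nn.Module):
    def forward(self, x):
        # .item() yields a Python float, so tracing bakes this value into
        # the graph as a constant instead of recording the max() op
        scale = x.max().item()
        return x / scale

class StaysDynamic(nn.Module):
    def forward(self, x):
        # keeping everything as tensors records max() in the graph
        return x / x.max()

example = torch.tensor([1.0, 2.0])
baked = torch.jit.trace(BakedConstant(), example)  # emits a TracerWarning
dynamic = torch.jit.trace(StaysDynamic(), example)

new_input = torch.tensor([1.0, 4.0])
# baked still divides by 2.0 (the example input's max: [0.5, 2.0]);
# dynamic recomputes the max and divides by 4.0 ([0.25, 1.0])
```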