export model to onnx without FFT/IFFT #530

netv1 · 2023-07-24T13:46:13Z

❓ Questions

Hi guys, I've tried to export the model to onnx but because of the FFT (real-to-complex) operations in the model it seems the export cannot work. I have used the latest supported PyTorch 2.0.1 and ONNX opset 18. I want to use demucs from C/C++ and I can do the FFT part directly in C/C++ and provide it to the model along with the raw audio. However, I've no idea how to export the model without the FFT parts.

Eg. I want to input myself the FFT and the raw audio and get back the data for the inverse FFT and the time-domain data and add them myself. PyTorch is not my main expertise area as you can tell, but I can do the DSP pre/post-processing in C/C++.

What's the best way to go about this (exporting a partial graph or a model with multiple inputs/outputs)? I would appreciate any help or hints. Thanks

Bin-ze · 2023-08-17T08:39:27Z

I want to convert the model to onnx for deployment on embedded devices, but I can't achieve it with simple logic, because the model experiment apply_model function processing, can you tell me how to convert the model to onnx? Also have you implemented inference using the onnx backend.
Looking forward to your reply, I will be very grateful

netv1 · 2023-08-21T11:26:38Z

Unfortunately I haven't done any progress on this. Maybe the community will be kind enough to at least steer us in the right direction.

Bin-ze · 2023-08-22T01:33:35Z

Thanks for the replies, here are some possible relevant discussions I found in the community:

I am currently trying to avoid the FFT part everywhere, so as to bypass the problem that onnx does not support some operators

But I don't understand the audio field at all, I checked the code, but I can't understand the recursion in apply_model, and I can't understand what exactly Bagmodel implements, but I think if you want to completely convert this algorithm to onnx, then you have to convert The recursive implementation is converted to a loop, and then the corresponding models are exported separately, I hope to get your help, if you have already figured out this part, can you tell me how to do it

alexvoina · 2023-09-06T12:10:55Z

I'm interested in doing this too

netv1 · 2023-09-11T15:27:43Z

Unfortunately I still haven't had the time to revisit this (I'm still planning to maybe next month). It is 100% doable as both VirtualDJ and Serato are using it, so ...

alexvoina · 2023-09-11T16:31:35Z

let's keep in touch, we could join forces! I have some experience with this kind of work

AdarshAcharya5 · 2024-02-01T07:24:36Z

Thanks for the replies, here are some possible relevant discussions I found in the community:

[ONNX][Complex] Support view_as_complex pytorch/pytorch#49793

https://github.com/adobe-research/convmelspec

[ONNX] STFT Support pytorch/pytorch#92087

[ONNX] Support opset 17 operators pytorch/pytorch#81075

Exporting the operator stft to ONNX opset version 9 is not supported speechbrain/speechbrain#1455

I am currently trying to avoid the FFT part everywhere, so as to bypass the problem that onnx does not support some operators

But I don't understand the audio field at all, I checked the code, but I can't understand the recursion in apply_model, and I can't understand what exactly Bagmodel implements, but I think if you want to completely convert this algorithm to onnx, then you have to convert The recursive implementation is converted to a loop, and then the corresponding models are exported separately, I hope to get your help, if you have already figured out this part, can you tell me how to do it

If you're using HTDemucs or HDemucs, you can actually put STFT and ISTFT outside the model's forward call as it's only used in the beginning and the end of the call. I did the same and almost managed to convert it, however it seems ONNX doesn't support nn.MultiheadAttention operator in it's opset yet. Unfortunately the tracer doesn't show the exact line where it's failing to parse. I think the only way to go about this problem is to write our own multiheadattention function.
For context this is the exception it throws :

raise errors.UnsupportedOperatorError(
torch.onnx.errors.UnsupportedOperatorError: Exporting the operator 'aten::_native_multi_head_attention' to ONNX opset version 17 is not supported. Please feel free to request support or submit a pull request on PyTorch GitHub: https://github.com/pytorch/pytorch/issues.

jie-chen · 2024-02-11T00:45:56Z

It seems only HTDemucs use MultiheadAttention in transformer. So HDemucs should be good to go?

loretoparisi · 2024-08-28T09:52:11Z

Thanks for the replies, here are some possible relevant discussions I found in the community:

[ONNX][Complex] Support view_as_complex pytorch/pytorch#49793

https://github.com/adobe-research/convmelspec

[ONNX] STFT Support pytorch/pytorch#92087

[ONNX] Support opset 17 operators pytorch/pytorch#81075

Exporting the operator stft to ONNX opset version 9 is not supported speechbrain/speechbrain#1455

I am currently trying to avoid the FFT part everywhere, so as to bypass the problem that onnx does not support some operators
But I don't understand the audio field at all, I checked the code, but I can't understand the recursion in apply_model, and I can't understand what exactly Bagmodel implements, but I think if you want to completely convert this algorithm to onnx, then you have to convert The recursive implementation is converted to a loop, and then the corresponding models are exported separately, I hope to get your help, if you have already figured out this part, can you tell me how to do it

If you're using HTDemucs or HDemucs, you can actually put STFT and ISTFT outside the model's forward call as it's only used in the beginning and the end of the call. I did the same and almost managed to convert it, however it seems ONNX doesn't support nn.MultiheadAttention operator in it's opset yet. Unfortunately the tracer doesn't show the exact line where it's failing to parse. I think the only way to go about this problem is to write our own multiheadattention function. For context this is the exception it throws :
raise errors.UnsupportedOperatorError(
torch.onnx.errors.UnsupportedOperatorError: Exporting the operator 'aten::_native_multi_head_attention' to ONNX opset version 17 is not supported. Please feel free to request support or submit a pull request on PyTorch GitHub: https://github.com/pytorch/pytorch/issues.

@AdarshAcharya5 @netv1 Is this an issue in the latest opset-20?

netv1 added the question Further information is requested label Jul 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

export model to onnx without FFT/IFFT #530

export model to onnx without FFT/IFFT #530

netv1 commented Jul 24, 2023

Bin-ze commented Aug 17, 2023

netv1 commented Aug 21, 2023

Bin-ze commented Aug 22, 2023

alexvoina commented Sep 6, 2023

netv1 commented Sep 11, 2023

alexvoina commented Sep 11, 2023

AdarshAcharya5 commented Feb 1, 2024

jie-chen commented Feb 11, 2024

loretoparisi commented Aug 28, 2024

export model to onnx without FFT/IFFT #530

export model to onnx without FFT/IFFT #530

Comments

netv1 commented Jul 24, 2023

❓ Questions

Bin-ze commented Aug 17, 2023

netv1 commented Aug 21, 2023

Bin-ze commented Aug 22, 2023

alexvoina commented Sep 6, 2023

netv1 commented Sep 11, 2023

alexvoina commented Sep 11, 2023

AdarshAcharya5 commented Feb 1, 2024

jie-chen commented Feb 11, 2024

loretoparisi commented Aug 28, 2024