onnx exported problem #8

unparalleled-ysj · 2022-11-16T03:55:20Z

May I ask how you dealt with the “RuntimeError: Unknown number type: complex” problem caused by torch.istft when exporting the onnx model

Jackiexiao · 2022-11-28T07:25:07Z

torch.istft is currently not support to convert to onnx and still in development, see: pytorch/pytorch#81075

MasayaKawamura · 2022-11-28T07:32:40Z

Hi @unparalleled-ysj, I'm sorry to be late...
torch.istft is currently not supported by onnx, so please exclude only the istft part when exporting to onnx.
@Jackiexiao, thank you for your comment!

MasayaKawamura · 2022-11-28T07:36:11Z

Maybe this URL will also be useful for torch onnx.

Jackiexiao · 2022-11-28T07:38:58Z

I'm confused, we have to pass istft function to get wav, if we exclude istft part, we can't get result @MasayaKawamura

MasayaKawamura · 2022-11-28T07:48:30Z

@Jackiexiao
I think that it is possible to export the processes except for istft using onnx. During inference, I think that wav can be obtained by combining onnx and torch istft code.

Jackiexiao · 2022-11-28T07:51:11Z

ok, I get, looking forward to get istft support in torch nightly, so we just need onnx during inference

unparalleled-ysj · 2022-11-28T08:13:46Z

You can use

MB-iSTFT-VITS/stft.py

Line 144 in df2f8d3

def inverse(self, magnitude, phase):

instead of

MB-iSTFT-VITS/stft.py

Line 197 in df2f8d3

def inverse(self, magnitude, phase):

when exporting, which can successfully export the model as onnx, but at the same time, there will be rattling noise in the speech. After my ablation comparison, the problem still appears in the istft export (because the original model has no problem)

unparalleled-ysj · 2022-11-28T08:23:24Z

@Jackiexiao @MasayaKawamura refer to pytorch/pytorch#31317 (comment)

Jackiexiao · 2022-11-28T08:26:15Z

thx

FanhuaandLuomu · 2022-12-09T07:50:12Z

Hi @MasayaKawamura
Can you share your code to save onnx model, i got some problems when i convert to onnx.

Jackiexiao · 2022-12-09T07:52:37Z

FYI see: https://github.com/wenet-e2e/wetts/blob/main/wetts/vits/export_onnx.py but you can't export istft vocoder to onnx here @FanhuaandLuomu

abylouw · 2022-12-21T18:09:36Z

FYI see: https://github.com/wenet-e2e/wetts/blob/main/wetts/vits/export_onnx.py but you can't export istft vocoder to onnx here @FanhuaandLuomu

hi @Jackiexiao,

I have tried the above script to export but I have had no success. Would you mind sharing your export code?

abylouw · 2022-12-21T18:10:15Z

Hi @MasayaKawamura Can you share your code to save onnx model, i got some problems when i convert to onnx.

Hi @FanhuaandLuomu, have you succeeded in exporting the model?

Jackiexiao · 2022-12-22T02:08:30Z

@abylouw it just work in original vits(not for mbistft, but they work the same way, except the vocoder part), and wetts repo has all code you need

JohnHerry · 2023-08-16T06:23:08Z

You can use

MB-iSTFT-VITS/stft.py

Line 144 in df2f8d3

def inverse(self, magnitude, phase):

instead of

MB-iSTFT-VITS/stft.py

Line 197 in df2f8d3

def inverse(self, magnitude, phase):

when exporting, which can successfully export the model as onnx, but at the same time, there will be rattling noise in the speech. After my ablation comparison, the problem still appears in the istft export (because the original model has no problem)

Do we need to use the class STFT instead of TorchSTFT during the training in this case?

JohnHerry · 2023-08-18T07:46:34Z

@Jackiexiao I think that it is possible to export the processes except for istft using onnx. During inference, I think that wav can be obtained by combining onnx and torch istft code.

I have tried to split the MB-iSTFT-VITS into this two parts and the former transfered into onnx, it is succeed. but as to the MS-iSTFT-VITS, I have to split the model into three parts, which case the first and the third part should be transfer into onnx models. as to the third part, the multi-band filter, the self.multistream_conv_post layer there is a weight_norm, should I keep the weight_norm layer there? I saw your remove_weight_norm function in the class did not remove this part. If the weight_norm can be removed during transfer into onnx, should I just put the "dec.multistream_conv_post.weight_v" value in the checkpoint , into my self defined third model part?

nshmyrev · 2023-10-01T19:57:10Z

Do we need to use the class STFT instead of TorchSTFT during the training in this case?

You do not have to use STFT during training, only during export. See here

alphacep/MB-iSTFT-VITS2@29c91d4

see also

FENRlR/MB-iSTFT-VITS2#3

Insensiblee · 2023-11-09T08:23:32Z

在这种情况下，我们在训练过程中是否需要使用 STFT 类来代替 TorchSTFT ？

您不必在训练期间使用 STFT，只需在导出期间使用。看这里

alphacep/MB-iSTFT-VITS2@ 29c91d4

也可以看看

FENRlR/MB-iSTFT-VITS2#3

I used this code to transfer onnx to process the pre-trained model provided, why did I report this error：AttributeError: 'ResidualCouplingLayer' object has no attribute 'remove_weight_norm'

JohnHerry · 2023-11-10T01:17:39Z

在这种情况下，我们在训练过程中是否需要使用 STFT 类来代替 TorchSTFT ？

您不必在训练期间使用 STFT，只需在导出期间使用。看这里
alphacep/MB-iSTFT-VITS2@ 29c91d4
也可以看看
FENRlR/MB-iSTFT-VITS2#3

I used this code to transfer onnx to process the pre-trained model provided, why did I report this error：AttributeError: 'ResidualCouplingLayer' object has no attribute 'remove_weight_norm'

No, the STFT from this project, which is the same with the one from the iSTFTNet project, is not good for onnx model exporation.
it can help generate a onnx model for inference, but this model will failed for some input cases. and even for those success, the generated wavform will contains some noise.

nshmyrev mentioned this issue Oct 1, 2023

Export of model to ONNX #20

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

onnx exported problem #8

onnx exported problem #8

unparalleled-ysj commented Nov 16, 2022

Jackiexiao commented Nov 28, 2022 •

edited

Loading

MasayaKawamura commented Nov 28, 2022

MasayaKawamura commented Nov 28, 2022

Jackiexiao commented Nov 28, 2022 •

edited

Loading

MasayaKawamura commented Nov 28, 2022

Jackiexiao commented Nov 28, 2022

unparalleled-ysj commented Nov 28, 2022

unparalleled-ysj commented Nov 28, 2022

Jackiexiao commented Nov 28, 2022

FanhuaandLuomu commented Dec 9, 2022

Jackiexiao commented Dec 9, 2022

abylouw commented Dec 21, 2022

abylouw commented Dec 21, 2022

Jackiexiao commented Dec 22, 2022

JohnHerry commented Aug 16, 2023

JohnHerry commented Aug 18, 2023

nshmyrev commented Oct 1, 2023

Insensiblee commented Nov 9, 2023

JohnHerry commented Nov 10, 2023 •

edited

Loading

onnx exported problem #8

onnx exported problem #8

Comments

unparalleled-ysj commented Nov 16, 2022

Jackiexiao commented Nov 28, 2022 • edited Loading

MasayaKawamura commented Nov 28, 2022

MasayaKawamura commented Nov 28, 2022

Jackiexiao commented Nov 28, 2022 • edited Loading

MasayaKawamura commented Nov 28, 2022

Jackiexiao commented Nov 28, 2022

unparalleled-ysj commented Nov 28, 2022

unparalleled-ysj commented Nov 28, 2022

Jackiexiao commented Nov 28, 2022

FanhuaandLuomu commented Dec 9, 2022

Jackiexiao commented Dec 9, 2022

abylouw commented Dec 21, 2022

abylouw commented Dec 21, 2022

Jackiexiao commented Dec 22, 2022

JohnHerry commented Aug 16, 2023

JohnHerry commented Aug 18, 2023

nshmyrev commented Oct 1, 2023

Insensiblee commented Nov 9, 2023

JohnHerry commented Nov 10, 2023 • edited Loading

Jackiexiao commented Nov 28, 2022 •

edited

Loading

Jackiexiao commented Nov 28, 2022 •

edited

Loading

JohnHerry commented Nov 10, 2023 •

edited

Loading