ONNX bug #1634

AlexandderGorodetski · 2024-05-21T08:41:09Z

Guys,

I got following during ONNX decoding

File "/opt/conda/lib/python3.8/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
return func(*args, **kwargs)
File "./zipformer/onnx_pretrained.py", line 393, in main
encoder_out, encoder_out_lens = model.run_encoder(features, feature_lengths)
File "./zipformer/onnx_pretrained.py", line 199, in run_encoder
out = self.encoder.run(
File "/opt/conda/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 188, in run
return self._sess.run(output_names, input_feed, run_options)
onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Non-zero status code returned while running If node. Name:'/encoder/0/encoder_pos/If' Status Message: Non-zero status code returned while running ScatterND node. Name:'/encoder/0/encoder_pos/ScatterND_2' Status Message: invalid indice found, indice = -1

Any help is appreciated.

Thanks,
AlexG.

csukuangfj · 2024-05-21T09:55:11Z

Please tell us the exact command you are using and also please tell us the duration of your test wave.

AlexandderGorodetski · 2024-05-21T10:01:36Z

The duration of test waves is 30sec.
I use following command for the decoding.

export PYTHONPATH='/workspace/inputs/alexg/asr/src/models/k2_2024/icefall/egs/tedlium3/ASR/zipformer:/workspace/inputs/alexg/asr/src/models/k2_2024/icefall:$PYTHONPATH'

export CUDA_VISIBLE_DEVICES="0"

python ./zipformer/onnx_pretrained.py
--encoder-model-filename zipformer/exp/encoder-epoch-50-avg-1.onnx
--decoder-model-filename zipformer/exp/decoder-epoch-50-avg-1.onnx
--joiner-model-filename zipformer/exp/joiner-epoch-50-avg-1.onnx
--tokens data/lang_bpe_500/tokens.txt
/workspace/inputs/alexg/asr/src/projects/en_eval/input/test_1.wav

csukuangfj · 2024-05-21T10:04:20Z

Could you use a shorter wave, e.g., less than 10 seconds or 20 seconds ?

AlexandderGorodetski · 2024-05-21T10:15:38Z

Great.
Well done.

For 10 seconds ONNX decoder works properly. Is it possible to add support to 30 sec or should I have to update my VAD so that it will not produce segments longer than 10sec?

csukuangfj · 2024-05-21T10:18:33Z

Is it possible to add support to 30 sec

Yes, absolutely.

Please change

icefall/egs/librispeech/ASR/zipformer/zipformer.py

Line 1405 in 0df406c

max_len: int = 1000,

to a larger value, re-export your model, and re-try.

AlexandderGorodetski · 2024-05-21T10:23:32Z

I changed the value from 1,000 to 10,000 and it did not help.

I found that maximal time that I can work with is 20sec. For 21sec I already have an error.

Maybe I should change this value during the training, maybe this value is saved somewhere in the model?

csukuangfj · 2024-05-21T10:29:02Z

There must be some constant value about the length of some positional encoding vector in the code. You need to find and change it.

AlexandderGorodetski · 2024-05-21T12:38:58Z

You are right. max_len currently represents time of 20 sec. It can be increased to 2,000 and then maximal time will be increase to 40 sec. But it is important to perform this change BEFORE exporting the ONNX model.

Thank you so much, this issue can be closed.

AlexandderGorodetski · 2024-05-21T12:40:09Z

Completed.

AlexandderGorodetski closed this as completed May 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ONNX bug #1634

ONNX bug #1634

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

ONNX bug #1634

ONNX bug #1634

Comments

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

csukuangfj commented May 21, 2024

AlexandderGorodetski commented May 21, 2024

AlexandderGorodetski commented May 21, 2024