
Your greedy decode implementation is wrong. #1

Closed
Duum opened this issue May 24, 2018 · 9 comments

Comments


Duum commented May 24, 2018

I think your transducer greedy decode implementation is wrong.
Here is my implementation in PyTorch.

HawkAaron (Owner) commented

It's based on the assumption that for each acoustic feature frame there is at most one corresponding label, so we just need to move one step up, then turn right.

I implemented the greedy decode several weeks ago the way you describe, but the PER on TIMIT was worse.

The decoding algorithm is still under development, so any comments are welcome.
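A minimal sketch of the constrained greedy decode described above: at each acoustic frame, emit at most one non-blank label ("one step up, then turn right"), then move to the next frame regardless. The `joint` function here is a hypothetical stand-in for a real RNN-T joint network; the toy version below just returns fixed scores for the demo.

```python
BLANK = 0

def greedy_decode_one_label_per_frame(enc_frames, joint):
    """enc_frames: sequence of encoder outputs.
    joint(frame, history) -> list of per-label scores (stand-in for the joint net)."""
    hyp = []
    for frame in enc_frames:
        scores = joint(frame, hyp)
        k = max(range(len(scores)), key=scores.__getitem__)
        if k != BLANK:       # emit at most one non-blank for this frame,
            hyp.append(k)    # then advance t either way
    return hyp

# toy joint: each "frame" already encodes the label it should score highest
toy_joint = lambda frame, hyp: [1.0 if i == frame else 0.0 for i in range(5)]
print(greedy_decode_one_label_per_frame([2, 0, 3, 0], toy_joint))  # → [2, 3]
```

Note the loop body runs exactly once per frame, so the output can never be longer than the input — which is the constraint Duum is objecting to.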


Duum commented May 24, 2018

But on my dataset, the CER of my greedy decode implementation is 15% lower than yours.

HawkAaron (Owner) commented

Which dataset did you use?

I'll check that again on TIMIT.


Duum commented May 24, 2018

In Alex Graves' paper, along a transducer path the frame advances only when the predicted label is null (blank); when the label is non-null, U increases while T stays and waits.
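A sketch of the decode described here, where t advances only on blank and a non-blank advances u while staying on the same frame. The `joint` callable is again a hypothetical stand-in for the joint network, and the per-frame emission cap is an assumption (not from the paper) to guard against a model that never predicts blank.

```python
BLANK = 0
MAX_SYMBOLS_PER_FRAME = 3  # assumption: safety cap, not from Graves' paper

def greedy_decode_graves(enc_frames, joint):
    """Graves-style greedy decode: stay on frame t until blank is predicted."""
    hyp = []
    for frame in enc_frames:
        emitted = 0
        while emitted < MAX_SYMBOLS_PER_FRAME:
            scores = joint(frame, hyp)
            k = max(range(len(scores)), key=scores.__getitem__)
            if k == BLANK:    # blank consumes the frame: t += 1
                break
            hyp.append(k)     # non-blank: u += 1, t stays
            emitted += 1
    return hyp

def make_toy_joint():
    # toy joint: each "frame" is a list of labels to emit in order, then blank
    state = {"frame": None, "i": 0}
    def joint(frame, hyp):
        if state["frame"] is not frame:
            state["frame"], state["i"] = frame, 0
        label = frame[state["i"]] if state["i"] < len(frame) else BLANK
        state["i"] += 1
        return [1.0 if k == label else 0.0 for k in range(5)]
    return joint

print(greedy_decode_graves([[2, 4], [], [3]], make_toy_joint()))  # → [2, 4, 3]
```

Unlike the one-label-per-frame variant, this decode can emit two labels from a single frame (the `[2, 4]` frame above), which matters exactly in the phone-boundary case Duum raises later in the thread.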


Duum commented May 24, 2018

On my private dataset, the RNN transducer is comparable to CTC only when the second method is used.

HawkAaron (Owner) commented

@Duum
Yes, when a non-null label is predicted, u moves forward one step and t stays. But as I said, at any frame t there will be at most one corresponding label, so if a non-null label is predicted at frame t, the decoder must move to the next frame after the next prediction. This is based on the physical meaning of speech features. According to the original transducer model definition, however, there can be more than one upward transition at any time t, which is meaningless in speech recognition.

I'll check your implementation and reply to you asap.

HawkAaron (Owner) commented

@Duum By the way, what do you mean by "the second method"? Have you ever tried beam search?


Duum commented May 24, 2018

The second method is my implementation of greedy decode; I haven't used beam search so far.
One frame corresponding to one label may be just your assumption. If a frame falls on the boundary between two phones, it is possible for two phones to occur in one frame.

HawkAaron (Owner) commented

@Duum Thanks for your comments, I'll check that.
