added end detection #46

sw005320 · 2017-12-27T21:48:05Z

Added end detection described in Eq. (50) of S. Watanabe et al "Hybrid CTC/Attention Architecture for End-to-End Speech Recognition,"
If we set maxlenratio 0.0, this automatic detection is triggered.
With WSJ and Voxforge experiments, it is confirmed that the detection works well.

ShigekiKarita · 2017-12-28T02:20:27Z

src/nets/e2e_asr_attctc_th.py

@@ -697,6 +697,27 @@ def recognize(self, h, recog_args):

        return y_seq

+    # end detection desribed in Eq. (50) of
+    # S. Watanabe et al "Hybrid CTC/Attention Architecture for End-to-End Speech Recognition"
+    def end_detect(self, ended_hyps, i, M=3, D_end=np.log(1 * np.exp(-10))):


I think this function should be placed outside e2e_asr_attctc[_th].py and be shared from them like from e2e_common import end_detection. Because this function seems to be static (not use self) and be free from numpy/torch operations.

ShigekiKarita · 2017-12-28T02:27:16Z

src/nets/e2e_asr_attctc.py

@@ -570,8 +596,11 @@ def recognize_beam(self, h, recog_args, char_list):

        # preprate sos
        y = self.xp.full(1, self.sos, 'i')
-        # maxlen >= 1
-        maxlen = max(1, int(recog_args.maxlenratio * h.shape[0]))
+        if recog_args.maxlenratio == 0:


please add this specialized behavior in argparse help of asr_recog[_th].py . If you think it is worth being default, set it to 0.0 (now default is 0.5)

and I also think argparse.ArgumentParser should be shared like from e2e_args import asr_recog_parser as I commented on def end_detect because chainer/pytorch tools should be consistent.

I'll do it for end_detect, but let me make "argparse part" common later.
Actually, asr_recog.py and asr_recog_th.py are almost same, and I'm thinking of using the common asr_recog.py for both, which is more simple for me, but I'm not sure it would be applicable to asr_train.py and asr_train_th.py, and it needs some more consideration.

OK. It may be too much to this PR.

ShigekiKarita · 2017-12-28T02:31:47Z

src/nets/e2e_asr_attctc.py

@@ -546,6 +551,27 @@ def recognize(self, h, recog_args):

        return y_seq

+    # end detection desribed in Eq. (50) of
+    # S. Watanabe et al "Hybrid CTC/Attention Architecture for End-to-End Speech Recognition"
+    def end_detect(self, ended_hyps, i, M=3, D_end=np.log(1 * np.exp(-10))):


ditto (see e2e_asr_attctc_th.py)

…enratio behaviour in the end detect case in argparse

sw005320 · 2017-12-28T03:39:59Z

@ShigekiKarita Done!

ShigekiKarita

LGTM

added endpoint detection

59c6e48

sw005320 requested a review from ShigekiKarita December 27, 2017 21:48

sw005320 mentioned this pull request Dec 27, 2017

Toward a stable version #35

Closed

14 tasks

sw005320 changed the title ~~added endpoint detection~~ added end detection Dec 27, 2017

fix shape->size

5784f13

ShigekiKarita reviewed Dec 28, 2017

View reviewed changes

sw005320 added 3 commits December 27, 2017 22:06

move end_detect to e2e_asr_common, and also add a explanation of maxl…

eb1409e

…enratio behaviour in the end detect case in argparse

bug fix for travis check

7d9271f

fix for HXXX

c368b80

ShigekiKarita approved these changes Dec 28, 2017

View reviewed changes

ShigekiKarita merged commit 8ac4f22 into espnet:master Dec 28, 2017

ShigekiKarita mentioned this pull request Sep 10, 2020

CommonPreprocessor_multi not defined #2450

Closed

himanshucodz55 mentioned this pull request Jul 24, 2022

RuntimeError: [1] is setting up NCCL communicator and retreiving ncclUniqueId from [0] via c10d key-value store by key '0', but store->get('0') got error: Timeout waiting for key: default_pg/0/0 after 1800000 ms #4531

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added end detection #46

added end detection #46

sw005320 commented Dec 27, 2017 •

edited

ShigekiKarita Dec 28, 2017 •

edited

ShigekiKarita Dec 28, 2017

ShigekiKarita Dec 28, 2017 •

edited

sw005320 Dec 28, 2017

ShigekiKarita Dec 28, 2017

ShigekiKarita Dec 28, 2017

sw005320 commented Dec 28, 2017

ShigekiKarita left a comment

added end detection #46

added end detection #46

Conversation

sw005320 commented Dec 27, 2017 • edited

ShigekiKarita Dec 28, 2017 • edited

Choose a reason for hiding this comment

ShigekiKarita Dec 28, 2017

Choose a reason for hiding this comment

ShigekiKarita Dec 28, 2017 • edited

Choose a reason for hiding this comment

sw005320 Dec 28, 2017

Choose a reason for hiding this comment

ShigekiKarita Dec 28, 2017

Choose a reason for hiding this comment

ShigekiKarita Dec 28, 2017

Choose a reason for hiding this comment

sw005320 commented Dec 28, 2017

ShigekiKarita left a comment

Choose a reason for hiding this comment

sw005320 commented Dec 27, 2017 •

edited

ShigekiKarita Dec 28, 2017 •

edited

ShigekiKarita Dec 28, 2017 •

edited