
Test and Travis-CI #20

Merged
merged 18 commits into espnet:master on Dec 22, 2017

Conversation

ShigekiKarita (Member)

I'm working on Travis-CI #18

@ShigekiKarita ShigekiKarita mentioned this pull request Dec 21, 2017
@ShigekiKarita ShigekiKarita changed the title [WIP] Test and Travis-CI Test and Travis-CI Dec 21, 2017
@ShigekiKarita (Member Author)

@sw005320 Please merge this if it looks OK.

@ShigekiKarita (Member Author)

Just a moment, I found Travis-CI is failing. https://travis-ci.org/ShigekiKarita/espnet

@ShigekiKarita (Member Author)

ShigekiKarita commented Dec 21, 2017

Now Travis is OK. Many tests are skipped because torch is not installed, but we can wait for that (pytorch/pytorch#4178 (comment)).

acc = 0
pad_pred = y_all.data.view(pad_target.size(0), pad_target.size(1), y_all.size(1)).max(2)[1]
mask = pad_target.data != ignore_label
return torch.sum(pad_pred.masked_select(mask) == pad_target.data.masked_select(mask)) / torch.sum(mask)
Member

The final calculation torch.sum(pad_pred.masked_select(mask) == pad_target.data.masked_select(mask)) / torch.sum(mask) is int / int, so in Python 2, acc becomes 0 (I confirmed this).

We should cast to float.
I will show my modified version.

import torch

def th_accuracy(y_all, pad_target, ignore_label):
    # reshape flattened outputs to (batch, time, odim), argmax over classes
    pad_pred = y_all.data.view(pad_target.size(0), pad_target.size(1), y_all.size(1)).max(2)[1]
    mask = pad_target.data != ignore_label
    numerator = torch.sum(pad_pred.masked_select(mask) == pad_target.data.masked_select(mask))
    denominator = torch.sum(mask)
    # cast to float: in Python 2, int / int truncates to 0
    return float(numerator) / float(denominator)
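For illustration, a minimal standalone sketch (no torch; the counts are hypothetical stand-ins for the torch.sum results) of the Python 2 pitfall this cast fixes:

```python
# In Python 2, `/` on two ints truncates (3 / 4 == 0), and torch.sum on
# the comparison result yields a plain int, so acc silently became 0.
numerator = 3        # e.g. number of correct unmasked predictions
denominator = 4      # e.g. number of unmasked target positions
truncated = numerator // denominator           # what Python 2's `/` did
fixed = float(numerator) / float(denominator)  # the proposed cast
print(truncated, fixed)  # -> 0 0.75
```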

Member Author

Thanks!

-for i in range(len(ys)):
-    acc += torch.sum(pred_pad[i, :ys[i].size(0)] == ys[i].data)
-acc /= sum(map(len, ys))
+acc = th_accuracy(y_all, pad_ys_out, ignore_label=self.ignore_id)
logging.info('att loss:' + str(self.loss.data))

# show predicted character sequence for debug
Member

In the case of Chainer, the ignore label is -1.
Therefore, we should change it as follows.

# now 
                 idx_hat = np.argmax(y_hat_[y_true_ != -1], axis=1)
                 idx_true = y_true_[y_true_ != -1]
# proposed
                 idx_hat = np.argmax(y_hat_[y_true_ != self.ignore_id], axis=1)
                 idx_true = y_true_[y_true_ != self.ignore_id]
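A standalone sketch of the proposed masking, with hypothetical toy arrays (one-hot fake posteriors) in place of the real y_hat_ / y_true_:

```python
import numpy as np

ignore_id = -1                                        # Chainer's padding label
y_true_ = np.array([2, 5, 1, ignore_id, ignore_id])   # padded target labels
y_hat_ = np.eye(7)[[2, 5, 1, 0, 0]]                   # fake (T x odim) posteriors
# keep only the positions whose label is not the ignore id
idx_hat = np.argmax(y_hat_[y_true_ != ignore_id], axis=1)
idx_true = y_true_[y_true_ != ignore_id]
print(idx_hat.tolist(), idx_true.tolist())  # -> [2, 5, 1] [2, 5, 1]
```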

Member Author

OK. I think you are talking about the "show predicted character sequence for debug" part in Decoder.forward.

Member

Yes, that's right.

Reply

Sorry, but I noticed that in the current version self.ignore_id = 0. Is 0 for CTC?

Contributor

Are you talking about this?

self.ignore_id = 0 # NOTE: 0 for CTC?

This may have some problems.
@ShigekiKarita, can you tell me why you set 0 here?

Member

In the current implementation, the character indices for the decoder run from 1 to odim - 1.
Therefore, 0 is not used in the decoder, so using 0 as the ignore id might have no effect, I think.

Member

But to make it clearer, I agree with changing it to -1.

Reply

@kan-bayashi Many thanks for your reply!

@kan-bayashi (Member)

@ShigekiKarita Hi Shigeki. I added some comments.
Could you reflect them?

self.h_length = self.enc_h.shape[1]
# utt x frame x att_dim
-self.pre_compute_enc_h = F.tanh(linear_tensor(self.mlp_enc, self.enc_h))
+self.pre_compute_enc_h = linear_tensor(self.mlp_enc, self.enc_h)
Member

Do we need to apply tanh here?

Member Author

Hmm, I'm not sure why I deleted it. However, I will follow the Chainer reference implementation.

Member Author

In eq. 9 of https://arxiv.org/pdf/1609.06773.pdf, pre_compute_h corresponds to V h_l, which does not have tanh. Anyway, it might not be a big deal, because we cannot see a big difference in #9 (comment).

@ShigekiKarita (Member Author)

Thank you for many comments.

@@ -469,7 +469,7 @@ def forward(self, enc_hs_pad, enc_hs_len, dec_z, att_prev, scaling=2.0):
     self.enc_h = enc_hs_pad  # utt x frame x hdim
     self.h_length = self.enc_h.shape[1]
     # utt x frame x att_dim
-    self.pre_compute_enc_h = linear_tensor(self.mlp_enc, self.enc_h)
+    self.pre_compute_enc_h = torch.tanh(linear_tensor(self.mlp_enc, self.enc_h))
Member

The tanh op is performed only for AttDot in Chainer.
I think it is not needed for AttLoc.
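For reference, a rough numpy sketch (hypothetical shapes and variable names) of why the two attention types differ here: AttDot scores with a plain dot product, so the Chainer reference squashes both projections with tanh first, while AttLoc already passes the summed projections through a tanh inside the scoring MLP, so an extra tanh on pre_compute_enc_h is unnecessary:

```python
import numpy as np

rng = np.random.default_rng(0)
T, att_dim = 4, 3
enc_proj = rng.standard_normal((T, att_dim))  # linear_tensor(mlp_enc, enc_h)
dec_proj = rng.standard_normal(att_dim)       # projected decoder state

# AttDot-style: score is a plain dot product, so tanh both sides first
scores_dot = np.tanh(enc_proj) @ np.tanh(dec_proj)

# AttLoc-style: tanh is applied inside the scoring MLP itself,
# so enc_proj needs no tanh of its own
g = rng.standard_normal(att_dim)              # scoring vector
scores_loc = np.tanh(enc_proj + dec_proj) @ g

print(scores_dot.shape, scores_loc.shape)     # one score per frame
```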

Member Author

Oh, that's right.

@ShigekiKarita (Member Author)

Does everything look good? @kan-bayashi @sw005320

@sw005320 (Contributor)

Can you include pytorch and the others in the all target of the Makefile?

@ShigekiKarita (Member Author)

done

@sw005320 sw005320 merged commit 0c6cbdc into espnet:master Dec 22, 2017
@sw005320 sw005320 mentioned this pull request Jun 15, 2018
zhichaowang pushed a commit to zhichaowang/espnet that referenced this pull request Jan 20, 2021
mergify bot pushed a commit that referenced this pull request Jul 21, 2023
tjysdsg referenced this pull request in tjysdsg/espnet Sep 8, 2023
tjysdsg referenced this pull request in tjysdsg/espnet Dec 13, 2023
Full-size training and wav2vec2 + mbart training
4 participants