Standalone Transducer v1.1 #5140

b-flo · 2023-04-25T08:11:54Z

This PR is an update for the standalone Transducer version. It stitches several things I'm using as a baseline for new features (that won't be included here). Mainly:

General: Fix various docstrings, documentation, variable names, etc + add LayerDrop.
Streaming: Rework audio processing + some minor fixes for performance.
Encoder: Add E-Branchformer (offline/streaming).
Decoder: Add MEGA (w/ chunking) + fix decoder interface.

Some results are given for Libri-100 in another PR.

P.S: It's missing unit and integration tests for MEGA, I'll add it later this week.

b-flo · 2023-06-01T13:52:02Z

@sw005320 Just to confirm, Is it required to use this single PR to update multiple files? I mean, easily, it can be split into 3/4 PRs. Makes a little difficult to check which function may fail. If there is no problem, I will try focusing on MEGA. But it may be ok.

It's not, sorry about that. I worked off the list on several things based on different branches and it's was becoming a mess to handle because of the dependencies. I prefer to stitch what's needed for the baseline, I can segment from here though.

Note that you for the new additions, it should be self-contained (i.e.: you only need to focus on specific files):

MEGA: decoder/mega_decoder.py, decoder/blocks/mega.py and decoder/modules/mega/*.py
RWKV: decoder/rwkv_decoder.py, decoder/blocks/rwkv.py and decoder/modules/rwkv/*.py
Ebranchformer: encoder/blocks/ebranchformer.py and encoder/modules/convolution.py

Again, sorry about that mess.

Fhrozen · 2023-06-05T09:54:24Z

LGTM, fix test errors and ready to merge.

b-flo · 2023-06-05T13:15:05Z

I added the the unit tests for RWKV and fixed some other ones. Integration tests are still missing but we can do that later (I have to redesign them, it's a mess).

Btw, I'm not sure what's the cause but I have the following error for a test on my side:

test/espnet2/bin/test_asr_transducer_inference.py:53: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
espnet2/tasks/abs_task.py:1054: in main
    cls.main_worker(args)
espnet2/tasks/abs_task.py:1147: in main_worker
    set_all_random_seed(args.seed)
espnet2/torch_utils/set_all_random_seed.py:10: in set_all_random_seed
    torch.random.manual_seed(seed)
tools/venv/lib/python3.9/site-packages/torch/random.py:40: in manual_seed
    torch.cuda.manual_seed_all(seed)
tools/venv/lib/python3.9/site-packages/torch/cuda/random.py:113: in manual_seed_all
    _lazy_call(cb, seed_all=True)
tools/venv/lib/python3.9/site-packages/torch/cuda/__init__.py:153: in _lazy_call
    callable()
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

    def cb():
        for i in range(device_count()):
            default_generator = torch.cuda.default_generators[i]
>           default_generator.manual_seed(seed)
E           RuntimeError: CUDA error: an illegal memory access was encountered

It wasn't the case before and I'm wondering if it's due to some master changes or if my env is borked somehow?
I traced the error but I honestly can't figure out the main issue.

b-flo · 2023-06-05T13:57:01Z

Hum, I didn't think about this one because I thought ninja was already a dependency (https://ninja-build.org).
@sw005320 Can the package be added to CI runs? I don't want to add an installation at an higher level, users can install the package in their env if needed. It's only used to load WKV kernel module during training.

Edit: I guess I can safely add ninja dependency to the warp-transducer installation. However, the cpp extension can only be loaded if a CUDA is available. How do you think I should handle that in regard to CI/tests? Should I filter them out or is there a GPU/CUDA executor available for some builds? @kamo-naoyuki Do you have an opinion/idea?

Edit2: For now, let's skip RWKV unit tests when no GPU is available.

pyf98 · 2023-06-07T04:37:09Z

The E-Branchformer part looks good to me.

sw005320 · 2023-06-08T05:36:36Z

@b-flo,

Edit2: For now, let's skip RWKV unit tests when no GPU is available.

Sounds good to me.
The refactoring part looks good to me and other individual functions do not seem to have an issue.
So, once it becomes a good shape, please go ahead and merge this PR.

b-flo · 2023-06-08T09:16:27Z

So, once it becomes a good shape, please go ahead and merge this PR.

I think it's in somewhat good shape but there are some things I'm not happy with:

How the decoder state(s) is defined/handled, I think it's inefficient. If you have some opinions/ideas for a (future) redesign, it would be appreciated!
RWKV is a bit unstable and training parameters dependence is high. It may be due to bugs I introduced but some regularization and changes are needed (e.g.: initialization) to make it behave properly. For now, I could stabilize it a bit by adding back the R/K/V/output projection biases.

Anyway, I am mostly interested in rough performance/cost for now. Given the role of the Transducer decoder in a "vanilla" setup, the architecture won't make a big difference IMO. I'm finishing the experiments w/ Libri-100 and we can merge the PR.

sw005320 · 2023-06-20T09:59:35Z

Thanks a lot, @b-flo!

b-flo added 30 commits November 23, 2022 17:34

fix chunk mask

aa0f1a0

remove right context + minor fixes

9bd640a

monkey patch chunk-by-chunk decoding before rework

028b2d5

rework v0.1

7940ec0

Merge branch 'master' into refactoring

edeacc9

bump to v0.2

12636be

update streaming tests

0c57964

remove old commented code

0e1691b

add back buffering

ce615fc

alternative v0.2

87acd89

Merge branch 'master' into refactoring

8ce6802

add back display_partial_hypotheses option + minor fixes

0606aa4

Merge branch 'master' into refactoring

bd3062b

remove unused code

6cc36ba

fix convinput subsampling tests

17b917b

Merge branch 'master' into refactoring

97cb471

remove math lib usage

7267f5b

improve doc and tutorial for left context/chunks

f7bbb4e

improve doc and tutorial for left context/chunks (2)

e4a4317

fix streaming test

8b1cd2c

Merge branch 'master' into refactoring

722457e

v0.2 stable

643e964

Merge branch 'master' into refactoring

4b0f196

add offline/online ebranchformer + tests

2229ef2

add layerdrop (w/ decay)

9756379

update doc

0c825d0

small fix for layerdrop

86b0577

apply new black

db4a344

Merge branch 'master' into refactoring

01893f8

add back dec proj bias + remove merge mod dropout

ee30ce9

b-flo added 5 commits June 1, 2023 14:30

Merge branch 'master' into refactoring

e3192f9

add missing init files

5651dad

add missing init files (2)

17a624c

add rescaling option during inference

a085a20

Merge branch 'master' into refactoring

65dc349

b-flo added 2 commits June 5, 2023 13:08

add missing guard conditions

81798b3

add rwkv tests + fixes

aa0045d

add ninja install through warp-transducer install

04cc557

mergify bot added the Installation label Jun 6, 2023

b-flo added 3 commits June 7, 2023 08:23

add skip for rwkv tests without gpu

8bc222c

remove unused import

a59c94b

improve/fix documentation for new additions

4f81c23

mergify bot added the README label Jun 7, 2023

b-flo added 2 commits June 13, 2023 13:45

fix typos (type, docs)

f1632c0

Merge branch 'master' into refactoring

683a291

b-flo mentioned this pull request Jun 15, 2023

Question regarding asr_inference_streaming.py #4807

Open

sw005320 merged commit 3297e10 into espnet:master Jun 20, 2023
24 of 25 checks passed

b-flo mentioned this pull request Jun 21, 2023

Small fixes for Transducer #5247

Merged

b-flo deleted the refactoring branch June 22, 2023 09:07

b-flo added this to the v.202307 milestone Jun 30, 2023

b-flo mentioned this pull request Jul 6, 2023

Add support for K2 pruned transducer loss #5268

Merged

2 tasks

b-flo mentioned this pull request Jul 22, 2023

remove unused file + small typo/style #5346

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standalone Transducer v1.1 #5140

Standalone Transducer v1.1 #5140

b-flo commented Apr 25, 2023 •

edited

b-flo commented Jun 1, 2023 •

edited

Fhrozen commented Jun 5, 2023

b-flo commented Jun 5, 2023 •

edited

b-flo commented Jun 5, 2023 •

edited

pyf98 commented Jun 7, 2023

sw005320 commented Jun 8, 2023

b-flo commented Jun 8, 2023

sw005320 commented Jun 20, 2023

Standalone Transducer v1.1 #5140

Standalone Transducer v1.1 #5140

Conversation

b-flo commented Apr 25, 2023 • edited

b-flo commented Jun 1, 2023 • edited

Fhrozen commented Jun 5, 2023

b-flo commented Jun 5, 2023 • edited

b-flo commented Jun 5, 2023 • edited

pyf98 commented Jun 7, 2023

sw005320 commented Jun 8, 2023

b-flo commented Jun 8, 2023

sw005320 commented Jun 20, 2023

b-flo commented Apr 25, 2023 •

edited

b-flo commented Jun 1, 2023 •

edited

b-flo commented Jun 5, 2023 •

edited

b-flo commented Jun 5, 2023 •

edited