Support LoRA based large model finetuning. #5400

pengchengguo · 2023-08-09T09:23:30Z

What?

Support LoRA-based large model finetuning and also provide results of LoRA-based Whisper finetuning on Aishell corpus.

By finetuning the Whisper large model, we are able to achieve superior results of 2.5/2.7 CERs on dev/test sets, respectively.

Codecov Report

Merging #5400 (97f855c) into master (71dc9a3) will decrease coverage by 6.82%.
Report is 187 commits behind head on master.
The diff coverage is 86.61%.

@@            Coverage Diff             @@
##           master    #5400      +/-   ##
==========================================
- Coverage   77.14%   70.33%   -6.82%     
==========================================
  Files         684      707      +23     
  Lines       62713    64932    +2219     
==========================================
- Hits        48383    45673    -2710     
- Misses      14330    19259    +4929

Flag	Coverage Δ
test_configuration_espnet2	`∅ <ø> (∅)`
test_integration_espnet1	`?`
test_integration_espnet2	`48.76% <53.27%> (-0.31%)`	⬇️
test_python_espnet1	`19.19% <1.57%> (-0.76%)`	⬇️
test_python_espnet2	`51.30% <78.74%> (-1.01%)`	⬇️
test_utils	`23.10% <66.66%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files	Coverage Δ
espnet2/bin/whisper_export_vocabulary.py	`93.22% <100.00%> (+0.36%)`	⬆️
espnet2/text/build_tokenizer.py	`78.37% <ø> (ø)`
espnet2/text/whisper_token_id_converter.py	`89.47% <100.00%> (+3.36%)`	⬆️
espnet2/text/whisper_tokenizer.py	`89.18% <100.00%> (+0.95%)`	⬆️
espnet2/train/preprocessor.py	`77.31% <100.00%> (-0.12%)`	⬇️
espnet/nets/e2e_mt_common.py	`63.93% <66.66%> (-0.48%)`	⬇️
espnet2/bin/asr_inference.py	`87.50% <83.33%> (+0.51%)`	⬆️
espnet2/tasks/st.py	`86.66% <88.88%> (-1.66%)`	⬇️
espnet2/layers/create_lora_adapter.py	`95.74% <95.74%> (ø)`
espnet2/tasks/abs_task.py	`76.84% <83.33%> (+0.04%)`	⬆️
... and 3 more

... and 102 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

sw005320 · 2023-08-09T11:41:27Z

Thanks, @pengchengguo!
The results are very impressive.
Can you also add a test so that this implementation becomes more solid?

pengchengguo · 2023-08-09T11:45:45Z

Sure, the HF model link will also be updated later.

sw005320 · 2023-08-30T18:50:55Z

@pyf98, can you review this PR?

@pengchengguo, can you add a test?

pyf98 · 2023-09-17T17:56:14Z

Thanks! It looks good to me!

pyf98 · 2023-09-17T17:57:13Z

Just one question:
Does it support LoRA for all ESPnet models? Or just Whisper? I guess it should support all.

pengchengguo · 2023-09-18T01:14:38Z

Yes, it supports all.

pengchengguo · 2023-09-18T11:51:03Z

I am changing this PR to "work in progress" because I also want to add Whisper fine-tuning to ST. The ST part is being refactored now and will be finished soon.

mergify · 2023-09-23T12:00:54Z

This pull request is now in conflict :(

for more information, see https://pre-commit.ci

sw005320 · 2023-09-30T11:52:52Z

Can you remove “WIP” from the PR title?

pengchengguo · 2023-09-30T12:21:57Z

Sure, now I am trying to increase the codecov, it seems the CI test file misses some functions.

sw005320 · 2023-10-10T12:48:42Z

We seem to have an issue for CI. Can you check it?

pengchengguo · 2023-10-11T12:20:36Z

I fixed the CI issues and ST errors as I mentioned before. Now, Whisper fine-tuned ST shows better results compared with E-Branchformer (60.9 vs 53 BLEU scores on Fishercallhome dev sets).
I need time to decode all test sets and upload the pre-trained model, so I want to do it in another PR.
For this PR, I think it's okay to be merged.

sw005320 · 2023-10-11T12:51:11Z

Thanks, @pengchengguo!

Support LoRA based large model finetuning.

9b9b278

mergify bot added ESPnet2 README labels Aug 9, 2023

[pre-commit.ci] auto fixes from pre-commit.com hooks

033941a

for more information, see https://pre-commit.ci

sw005320 requested a review from pyf98 August 9, 2023 11:14

sw005320 added New Features ASR Automatic speech recogntion labels Aug 9, 2023

sw005320 added this to the v.202312 milestone Aug 9, 2023

pengchengguo added 3 commits September 15, 2023 16:47

Merge branch 'master' into whisper

bcb0a53

Add CI test file.

bf94adf

Fix CI test file bugs.

40ddcce

pengchengguo force-pushed the whisper branch from a1b1f7a to 40ddcce Compare September 17, 2023 13:25

pengchengguo changed the title ~~Support LoRA based large model finetuning.~~ [WIP] Support LoRA based large model finetuning. Sep 18, 2023

Support Whisper finetune for ST task.

0e6bc6a

pengchengguo force-pushed the whisper branch from 6a9bed5 to 0e6bc6a Compare September 21, 2023 08:59

mergify bot added the ESPnet1 label Sep 21, 2023

mergify bot added the conflicts label Sep 23, 2023

Merge from master and solve conflicts.

07f7e77

pengchengguo force-pushed the whisper branch from af33626 to 07f7e77 Compare September 24, 2023 07:56

mergify bot removed the conflicts label Sep 24, 2023

[pre-commit.ci] auto fixes from pre-commit.com hooks

8e711c7

for more information, see https://pre-commit.ci

pengchengguo added 3 commits September 24, 2023 21:43

Fix CI test errors.

6f882f8

Fix CI test error.

cc55e58

Merge from master branch and solve conflicts.

d1847a8

pengchengguo changed the title ~~[WIP] Support LoRA based large model finetuning.~~ Support LoRA based large model finetuning. Sep 30, 2023

mergify bot added Installation CI Travis, Circle CI, etc labels Sep 30, 2023

pengchengguo force-pushed the whisper branch from 87a4568 to d1847a8 Compare September 30, 2023 14:53

pengchengguo added 2 commits October 10, 2023 10:39

Merge branch 'master' into whisper

121a77d

Add LoRA installation to Makefile.

df4536b

pengchengguo force-pushed the whisper branch from 2b063ab to df4536b Compare October 10, 2023 03:16

Fix ST inference and CI test errors.

97f855c

pengchengguo force-pushed the whisper branch from 2d3ef4f to 97f855c Compare October 11, 2023 08:59

sw005320 merged commit 390589a into espnet:master Oct 11, 2023
25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support LoRA based large model finetuning. #5400

Support LoRA based large model finetuning. #5400

pengchengguo commented Aug 9, 2023

codecov bot commented Aug 9, 2023 •

edited

sw005320 commented Aug 9, 2023

pengchengguo commented Aug 9, 2023

sw005320 commented Aug 30, 2023

pyf98 commented Sep 17, 2023

pyf98 commented Sep 17, 2023 •

edited

pengchengguo commented Sep 18, 2023

pengchengguo commented Sep 18, 2023

mergify bot commented Sep 23, 2023

sw005320 commented Sep 30, 2023

pengchengguo commented Sep 30, 2023

sw005320 commented Oct 10, 2023

pengchengguo commented Oct 11, 2023

sw005320 commented Oct 11, 2023

Support LoRA based large model finetuning. #5400

Support LoRA based large model finetuning. #5400

Conversation

pengchengguo commented Aug 9, 2023

What?

See also

codecov bot commented Aug 9, 2023 • edited

Codecov Report

sw005320 commented Aug 9, 2023

pengchengguo commented Aug 9, 2023

sw005320 commented Aug 30, 2023

pyf98 commented Sep 17, 2023

pyf98 commented Sep 17, 2023 • edited

pengchengguo commented Sep 18, 2023

pengchengguo commented Sep 18, 2023

mergify bot commented Sep 23, 2023

sw005320 commented Sep 30, 2023

pengchengguo commented Sep 30, 2023

sw005320 commented Oct 10, 2023

pengchengguo commented Oct 11, 2023

sw005320 commented Oct 11, 2023

codecov bot commented Aug 9, 2023 •

edited

pyf98 commented Sep 17, 2023 •

edited