Add Speech Seq2Seq Training script #14792

patrickvonplaten · 2021-12-16T11:46:34Z

What does this PR do?

An explanation of this new training script is given in the README.md.
Two successful training runs can be seen here: https://huggingface.co/models?other=asr_seq2esq

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

src/transformers/tokenization_utils_base.py

patrickvonplaten · 2021-12-17T18:55:54Z

Hmm, the training is not that easy, it seems... I will run some more examples next week with Wav2Vec2 - BERT.

patrickvonplaten · 2021-12-22T13:00:03Z

Getting good results now for wav2vec2 - bart: https://huggingface.co/patrickvonplaten/wav2vec2-2-bart-base

patrickvonplaten · 2021-12-23T13:47:02Z

Results are good: https://huggingface.co/models?other=asr_seq2esq

Cleaning up the PR now; we can merge in 1-2 days.

sgugger

Nice new example! Two comments before merging:

The test is the second longest at 47s. If it's possible to reduce it a little bit, that would be great.
We already had a discussion about the fact that the speech models have a feature_extractor attribute that is distinct from the feature extractor that preprocesses the data, which is very confusing. As a result, we need another name for the freeze_feature_extractor methods and argument.
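
To make the naming clash concrete, here is a minimal, library-free sketch. Everything in it (`ToyParam`, `ToyEncoder`, `preprocess`, the parameter names) is hypothetical and only mimics the situation described above; it is not the code from this PR or from the library. It assumes that "freezing" means marking the parameters of the model's convolutional front end as non-trainable, while the data-side feature extractor is a plain preprocessing step with nothing to freeze.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToyParam:
    """Stand-in for a trainable tensor; it only tracks whether it receives gradients."""
    name: str
    requires_grad: bool = True


@dataclass
class ToyEncoder:
    """Hypothetical speech encoder: a convolutional front end plus transformer layers."""
    # The model-side submodule that happens to be called `feature_extractor`.
    feature_extractor: List[ToyParam] = field(
        default_factory=lambda: [ToyParam("conv.0.weight"), ToyParam("conv.1.weight")]
    )
    layers: List[ToyParam] = field(
        default_factory=lambda: [ToyParam("layer.0.weight"), ToyParam("layer.1.weight")]
    )

    def freeze_feature_extractor(self) -> None:
        # Only the convolutional front end is frozen; the other layers stay trainable.
        for param in self.feature_extractor:
            param.requires_grad = False

    def trainable_names(self) -> List[str]:
        return [p.name for p in self.feature_extractor + self.layers if p.requires_grad]


def preprocess(raw_audio: List[float]) -> Dict[str, List[float]]:
    """Stand-in for the data-side feature extractor: peak-normalizes raw audio."""
    # Fall back to 1.0 for empty or all-zero input to avoid dividing by zero.
    peak = max((abs(x) for x in raw_audio), default=1.0) or 1.0
    return {"input_values": [x / peak for x in raw_audio]}


encoder = ToyEncoder()
encoder.freeze_feature_extractor()        # affects model parameters only
batch = preprocess([0.5, -1.0, 0.25])     # unrelated to freezing
```

In this toy setup, calling `freeze_feature_extractor` changes nothing about `preprocess`, which is why sharing the name "feature extractor" between the two is easy to misread.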

examples/pytorch/speech-recognition/run_speech_recognition_seq2seq.py

src/transformers/models/speech_encoder_decoder/modeling_speech_encoder_decoder.py

patrickvonplaten · 2021-12-28T09:20:47Z

Merging this now. Very much agree on the naming issue @sgugger and thanks for reminding me again. Will open another PR for this later today.

* start
* add gradient checkpointing and feature extractor freezing
* Apply suggestions from code review
* up
* up
* up
* correct
* up
* more changes
* up
* up
* up
* remove rst

patrickvonplaten added 2 commits December 16, 2021 12:46

start

db9d3ec

add gradient checkpointing and feature extractor freezing

7357d89

patrickvonplaten mentioned this pull request Dec 16, 2021

Speech2Text2 training support #13860

Closed

patrickvonplaten commented Dec 16, 2021

Apply suggestions from code review

e51b514

patrickvonplaten added 6 commits December 21, 2021 12:45

up

7144fbd

up

102c68f

up

fbfe037

up

512aeed

correct

3bd0e8d

up

0eb72f6

more changes

6dc07c1

patrickvonplaten added 4 commits December 24, 2021 12:41

up

75e06e6

up

2b0ae9d

up

4060f70

up

5b84e58

patrickvonplaten changed the title ~~[WIP] Add Speech Seq2Seq Training script~~ Add Speech Seq2Seq Training script Dec 24, 2021

patrickvonplaten requested review from sgugger, anton-l and patil-suraj December 24, 2021 17:12

sgugger approved these changes Dec 27, 2021

remove rst

7949d0e

patrickvonplaten merged commit 1c12191 into huggingface:master Dec 28, 2021

patrickvonplaten deleted the add_speech_seq2seq branch December 28, 2021 09:20
