-
Notifications
You must be signed in to change notification settings - Fork 26.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Speech Seq2Seq Training script #14792
Add Speech Seq2Seq Training script #14792
Conversation
Hmmm, not that easy the training it seems...will run some more examples next week with Wav2Vec2 - BERT |
Getting good results now for wav2vec2 - bart: https://huggingface.co/patrickvonplaten/wav2vec2-2-bart-base |
Results are good: https://huggingface.co/models?other=asr_seq2esq Cleaning up the PR and we can merge in 1,2 days |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice new example! Two comments before merging:
- the test is the second longest at 47s. If it's possible to reduce it a little bit, that would be great.
- We already had a discussion on the fact that the speech models have a
feature_extractor
attribute that is distinct from the feature extractor that preprocesses the data, which is very confusing. We need another name for thefreeze_feature_extractor
methods and argument as a result.
examples/pytorch/speech-recognition/run_speech_recognition_seq2seq.py
Outdated
Show resolved
Hide resolved
examples/pytorch/speech-recognition/run_speech_recognition_seq2seq.py
Outdated
Show resolved
Hide resolved
examples/pytorch/speech-recognition/run_speech_recognition_seq2seq.py
Outdated
Show resolved
Hide resolved
examples/pytorch/speech-recognition/run_speech_recognition_seq2seq.py
Outdated
Show resolved
Hide resolved
src/transformers/models/speech_encoder_decoder/modeling_speech_encoder_decoder.py
Show resolved
Hide resolved
Merging this now. Very much agree on the naming issue @sgugger and thanks for reminding me again. Will open another PR for this later today. |
* start * add gradient checkpointing and feature extractor freezing * Apply suggestions from code review * up * up * up * correct * up * more changes * up * up * up * remove rst
* start * add gradient checkpointing and feature extractor freezing * Apply suggestions from code review * up * up * up * correct * up * more changes * up * up * up * remove rst
What does this PR do?
An explanation of this new training script is given on the README.md.
Two successful training runs can be seen here: https://huggingface.co/models?other=asr_seq2esq
Before submitting
Pull Request section?
to it if that's the case.
documentation guidelines, and
here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.