ESPnet-SPk: major update #5408
Unify frontend
test_python seems to hit timeout errors randomly (maybe it would be better to increase the timeout threshold?). Currently I'm just closing and reopening.
@Emrys365, we still observe an issue in https://github.com/espnet/espnet/actions/runs/6738909252/job/18319405795?pr=5408#step:9:694
These errors seem to be coming from files/unit tests that weren't edited in this PR. Maybe the update here (4c96187) is the reason?
Interesting. I thought fixing the input in my PR would avoid this issue. I will look at it later.
What?
This PR combines several earlier in-progress PRs and adds new functionality. The PRs being combined are:
ESPnet-spk: add more model architectures #5385
ESPnet-SPK: add inference #5398
This PR newly introduces:
Speed perturbation as speaker augmentation
Frontend design modification: the model can now be trained with frozen or unfrozen SSL front-ends through S3PRL
Additional speaker models: ECAPA-TDNN and MFA-Conformer
Ongoing research on representation learning
A separate inference stage for evaluating models with EER and minDCF
(updated) Supported configurations
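For readers unfamiliar with the two metrics reported by the inference stage, here is a minimal sketch of how EER and minDCF can be computed from trial scores. It is not the code added in this PR: the function names are hypothetical, and the cost parameters (`p_target=0.05`, `c_miss=1.0`, `c_fa=1.0`) are assumed defaults chosen for illustration. It uses a simple quadratic-time threshold sweep for readability.

```python
def error_rates(scores, labels):
    """Return (threshold, FAR, FRR) for every candidate threshold.

    scores: similarity scores, higher means "same speaker".
    labels: 1 for target (same-speaker) trials, 0 for non-target trials.
    """
    n_tgt = sum(labels)
    n_non = len(labels) - n_tgt
    if n_tgt == 0 or n_non == 0:
        raise ValueError("need at least one target and one non-target trial")
    # A trial is accepted when score >= threshold; +inf rejects everything.
    thresholds = sorted(set(scores)) + [float("inf")]
    rates = []
    for t in thresholds:
        false_accepts = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= t)
        misses = sum(1 for s, y in zip(scores, labels) if y == 1 and s < t)
        rates.append((t, false_accepts / n_non, misses / n_tgt))
    return rates


def compute_eer(scores, labels):
    """Equal error rate: the operating point where FAR and FRR are closest."""
    _, far, frr = min(error_rates(scores, labels), key=lambda r: abs(r[1] - r[2]))
    return (far + frr) / 2


def compute_min_dcf(scores, labels, p_target=0.05, c_miss=1.0, c_fa=1.0):
    """Minimum normalized detection cost (assumed, illustrative cost parameters)."""
    dcf = min(
        c_miss * p_target * frr + c_fa * (1 - p_target) * far
        for _, far, frr in error_rates(scores, labels)
    )
    # Normalize by the cost of the best trivial system (accept all or reject all).
    return dcf / min(c_miss * p_target, c_fa * (1 - p_target))


if __name__ == "__main__":
    scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
    labels = [1, 1, 0, 1, 0, 0]
    print("EER:", compute_eer(scores, labels))
    print("minDCF:", compute_min_dcf(scores, labels))
```

Both metrics are lower-is-better. EER summarizes a single balanced operating point, while minDCF weights misses and false accepts according to an assumed target prior.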
Why?
We have been adding several new features.
Several ongoing PRs conflict with each other in places, so they are combined here.
See also