New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ESPnet-Spk part 3 - inference every epoch using EER #5314
Conversation
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
Codecov Report
@@ Coverage Diff @@
## master #5314 +/- ##
==========================================
- Coverage 76.21% 76.13% -0.09%
==========================================
Files 669 672 +3
Lines 59566 59859 +293
==========================================
+ Hits 45396 45571 +175
- Misses 14170 14288 +118
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
lines = f.readlines() | ||
|
||
joint_key = lines[0].strip().split(" ")[0] + "*" + lines[1].strip().split(" ")[0] | ||
with open(os.path.join(sys.argv[2], "trial.scp"), "w") as f: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe it's better to rename trial.scp
to trial1.scp
to make them more consistent.
Can you fix the conflicts due to the merge of your previous PRs? |
This pull request is now in conflict :( |
Just resolved all conflicts! Now same to my local branch which passes CI tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great effort! some minor comments.
@kan-bayashi , the number of parameter of ESPnet-SPK's ci model was 0.1M and I further reduced it to 0.01M. Hope this is small enough. Let me know if you need further alterations. |
Co-authored-by: Wangyou Zhang <C0me_On@163.com>
Co-authored-by: Wangyou Zhang <C0me_On@163.com>
for more information, see https://pre-commit.ci
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is more for a question.
Is there a way to keep the same random seed even in the multiple worker environment, so that we could reproduce the same augmented data?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is more for a question. Is there a way to keep the same random seed even in the multiple worker environment, so that we could reproduce the same augmented data?
I think it is handled for the sequence iterator by the worker_init_fn
:
https://github.com/espnet/espnet/blob/master/espnet2/iterators/sequence_iter_factory.py#L13-L17
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see, thx!
This PR is a follow-up PR on #5258.
We only had training available, whereas through this PR, we finish the ESPnet-SPK basic pipeline.
Now, EER is calculated every epoch.