Does mdeval compute DER taking overlapped speech into account? #62
Hi @AntoineBlanot! I had to check that first.
This option is, however, currently not set by our wrapper. Even if the option is not set, I found the following things:
meeteval.der.md_eval_22(
    meeteval.io.asseglst([
        {'speaker': 'A', 'start_time': 0, 'end_time': 7, 'session_id': 'X', 'words': ''},
        {'speaker': 'A', 'start_time': 4, 'end_time': 10, 'session_id': 'X', 'words': ''},
    ]),
    meeteval.io.asseglst([{'speaker': 'A', 'start_time': 0, 'end_time': 10, 'session_id': 'X', 'words': ''}]),
)
# WARNING: speaker A speaking more than once at time 4
# WARNING: speaker A speaking more than once at time 4
# WARNING: speaker A speaking more than once at time 4
# WARNING: speaker A speaking more than once at time 4
# {'X': DiaErrorRate(error_rate=Decimal('0.00'), scored_speaker_time=Decimal('10.000000'), missed_speaker_time=Decimal('0.000000'), falarm_speaker_time=Decimal('0.000000'), speaker_error_time=Decimal('0.000000'))}
meeteval.der.md_eval_22(
    meeteval.io.asseglst([
        {'speaker': 'A', 'start_time': 0, 'end_time': 10, 'session_id': 'X', 'words': ''},
        {'speaker': 'B', 'start_time': 0, 'end_time': 10, 'session_id': 'X', 'words': ''},
    ]),
    meeteval.io.asseglst([
        {'speaker': 'A', 'start_time': 0, 'end_time': 10, 'session_id': 'X', 'words': ''},
        {'speaker': 'B', 'start_time': 0, 'end_time': 10, 'session_id': 'X', 'words': ''},
    ]),
)
# {'X': DiaErrorRate(error_rate=Decimal('0.00'), scored_speaker_time=Decimal('20.000000'), missed_speaker_time=Decimal('0.000000'), falarm_speaker_time=Decimal('0.000000'), speaker_error_time=Decimal('0.000000'))}

@boeddeker Do you know anything about the |
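To make the `scored_speaker_time` values in the two outputs above easier to read (10 for one speaker with self-overlapping segments, 20 for two speakers talking simultaneously for 10 seconds), here is a small pure-Python sketch. It is not md-eval's implementation and does not call MeetEval; the function names `scored_speaker_time` and `der` are hypothetical. It assumes that segments of the same speaker are merged before counting and that the error rate is the usual ratio of missed, false-alarm, and speaker-error time to scored speaker time, which is consistent with the numbers shown above.

```python
from collections import defaultdict


def scored_speaker_time(segments):
    """Sum speech time per speaker, merging each speaker's own overlapping segments.

    Assumption (illustration only): overlap between *different* speakers is
    counted once per speaker, overlap within one speaker is counted once.
    """
    by_speaker = defaultdict(list)
    for seg in segments:
        by_speaker[seg['speaker']].append((seg['start_time'], seg['end_time']))

    total = 0
    for intervals in by_speaker.values():
        intervals.sort()
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start <= cur_end:
                # Same speaker overlaps with itself: extend the current interval.
                cur_end = max(cur_end, end)
            else:
                total += cur_end - cur_start
                cur_start, cur_end = start, end
        total += cur_end - cur_start
    return total


def der(missed, falarm, speaker_error, scored):
    """Standard DER ratio: total error time divided by scored speaker time."""
    return (missed + falarm + speaker_error) / scored


same_speaker = [
    {'speaker': 'A', 'start_time': 0, 'end_time': 7},
    {'speaker': 'A', 'start_time': 4, 'end_time': 10},
]
two_speakers = [
    {'speaker': 'A', 'start_time': 0, 'end_time': 10},
    {'speaker': 'B', 'start_time': 0, 'end_time': 10},
]
print(scored_speaker_time(same_speaker))   # one speaker, merged
print(scored_speaker_time(two_speakers))   # overlap counted for both speakers
```

Under these assumptions, simultaneous speech from two speakers contributes to the denominator once per speaker, which matches the second output and suggests that overlapped regions are scored by default.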
We looked through MeetEval does currently not set |
@thequilo Thank you very much for your responses! Your comments were very clear, and I appreciate your insights. Being able to deactivate scoring would help, yes, as it can indicate whether a model is good on non-overlapped regions or not. |
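As a rough illustration of what "non-overlapped regions" means here, the following sketch finds the time intervals in which exactly one speaker is active. It is only an assumption about how such regions could be derived from reference segments, not how md-eval or MeetEval does it; `single_speaker_regions` is a hypothetical helper, and it assumes that each speaker's own segments do not overlap each other.

```python
def single_speaker_regions(segments):
    """Return (start, end) intervals in which exactly one speaker is active.

    Assumption: segments of the same speaker do not overlap each other.
    """
    events = []
    for seg in segments:
        events.append((seg['start_time'], 1))    # a speaker starts
        events.append((seg['end_time'], -1))     # a speaker stops
    events.sort()  # at equal times, stops (-1) sort before starts (+1)

    regions = []
    active = 0
    prev_time = None
    for time, delta in events:
        # The span [prev_time, time) had `active` speakers talking.
        if active == 1 and prev_time is not None and time > prev_time:
            if regions and regions[-1][1] == prev_time:
                # Adjacent to the previous region: extend it.
                regions[-1] = (regions[-1][0], time)
            else:
                regions.append((prev_time, time))
        active += delta
        prev_time = time
    return regions


reference = [
    {'speaker': 'A', 'start_time': 0, 'end_time': 10},
    {'speaker': 'B', 'start_time': 4, 'end_time': 6},
]
print(single_speaker_regions(reference))
```

Restricting scoring to such regions would exclude the interval from 4 to 6 in this example, where A and B speak at the same time.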
Sure, it's just an option that has to be passed to md-eval. The naming of such an option is not that easy though. I currently prefer |
I think we have two options (In the future, we may add more DER backends, e.g. pyannote and/or spyder):
I am against |
I would agree with you if the native name wasn't If you are against |
There is always a trade-off between keeping the original name and introducing a new name. While the IMHO, the If we rename that option, yes, the long form of spyder is probably the best (short form |
I also think that keeping the |
We could also define a long and short form for the option, like Some tools also have the following interface (not for this particular option), but I feel like they pollute the interface namespace:
Or one of these?
|
Thank you very much for sharing this repository. It is very useful to have a single repo with many audio metrics :)
Tools like pyannote allow us to choose the collar and whether to compute DER on overlapped speech regions or not.
With mdeval, we can specify the collar, but it seems like there is no option for including or excluding overlapped speech in the metric. Does that mean that by default it computes DER over overlapped regions? Or are they excluded from the calculation?
Thank you for your answer!