
unreliable recording and speaker diarization #3518

@kodjima33

UPDATED DIRECTION

Let's implement saving voices: once the user has labeled a speaker (or the speaker was auto-labeled), we save that voice to the speaker library so it can be reused to auto-label the same speaker in future conversations.

It will help us to properly name conversations later and automatically share them
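To make the idea concrete, here is a minimal sketch of how a saved voice could be matched in a later conversation. It is not how omi does it today. It assumes each speaker segment can be turned into a fixed-length embedding vector by some speaker-embedding model (not shown), and the names `SpeakerLibrary`, `save`, `identify`, and the `0.75` threshold are all hypothetical.

```python
import math
from dataclasses import dataclass, field


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class SpeakerLibrary:
    # Assumed threshold; a real value would have to be tuned on real recordings.
    threshold: float = 0.75
    # Maps a speaker name to every voice embedding saved for that speaker.
    voices: dict = field(default_factory=dict)

    def save(self, name, embedding):
        """Store an embedding once a speaker has been labeled (manually or automatically)."""
        self.voices.setdefault(name, []).append(list(embedding))

    def identify(self, embedding):
        """Return the best-matching saved name, or None if nothing reaches the threshold."""
        best_name, best_score = None, self.threshold
        for name, samples in self.voices.items():
            score = max(cosine_similarity(embedding, s) for s in samples)
            if score >= best_score:
                best_name, best_score = name, score
        return best_name


library = SpeakerLibrary()
library.save("Nik", [1.0, 0.0, 0.0])          # user labeled this speaker once
print(library.identify([0.9, 0.1, 0.0]))      # close to the saved voice
print(library.identify([0.0, 1.0, 0.0]))      # unrelated voice, stays unlabeled
```

Returning `None` below the threshold keeps unknown speakers unlabeled instead of guessing, which matters if labels are later used to name and share conversations.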

The existing "saved voices" library is located on the "identifying others" page.

I've also attached an example screenshot of how Limitless does it, but I don't think we need to change the UI for this feature.

Image Image

I talked to 7 omi users (mobile), and the top issue I heard was poor diarization quality (speaker assignment) and reliability.

As I'm writing this, I just had a 20-minute conversation with someone, but only 2 minutes (!!!) were captured. In that conversation I was the only one speaking, yet our app didn't even assign my name.

  • Check why omi didn't capture roughly 90% of this conversation (18 of 20 minutes). Make sure I understand the problem, then fix it.
  • Suggest to Nik a solution for speaker diarization that makes it THE BEST IN THE DAMN WORLD. Don't be lazy, even if you need to invent something new.
Image
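As a starting point for the first item, a small sketch like the one below could quantify how much of a conversation was captured and where the gaps are. It assumes the transcript segments carry start and end timestamps in seconds relative to the start of the conversation; `capture_coverage` and its parameters are hypothetical names, not an existing omi function.

```python
def capture_coverage(segments, conversation_seconds):
    """Return (captured_fraction, gaps) for a list of (start, end) segments in seconds."""
    if conversation_seconds <= 0:
        raise ValueError("conversation_seconds must be positive")

    # Clip segments to the conversation window and merge any that overlap.
    merged = []
    for start, end in sorted(segments):
        start = max(0.0, start)
        end = min(conversation_seconds, end)
        if end <= start:
            continue
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])

    covered = sum(end - start for start, end in merged)

    # Everything between merged segments (and before/after them) is a gap.
    gaps = []
    cursor = 0.0
    for start, end in merged:
        if start > cursor:
            gaps.append((cursor, start))
        cursor = end
    if cursor < conversation_seconds:
        gaps.append((cursor, conversation_seconds))

    return covered / conversation_seconds, gaps


# Illustrative numbers only: a 20-minute conversation with 2 minutes captured.
fraction, gaps = capture_coverage([(0, 60), (600, 660)], 20 * 60)
print(f"captured {fraction:.0%}, gaps: {gaps}")
```

Comparing the gap boundaries against device connection and upload logs may help show whether audio was never recorded or was recorded and then dropped.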

Metadata

Labels: stt (Related to speech-to-text transcription)

Status: Done

Relationships: None yet

Development: No branches or pull requests