Speaker Recognition #11

Closed
kmarekspartz opened this issue Dec 16, 2013 · 2 comments

Comments

@kmarekspartz

Could this system be used for speaker recognition?

@worldveil
Owner

It depends on what you mean. If you have labeled recordings of a speaker and want to recognize those exact recordings being played back, then yes.

If, however, you want to identify (1:N) or verify (1:1) a person by the particular idiosyncrasies of their speech, then no. dejavu works off of a fingerprinting (read: hashing) system. Like any good hashing scheme, a small perturbation of the input (in dejavu's case, timing and frequency) will produce very different fingerprints.

While dejavu is very robust to noise, trying to recognize a voice, which does not reliably reproduce the same timing or frequencies each time, won't work. dejavu is meant for recognizing exact duplicates of previously recorded audio.
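
To make the perturbation point concrete, here is a minimal sketch of peak-pair hashing. It is not dejavu's actual code: `pair_hash`, `fingerprint`, `fan_out`, and the peak lists are hypothetical names and made-up data, and it assumes the spectrogram peaks have already been extracted as `(time_bin, freq_bin)` tuples. Each hash covers two frequency bins plus the time gap between them, so replaying the same audio at a different offset gives the same hashes, while nudging the peaks by a single bin changes the hashed keys.

```python
import hashlib


def pair_hash(freq1, freq2, dt):
    """Hash one pair of peaks: two frequency bins and the time gap between them."""
    key = f"{freq1}|{freq2}|{dt}".encode("utf-8")
    return hashlib.sha1(key).hexdigest()[:20]


def fingerprint(peaks, fan_out=3):
    """Pair each peak with the next `fan_out` peaks and hash every pair.

    `peaks` is a list of (time_bin, freq_bin) tuples sorted by time.
    """
    hashes = set()
    for i, (t1, f1) in enumerate(peaks):
        for t2, f2 in peaks[i + 1 : i + 1 + fan_out]:
            hashes.add(pair_hash(f1, f2, t2 - t1))
    return hashes


# Made-up peaks standing in for a stored recording.
original = [(0, 40), (3, 52), (7, 45), (12, 60), (15, 48)]

# The same recording heard later in a stream: only absolute time changes.
replayed = [(t + 100, f) for t, f in original]

# The "same" phrase spoken again: each peak lands a bin or so away.
respoken = [(0, 41), (4, 52), (7, 44), (11, 61), (15, 49)]

stored = fingerprint(original)
replay_overlap = len(stored & fingerprint(replayed))
respoken_overlap = len(stored & fingerprint(respoken))

print("replayed recording matches:", replay_overlap, "of", len(stored))
print("re-spoken phrase matches:", respoken_overlap, "of", len(stored))
```

The replayed copy shares every hash with the stored one because only time differences are hashed, whereas the re-spoken version shares none of the hashed keys, so there is nothing to match on.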

@kmarekspartz
Author

I was thinking about having a long recording of an individual repeating their name many times, and then as input having them say their name once. The fingerprinting approach may be useful there. I or a friend will try it out when we get a chance and get back to you.

Thanks!
