Getting Bad request error for enrolment - Speaker Recognition #66

rajagopal28 · 2016-05-17T08:04:59Z

Hi,
I've been trying to enroll a voice file for a created profile using the python API.
I was able to create a profile and list all profiles successfully. But when I try to enroll a voice (.wav) file with a simple hello world phrase with the created profile, I get the error 'ERROR:root:Error enrolling profile.' which in the trace tells 'Exception: Error enrolling profile: Bad Request'. If needed I can attach the stack trace. Can you help me getting started with this?

rajagopal28 · 2016-05-18T08:35:25Z

It seems like an API problem, I've tried hitting the actual endpoint with a POST request along with the documentation specified parameters and headers. I get the response { 'status' : 'Bad request', message: 'Not a valid WAVE file - No RIFF header'. I've tried with multipart/form-data and using file input from postman REST client. I've also tried to hit the API endpoint in the actual console provided by Microsoft (which doesn't have any way to pass the wave file as file input) by encoding the audio file into string(which starts with data:audio/wav;base64..) Can anyone from Microsoft answer this. I know its in preview stage, but it should have some understandable instructions and parameter details.

momohs · 2016-05-18T09:31:42Z

Hi @rajagopal28,
Thanks for your comments. Can you please attach the *.wav file used for enrollment?

rajagopal28 · 2016-05-18T10:42:47Z

I've used 3 files, I'm attaching all the three
Archive.zip

@momohs I see that you are from Microsoft. In the API console link for enrolling and verifying there are text fields to send the audio file, In what format it should be sent? I used base64 encoded text (as mentioned above), I get the same error. Can you please clarify this? Thanks for your comment.

cthrash · 2016-05-19T02:23:45Z

It looks like the enrollment audio is too short. The audio file should be at least 20 seconds long and no longer than 5 minutes. The minimum number of total speech needed for enrollment, after removing silence, is 60 seconds.

@momohs - one improvement to consider is to include the response body in the exception. In this case it would have made the error much more obvious: { "error": { "code": "BadRequest", "message": "Audio too short" }

rajagopal28 · 2016-05-19T07:15:23Z

@cthrash Thank you so much. It worked, I enrolled a voice phrase to the created profile.
It would be better if there is a way to know this message('Not a valid WAVE file - No RIFF header' or 'Audio too short') in the python wrapper log. It only shows the code ('BadRequest'), which is not so helping in identifying the issue.

momohs · 2016-05-19T16:41:32Z

@rajagopal28 I have tried out the files you sent and I did some successful enrollments with them. However, the file "password.wav" has an incorrect sampling rate. and thus gave me an "incorrect sampling rate error". I have used a REST client for this.

Regarding the python wrapper, the enrollments were successful but I have received a "Bad request" for the file "password.wav". Indeed the exception needs to be better handled in the python wrapper.

Using the console, I am not sure how to attach the file to the request. I am in contact with the team responsible for that. I'll get back to you once it is sorted out.

@cthrash The "Audio Too Short" exception message is currently thrown out by the server if the audio is too short. At this moment, the audio should be from 1 to 15 seconds (as mentioned in the API Documentation)

cthrash · 2016-05-19T18:08:52Z

1-15 seconds, IIUC, is for Speaker Verification. In the Stack Overflow Post, @rajagopal28 is asking (despite the title) about Speaker Identification, as you can see from the call stack.

jjsuarez · 2016-06-25T22:21:02Z

Hello, I am also having problems enrolling an audio file in the API testing console. Please can you answer the question that @rajagopal28 asked, what format should be used in the Request body field? I am getting the same error: {
"error": {
"code": "BadRequest",
"message": "Invalid Audio Format: Not a WAVE file - no RIFF header"
}
}

My file is recorded according to the required parameter values of format and length. Any help would be greatly appreciated. Thanks a lot.

momohs · 2016-06-26T13:51:22Z

Thanks for your feedback @jjsuarez!
We are aware of the issue with uploading audio files using the API Testing Console and we are still sorting it out! Meanwhile, I urge you to use the Python sample code or the C# sample code or the Online demos to test the Speaker Recognition service.

margaretmz · 2016-07-07T05:10:24Z

This issue was moved to microsoft/Cognitive-SpeakerRecognition-Python#2

taunkankur · 2017-07-04T18:51:05Z

I am getting follwing response -

{
"error": {
"code": "BadRequest",
"message": "InvalidPhrase"
}
}

soso-maitha · 2018-03-29T04:22:47Z

I am getting "InvalidPhrase" as well. What could be the cause?

khilscher · 2018-05-10T19:37:59Z

Also getting "InvalidPhrase". Regardless of the audio length.

EasonWang01 · 2018-10-30T12:42:30Z

Hey guys, I also encounter this InvalidPhrase issue before.
Eventually, I found out we can only say what Azure ask us to say.

Using the following API to List All Supported Verification Phrases.
https://westus.dev.cognitive.microsoft.com/docs/services/563309b6778daf02acc0a508/operations/5652c0801984551c3859634d

kiranmahto · 2019-01-15T12:31:57Z

i used python sample code code but it is giving error "message": "Invalid Audio Format: Require Mono"
or "message": "Invalid Audio Format: Require PCM"

soso-maitha · 2019-01-15T12:58:54Z

@kiranmahto use “Audacity” software with which you can convert the audio file to the required format. For the Speaker Verification service the audio file should be in specific format eg. Mono channel not dual, sampeling rate..etc. you will find these in the documentation of the API, i can share the link tomorrow

kiranmahto · 2019-01-15T15:51:50Z

Ok thanks Please do share the links

…

On Jan 15, 2019 18:29, "soso-maitha" ***@***.***> wrote: @kiranmahto <https://github.com/kiranmahto> use “Audacity” software with which you can convert the audio file to the required format. For the Speaker Verification service the audio file should be in specific format eg. Mono channel not dual, sampeling rate..etc. you will find these in the documentation of the API, i can share the link tomorrow — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#66 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AjJC7G-TWJuaN5jbrrGHORlmP1Awvs11ks5vDdCVgaJpZM4IgBtU> .

soso-maitha · 2019-01-16T16:21:10Z

All audio files in the dataset should be stored in the WAV (RIFF) audio format.
The audio must have a sampling rate of 8 kilohertz (KHz) or 16 KHz, and the sample values should be stored as uncompressed, pulse-code modulation (PCM) 16-bit signed integers (shorts).
Only single-channel (mono) audio files are supported.
You will find these requirement for most of Microsoft cognitive services dealing with sound files.
Reference: https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/how-to-customize-acoustic-models

Also search in Pluralsight (the website or the app) search for Microsoft speech and Speaker Recognition course it is explained step by step.

rajagopal28 changed the title ~~Getting Bad request error for enrolment - Speech Recogntioon~~ Getting Bad request error for enrolment - Speech Recogntion May 17, 2016

rajagopal28 changed the title ~~Getting Bad request error for enrolment - Speech Recogntion~~ Getting Bad request error for enrolment - Speech Recognition May 17, 2016

rajagopal28 changed the title ~~Getting Bad request error for enrolment - Speech Recognition~~ Getting Bad request error for enrolment - Speaker Recognition May 17, 2016

lightfrenzy added the Speaker Recognition label May 17, 2016

momohs self-assigned this Jun 26, 2016

margaretmz mentioned this issue Jul 7, 2016

Getting Bad request error for enrolment - Speaker Recognition microsoft/Cognitive-SpeakerRecognition-Python#2

Closed

margaretmz closed this as completed Jul 7, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Getting Bad request error for enrolment - Speaker Recognition #66

Getting Bad request error for enrolment - Speaker Recognition #66

rajagopal28 commented May 17, 2016

rajagopal28 commented May 18, 2016

momohs commented May 18, 2016

rajagopal28 commented May 18, 2016 •

edited

cthrash commented May 19, 2016

rajagopal28 commented May 19, 2016

momohs commented May 19, 2016

cthrash commented May 19, 2016

jjsuarez commented Jun 25, 2016

momohs commented Jun 26, 2016

margaretmz commented Jul 7, 2016

taunkankur commented Jul 4, 2017

soso-maitha commented Mar 29, 2018

khilscher commented May 10, 2018

EasonWang01 commented Oct 30, 2018 •

edited

kiranmahto commented Jan 15, 2019

soso-maitha commented Jan 15, 2019

kiranmahto commented Jan 15, 2019 via email

soso-maitha commented Jan 16, 2019 •

edited

Getting Bad request error for enrolment - Speaker Recognition #66

Getting Bad request error for enrolment - Speaker Recognition #66

Comments

rajagopal28 commented May 17, 2016

rajagopal28 commented May 18, 2016

momohs commented May 18, 2016

rajagopal28 commented May 18, 2016 • edited

cthrash commented May 19, 2016

rajagopal28 commented May 19, 2016

momohs commented May 19, 2016

cthrash commented May 19, 2016

jjsuarez commented Jun 25, 2016

momohs commented Jun 26, 2016

margaretmz commented Jul 7, 2016

taunkankur commented Jul 4, 2017

soso-maitha commented Mar 29, 2018

khilscher commented May 10, 2018

EasonWang01 commented Oct 30, 2018 • edited

kiranmahto commented Jan 15, 2019

soso-maitha commented Jan 15, 2019

kiranmahto commented Jan 15, 2019 via email

soso-maitha commented Jan 16, 2019 • edited

rajagopal28 commented May 18, 2016 •

edited

EasonWang01 commented Oct 30, 2018 •

edited

soso-maitha commented Jan 16, 2019 •

edited