audio_device selection in Livespeech #23

saibharani · 2017-05-23T08:47:14Z

I want to do keyword recognition from a specific mic. Can you tell me how to do audio_device selection.
I want to select audio_device with index 2.
Thank you.

slowrunner · 2019-06-19T21:35:52Z

@saibharani
You can try
speech = LiveSpeech(audio_device='2')

but if you are asking about an ALSA audio_device=2 you may get (like I am getting...):

Error opening audio device 2 for capture: Connection refused Traceback (most recent call last): File "keyword.py", line 16, in <module> speech = LiveSpeech(audio_device='2',verbose=True,sampling_rate=44100,lm=False,keyphrase='forward',kws_threshold='le-20') File "/usr/local/lib/python3.5/dist-packages/pocketsphinx/__init__.py", line 206, in __init__ self.ad = Ad(self.audio_device, self.sampling_rate) File "/usr/local/lib/python3.5/dist-packages/sphinxbase/ad_pulse.py", line 124, in __init__ this = _ad_pulse.new_Ad(audio_device, sampling_rate) RuntimeError: new_Ad returned -1

LiveSpeech() seems to be trying pulseaudio, can it use ALSA device?

Can we use a USB mic that only supports 44.1K and 48K sample_rate?

bkravi-os-iot · 2019-08-26T03:57:02Z

@slowrunner . may be this late as I joined this platform late.. May be you are not in this need anymore. But can be helpful for others.. By default LiveSpeech will call ad_pulse.py when sys.platform.startswith('linux').. as in your case it is calling ad_pulse. But if you want to change this call to as_alsa.py, simple change below lines in /usr/local/lib/python2.7/dist-packages/sphinxbase/init.py:

import sys

if sys.platform.startswith('win'):
from .ad_win32 import *
elif sys.platform.startswith('darwin'):
from .ad_openal import *
elif sys.platform.startswith('linux'):
try:
### RAVI commented below and added line to import alsa. Otherwise, it will call pulse not alsa ###
#from .ad_pulse import *
from .ad_alsa import *
except:
from .ad_alsa import *

from .sphinxbase import *

Please check above in bold and italic.

Hope this helps

slowrunner · 2019-08-26T14:35:23Z

@bkravi-os-iot Thank you. Made that change in each sphinxbase/__init__.py, but still not having enough success.

If I use LiveSpeech(audio_device='hw:1,0',sampling_rate=16000...), it finds the capture device but claims:

*** get_model_path(): /usr/local/lib/python3.5/dist-packages/pocketsphinx/model
Available samping rate 11025 is too far from requested 16000

(This may be the HDMI device though, not my USB mic)

which is weird because in a non-livespeech script I use:

p = pyaudio.PyAudio()
samprate = 16000
stream = p.open(format=pyaudio.paInt16, channels=1, rate=int(samprate), input=True, frames_per_buffer=1024)
stream.start_stream()

and pocketsphinx works just fine.

I tried sampling_rate=11025 but pocketsphinx only has a 16k model and cores:

*** get_model_path(): /usr/local/lib/python3.5/dist-packages/pocketsphinx/model
Segmentation fault

btw: arecord -l

**** List of CAPTURE Hardware Devices ****
card 1: Device [USB Audio Device], device 0: USB Audio [USB Audio]
  Subdevices: 1/1
  Subdevice #0: subdevice #0

Ozer0 · 2019-11-19T16:51:51Z

I got a sollution (works for me) (:
I have the same result as you when i use arecord -l, that means the hardware is plughw:1,0
Once you installed and builded pocketsphinx and sphinxbase, You can test it in a terminal with the following comand:
pocketsphinx_continuous -inmic yes -adcdev plughw:1,0
If it works, you can go to the next step with the LiveSpeech function with the parameter
audio_device='plughw:1,0'

Example

import os
from pocketsphinx import LiveSpeech, get_model_path

model_path = get_model_path()

speech = LiveSpeech(
    audio_device='plughw:1,0',   #<---- Here is
    verbose=False,
    sampling_rate=16000,
    buffer_size=2048,
    no_search=False,
    full_utt=False,
    hmm=os.path.join(model_path, 'en-us'),
    lm=os.path.join(model_path, 'en-us.lm.bin'),
    dic=os.path.join(model_path, 'cmudict-en-us.dict')
)

for phrase in speech:
    print(phrase)

NOTE: I needed to change the init.py as @bkravi-os-iot said. (Thank you)

Ozer0 · 2019-12-10T01:44:11Z

Hi (: Yes, what i did was chage the file __init__.py that is on the sphinxbase directory. Just get into that directory and type: sudo nano __init__.py Change the two lines and it's done. I'll put two screenshots to make it easier for you. El 09/12/2019 6:54 p. m., Judison Bacalso <notifications@github.com> escribió: Hi @Ozer0<https://github.com/Ozer0> I was just wondering how did you change the init.py that @bkravi-os-iot<https://github.com/bkravi-os-iot> said. because i cannot change it because it cannot be writable. Thanks! — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<#23?email_source=notifications&email_token=AMFRP7XP7G5MUZ5MTIUIRSDQX3SGDA5CNFSM4DMNPF72YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEGLHUIA#issuecomment-563509792>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AMFRP7URAOEIIBHT6PUBN63QX3SGDANCNFSM4DMNPF7Q>.

JDSNX · 2019-12-10T02:22:22Z

Hi @Ozer0
I have successfully change the init.py. But now I have another problem, take a look at this:

Error opening audio device plughw:1,0 for capture: Invalid argument
Traceback (most recent call last):
File "/home/pi/Desktop/try.py", line 17, in
dic=os.path.join(model_path, 'cmudict-en-us.dict')
File "/usr/local/lib/python3.7/dist-packages/pocketsphinx/init.py", line 206, in init
self.ad = Ad(self.audio_device, self.sampling_rate)
File "/usr/local/lib/python3.7/dist-packages/sphinxbase/ad_pulse.py", line 122, in init
this = _ad_pulse.new_Ad(audio_device, sampling_rate)
RuntimeError: new_Ad returned -1

I just follow what you said above about the audio_device.

JDSNX · 2019-12-10T02:23:39Z

By the way, I am running this in Raspberry Pi

Ozer0 · 2019-12-10T03:50:37Z

Ok, my init.py looks like this.

import sys

if sys.platform.startswith('win'):
    from .ad_win32 import *
elif sys.platform.startswith('darwin'):
    from .ad_openal import *
elif sys.platform.startswith('linux'):
    try:
        from .ad_alsa import *
    except:
        from .ad_alsa import *

from .sphinxbase import *

I had the same problem. Did you tried with this comand on the console?
pocketsphinx_continuous -inmic yes -adcdev plughw:1,0

If there's the problem like "ERROR continuos.c" maybe when you builded sphinxbase the Raspberry choosed pulseaudio instead of ALSA. So, check this link. http://robot.laylamah.com/?p=35

You have to verify Sphinxbase uses ALSA when you type the comand ./configure.
This is because Sphinxbase always look first for pulseaudio, but in the documentation they said in embebed systems you should use ALSA.

Even if you didn't installed pulseaudio this could happen (my case), so what I did was look for that pulseaudio.h file that makes that error. It was on a directory called Pulse in the /home/pi
So I had to move it to another directory (Desktop) with the comand
sudo mv pulse/ /home/pi/Desktop
Then I re-configured sphinxbase and realized sphinxbase choosed ALSA.

I think you could look for the file with whis comand on the console
whereis pulseaudio

Ozer0 · 2019-12-10T03:52:54Z

Working with pocketsphinx python on Raspberry could be difficult because of many errors while installing all dependencies, that's why I'm working on a YouTube tutorial for begginers like me.
I'll share the video's link when finished (:

JDSNX · 2019-12-10T11:37:17Z

How can I choose the ALSA?

I cannot find this when i configure the sphinxbase

I am really confused now :<

Ozer0 · 2019-12-10T17:53:51Z

First of all do the steps to rename or move the "pulseaudio.h" file or if it's the case, uninstall pulseaudio. Then you have to re-configure sphinxbase with ./configure
When finish, look for the line that says
checking for alsa/aaoundlib.h .. yes

JDSNX · 2019-12-11T05:51:22Z

I already re-configured my sphinxbase and I already saw checking for alsa/aaoundlib.h .. yes. And I also uninstall my pulseaudio.

My problem now is when i check the alsamixer it has an error:

After I uninstall my pulseaudio this error came up. Been searching for a while for this error but cannot fixed.

Ozer0 · 2019-12-11T06:28:46Z

Did you only uninstall pulseaudio?
What i did was look for the file i told you and moved it to another directory.
I don't have installed pulseaudio on my raspberry and i don't have that problem while using alsautils.

JDSNX · 2019-12-11T09:03:27Z

I uninstall pocketsphinx and sphinxbase. Re-configured them all but when I check init.py i doesnt have any code.

Now, when i run this: pocketsphinx_continuous -inmic yes -adcdev plughw:1,0 this is the output:

Can you determine what did I do wrong?

Ozer0 · 2019-12-11T17:57:19Z

Are you sure you're trying to edit the sphinxbase's init.py?
Because pocketsphinx also has an init.py file, don't confuse them.
You don't need to uninstall pocketsphinx or sphinxbase, you only have to re-configure them and install again. Try to do that

And about the problem, seems you need to specific the language model. You can try with a comand like this

Try tp add the 3 parameter
-hmm is the en-us file
-lm = language model (.lm file)
-dict = dictionary (.dic file)
So go to your model folder and change the name to yours

JDSNX · 2019-12-12T03:10:05Z

Hi sir! I am able to run the continuous now. But the thing is i cannot run this line: pocketsphinx_continuous -inmic yes -adcdev plughw:1,0 I can only run by using this ./src/programs/pocketsphinx_continuous -adcdev plughw:1,0 -nfft 2048 -samprate 48000.

When im using the first one the error is this

But when I use the second one i need to go to the pocketsphinx folder and run it there. I just followed the steps you linked to me before.

JDSNX · 2019-12-12T03:15:14Z

I fixed it using sudo ldconfig.

Ozer0 · 2019-12-12T03:24:13Z

Happy to hear that.
So now you can use a python script with pocketsphinx?

JDSNX · 2019-12-12T03:27:26Z

Yes, I am using the one you gave. Is there a way to catch the string? Or I will need the LiveSpeech?

Ozer0 · 2019-12-12T04:20:25Z

Yeah, you have to use LiveSpeech and convert the "phrase" in code.
See this video, it was helpful for me.
https://www.youtube.com/watch?v=f_Pq7GDdxos

Also you can create your own dictionary and language model to reduce the error, it's very helpful. See this page (Step 8) https://medium.com/@ranjanprj/for-some-time-now-i-have-been-thinking-really-hard-to-build-a-diy-study-aid-for-children-which-uses-17ce90e72f43
Once you get the two files, you have to copy to the model folder and next use them on the python script (LiveSpeech parameters)

JDSNX · 2019-12-12T10:09:42Z

I followed the steps of the video you showed me and it has an error about the LiveSpeech

But the using the pocketpshinx_continuous does not have an error. I even create my own dictionary and language.

Ozer0 · 2019-12-12T17:59:02Z

Why are you executing your script with ./ ?
I don't know if that works but you have errors with python's keywords like "import", you should try with
python speech.py

JDSNX · 2019-12-13T03:19:36Z

Why I cannot import LiveSpeech?

Ozer0 · 2019-12-13T04:15:40Z

That's was my problem too haha
Did you successfully installed pocketsphinx?

Here are the steps I followed, you should try with pip3 instead of pip.

Ozer0 · 2019-12-13T04:16:54Z

First try to uninstall pocketsphinx python with
pip uninstall pocketsphinx

JDSNX · 2019-12-13T08:39:59Z

Hi! Do you have any idea how to deal with this?

JDSNX · 2019-12-13T08:50:30Z

I am back with this error.

I already change init.py from pulse to alsa. Both 3.7 and 2.7. I just followed the video you linked.

Ozer0 · 2019-12-14T03:37:33Z

Hey man, did you solve it?

JDSNX · 2019-12-14T03:45:01Z

Nah. I keep getting the error like this. Same error for python 2 and 3.

I changed the init.py but the error still there.

JDSNX · 2019-12-14T03:45:32Z

What parameters you put in the LiveSpeech?

Ozer0 · 2019-12-14T04:00:54Z

See my first reply on this post

JDSNX · 2019-12-14T04:59:55Z

I got it now sir! Thank you so much! 👍 really appreciate your help!

Ozer0 · 2019-12-14T09:25:55Z

Very happy to hear you finally can use it :D
You're welcome, I'm here to help people with the things i learned in so many forums like this.

And not a Sir, I'm just 22 haha
Cheers from Mexico City 🇲🇽

JDSNX · 2019-12-16T04:00:31Z

Hi! Im back again! :D
I just wanna ask if it is possible in LiveSpeech to print the words when I stop speaking? Because when I have a certain sentence it will break. For example I will say "HI HOW ARE YOU" It will output like this
HI HOW
ARE YOU

Is there a way to catch it?

Ozer0 · 2019-12-16T17:42:36Z

Do you want to do something with the words you say?
Like a command?

JDSNX · 2019-12-17T04:22:15Z

Yes. It cannot catch all of my sentence because when I try to use it, it will automatically read what I say. Is there way that the livespeech could run when im done talking?

JDSNX · 2019-12-17T04:48:50Z

I have a custom language model and dictionary already.

Ozer0 · 2019-12-17T05:22:38Z

Ok, you have to use your .lm and .dict files in the LiveSpeech's parameters function. Then you need to compare the string with the words you say
Here is a simple example

#Creating a command function
def command(word):
    if 'THE WORDS YOU WANT' in word:
        #do what you want, example
        print("Hello")
    else:
        print("Repeat it please")

for phrase in speech:
    word = str(phrase)
    #calling the commands function
    command(word)

Ozer0 · 2019-12-17T05:24:24Z

Ozer0 · 2019-12-17T05:25:11Z

Ozer0 · 2019-12-17T05:26:39Z

JDSNX · 2019-12-17T09:30:35Z

Ya, we have the same code. But I dont know why it will automatically print the phrase even if im not done talking yet.

Ozer0 · 2019-12-17T18:41:23Z

The dictionary you created is conformed of only words?, or you wrote the wole phrases (two or more words)?
For example, I use a LIGHT ON command. So that's how i wrote the txt file to create the language model.

LIGHT ON
LIGHT OFF
.
.
.

etc

JDSNX · 2019-12-18T02:30:27Z

What I wrote on my txt file is by word like OFF, ON, OPEN, CLOSE, etc. I cannot get the whole sentence because it will break automatically for example i will say "Please turn on the lights". The sentence will be separated to PLEASE TURN and ON THE LIGHTS. That is my problem as of now. I cannot get the whole phrase

Ozer0 · 2019-12-19T05:57:51Z

Maybe you should try using the whole phrases in your txt file.
Once you get that, you can use the whole phrase in your if sentence.
I tell you this because the dictionary is different from the language model, the last one is used to determinate what words would be in front of some other word.
You can read the CMUSphinx documentation to understand it.

ismailryt · 2020-04-24T04:57:40Z

Hi @Ozer0 ,I hope you want to help me
I already follow every step and all the step in conversation between you and @JDSNX

But i still have a error like this

Error opening audio device plughw:1,0 for capture: Device or resource busy
Traceback (most recent call last):
File "try.py", line 15, in
dic=os.path.join(model_path, '0175.dic')
File "/usr/local/lib/python3.7/dist-packages/pocketsphinx/init.py", line 206, in init
self.ad = Ad(self.audio_device, self.sampling_rate)
File "/usr/local/lib/python3.7/dist-packages/sphinxbase/ad_alsa.py", line 122, in init
this = _ad_alsa.new_Ad(audio_device, sampling_rate)
RuntimeError: new_Ad returned -1

As you can see audio device plughw:1,0 is error, i already check using arecord -l, i hope you have any solution, anyway im using respeaker 4-mic array

ismailryt · 2020-04-24T05:03:50Z

pi@raspberrypi:~ $ arecord -l
**** List of CAPTURE Hardware Devices ****
card 1: seeed4micvoicec [seeed-4mic-voicecard], device 0: bcm2835-i2s-ac10x-codec0 ac10x-codec.1-003b-0 [bcm2835-i2s-ac10x-codec0 ac10x-codec.1-003b-0]
Subdevices: 0/1
Subdevice #0: subdevice #0

Ozer0 · 2020-04-24T17:08:36Z

pi@raspberrypi:~ $ arecord -l
**** List of CAPTURE Hardware Devices ****
card 1: seeed4micvoicec [seeed-4mic-voicecard], device 0: bcm2835-i2s-ac10x-codec0 ac10x-codec.1-003b-0 [bcm2835-i2s-ac10x-codec0 ac10x-codec.1-003b-0]
Subdevices: 0/1
Subdevice #0: subdevice #0
Hi there.
Seems like you have the ALSA problem, have you changed the init file?

ismailryt · 2020-04-25T12:04:13Z

@Ozer0 Thanks for you reply, actually i already fix that but now i have this error

RuntimeError: new_Decoder returned -1

I know this is happend because path problem, but i dont know how to set right path, i use your code

model_path = get_model_path()

speech = LiveSpeech(
audio_device='plughw:1,0', #<---- Here is
verbose=False,
sampling_rate=16000,
buffer_size=2048,
no_search=False,
full_utt=False,
hmm=os.path.join(model_path, 'en-us-id'),
lm=os.path.join(model_path, '0175.lm'),
dic=os.path.join(model_path, '0175.dic')
)

Can you teach me this part of line " model_path = get_model_path() "
Thanks

Ozer0 · 2020-05-02T03:40:11Z

@Ozer0 Thanks for you reply, actually i already fix that but now i have this error

RuntimeError: new_Decoder returned -1

I know this is happend because path problem, but i dont know how to set right path, i use your code

model_path = get_model_path()

speech = LiveSpeech(
audio_device='plughw:1,0', #<---- Here is
verbose=False,
sampling_rate=16000,
buffer_size=2048,
no_search=False,
full_utt=False,
hmm=os.path.join(model_path, 'en-us-id'),
lm=os.path.join(model_path, '0175.lm'),
dic=os.path.join(model_path, '0175.dic')
)

Can you teach me this part of line " model_path = get_model_path() "
Thanks

Hi, you solved the problem?

ismailryt · 2020-05-09T14:05:08Z

@Ozer0 actually no error right now, but now my problem is LiveSpeech cant recognize anything.. When i use http://blog.justsophie.com/python-speech-to-text-with-pocketsphinx/ decode method, it can recognize 1 word then input over flowed because my raspberry pi cpu too slow

So my last hope is using LiveSpeech, im wondering why no error but cant recognize anything, when I use pocketsphinx_continuous, it works can record my word and can give me output. Can you help me

Ozer0 · 2020-05-12T02:34:28Z

@Ozer0 actually no error right now, but now my problem is LiveSpeech cant recognize anything.. When i use http://blog.justsophie.com/python-speech-to-text-with-pocketsphinx/ decode method, it can recognize 1 word then input over flowed because my raspberry pi cpu too slow

So my last hope is using LiveSpeech, im wondering why no error but cant recognize anything, when I use pocketsphinx_continuous, it works can record my word and can give me output. Can you help me

Well, gotta say I used LiveSpeech in a Raspberry Zero so i don't think the CPU is your problem.
What it could be is you have to use an infinite loop, are you using that on your python code?
You can see part of my code above these answers

ismailryt · 2020-05-14T12:40:08Z

@Ozer0 i find the another solution! I use decode method not using LiveSpeech and can do hotword after that give intruction to system. Thanks for this thread and all of you guys, really helped me

ismailryt mentioned this issue May 9, 2020

LiveSpeech cant recognize anything, it needed pulseaudio? #56

Open

audio_device selection in Livespeech #23

audio_device selection in Livespeech #23

Comments

saibharani commented May 23, 2017 • edited

slowrunner commented Jun 19, 2019

bkravi-os-iot commented Aug 26, 2019

slowrunner commented Aug 26, 2019 • edited

Ozer0 commented Nov 19, 2019

Ozer0 commented Dec 10, 2019 via email

JDSNX commented Dec 10, 2019

JDSNX commented Dec 10, 2019

Ozer0 commented Dec 10, 2019

Ozer0 commented Dec 10, 2019

JDSNX commented Dec 10, 2019

Ozer0 commented Dec 10, 2019

JDSNX commented Dec 11, 2019

Ozer0 commented Dec 11, 2019

JDSNX commented Dec 11, 2019

Ozer0 commented Dec 11, 2019

JDSNX commented Dec 12, 2019

JDSNX commented Dec 12, 2019

Ozer0 commented Dec 12, 2019

JDSNX commented Dec 12, 2019

Ozer0 commented Dec 12, 2019

JDSNX commented Dec 12, 2019

Ozer0 commented Dec 12, 2019

JDSNX commented Dec 13, 2019

Ozer0 commented Dec 13, 2019

Ozer0 commented Dec 13, 2019

JDSNX commented Dec 13, 2019

JDSNX commented Dec 13, 2019 • edited

Ozer0 commented Dec 14, 2019

JDSNX commented Dec 14, 2019

JDSNX commented Dec 14, 2019

Ozer0 commented Dec 14, 2019

JDSNX commented Dec 14, 2019

Ozer0 commented Dec 14, 2019

JDSNX commented Dec 16, 2019

Ozer0 commented Dec 16, 2019

JDSNX commented Dec 17, 2019

JDSNX commented Dec 17, 2019

Ozer0 commented Dec 17, 2019 • edited

Ozer0 commented Dec 17, 2019

Ozer0 commented Dec 17, 2019

Ozer0 commented Dec 17, 2019

JDSNX commented Dec 17, 2019

Ozer0 commented Dec 17, 2019

JDSNX commented Dec 18, 2019

Ozer0 commented Dec 19, 2019

ismailryt commented Apr 24, 2020

ismailryt commented Apr 24, 2020

Ozer0 commented Apr 24, 2020

ismailryt commented Apr 25, 2020

Ozer0 commented May 2, 2020

ismailryt commented May 9, 2020

Ozer0 commented May 12, 2020 • edited

ismailryt commented May 14, 2020

saibharani commented May 23, 2017 •

edited

slowrunner commented Aug 26, 2019 •

edited

JDSNX commented Dec 13, 2019 •

edited

Ozer0 commented Dec 17, 2019 •

edited

Ozer0 commented May 12, 2020 •

edited