iCub Audio Recognition

About

This set of modules implement Audio Filtering, Speech Recognition, and Hotword Detection for the iCub.

Structure

Audio Filtering

The Audio Filtering Module is located in the src/audioFiltering folder. Audio Filtering uses the webrtc-audio-filtering library for beamforming/filtering. The script setup.sh can be used to download and install this library.

The module then publishes filtered audio on the port /filteredAudio:o

Speech Recognition

The folder src/PythonSpeechRecognizer contains multiple scripts for different forms of speech recognition.

hotword.py

This script performs basic Speech Recognition based on the model from the Tensorflow Examples. The recognized speech is then published on the port /speech:o.

recognize.py

This script performs large vocabulary speech recognition using Kaldi. The script needs a speech recognition server setup using docker.

PocketSphinx Speech Recognizer

The folder src/PocketsphinxSpeechRecognizer contains a module that performs speech recognition with PocketSphinx.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
cmake/Modules		cmake/Modules
src		src
CMakeLists.txt		CMakeLists.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

iCub Audio Recognition

About

Structure

Audio Filtering

Speech Recognition

hotword.py

recognize.py

PocketSphinx Speech Recognizer

About

Releases

Packages

Contributors 2

Languages

TataLab/iCubAudioRecognition

Folders and files

Latest commit

History

Repository files navigation

iCub Audio Recognition

About

Structure

Audio Filtering

Speech Recognition

hotword.py

recognize.py

PocketSphinx Speech Recognizer

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages