whisper_ros

This repository provides a set of ROS 2 packages to integrate whisper.cpp into ROS 2 using audio_common. Besides, silero-vad is used to perform VAD (Voice Activity Detection).

Installation

$ cd ~/ros2_ws/src
$ git clone https://github.com/mgonzs13/audio_common.git
$ git clone --recurse-submodules https://github.com/mgonzs13/whisper_ros.git
$ sudo apt install portaudio19-dev
$ pip3 install -r audio_common/requirements.txt
$ pip3 install -r whisper_ros/requirements.txt
$ cd ~/ros2_ws
$ colcon build

CUDA

To run llama_ros with CUDA, you have to install the CUDA Toolkit and the following line in the CMakeLists.txt must be uncommented:

option(WHISPER_CUBLAS "whisper: support for cuBLAS" ON)

Usage

Run Silero for VAD and Whisper for STT:

$ ros2 launch whisper_bringup whisper.launch.py

Send a goal action to listen:

$ ros2 action send_goal /whisper/listen whisper_msgs/action/STT "{}"

Or try the example of a whisper client:

$ ros2 run whisper_ros whisper_client_node

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
whisper_bringup		whisper_bringup
whisper_msgs		whisper_msgs
whisper_ros		whisper_ros
.gitignore		.gitignore
.gitmodules		.gitmodules
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

whisper_bringup

whisper_bringup

whisper_msgs

whisper_msgs

whisper_ros

whisper_ros

.gitignore

.gitignore

.gitmodules

.gitmodules

CITATION.cff

CITATION.cff

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

whisper_ros

Installation

CUDA

Usage

About

Releases 16

Packages

Languages

License

mgonzs13/whisper_ros

Folders and files

Latest commit

History

Repository files navigation

whisper_ros

Installation

CUDA

Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Languages