Skip to content

idiap/audio_perception

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio Perception

Dependencies

Sound Source Localization

Script

  • ssl_nn.py

Subscribe topics

  • /naoqi_driver_node/audio : Audio signals captured by Pepper

Publish topics

  • /audio_perception/ssl : sound detected
  • /visualization_marker : markers of detected sound, which can be visualized in rviz

Example Usage

rosrun audio_perception ssl_nn.py -n stft --context-frames=25 -v --marker --frame-id=Head PATH

A trained model (consists of two files) can be downloaded from http://protolab.aldebaran.com/mummer_downloads/idiap/models/

See also the example launch file launch/ssl_example.launch.

Arguments

  • PATH : the path to the trained neural network model without its extension

Options

  • --cpu : use CPU for computation (instead of GPU)
  • --visualize : visualize localization
  • --marker : publish /visualization_marker
  • --frame-id : specify the reference frame of detected sound location

For more detailed help message, run rosrun audio_perception ssl_nn.py -h.

Visualization

There are two ways to visualize the SSL results.

  1. Visualize the likelihood of sound source being in each individual direction in a top view (with option --visualize).
  2. Publish detection results as visulization markers (with option --marker). You can view them in rviz by adding topic /visualization_marker to visualization. To overlay the markers to the pepper RGB camera images, you can view the topic /naoqi_driver_node/camera/front as camera. A correct reference frame of the markers must be specifed with option --frame-id.

About

ROS package to wrap apkit and archs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published