[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

iory · 2022-01-07T01:20:00Z

What is this?

This PR adds a script to publish audio amplitude plot image.
The following video shows the audio amplitude visualized at this node while clapping.
This PR is based on #2654 .

audio_amplitude_plot--SLASH--output--SLASH--viz.mp4

708yamaguchi

Thank you very much for the nice plotter and documentation!

This is because my code was ad-hoc, but can you make some of the code common?

708yamaguchi · 2022-01-07T01:49:03Z

audio_to_spectrogram/scripts/audio_amplitude_plot.py

+        # Audio topic config
+        # The number of channels in audio data
+        self.n_channel = rospy.get_param('~n_channel', 1)
+        # Sampling rate of microphone (namely audio topic).
+        self.mic_sampling_rate = rospy.get_param('~mic_sampling_rate', 16000)
+        # Bits per one audio data
+        bitdepth = rospy.get_param('~bitdepth', 16)
+        if bitdepth == 16:
+            self.dtype = 'int16'
+        else:
+            rospy.logerr("'~bitdepth' {} is unsupported.".format(bitdepth))


These lines are the settings to convert audio stream and used in audio_to_spectrum.py, too.

jsk_recognition/audio_to_spectrogram/scripts/audio_to_spectrum.py

Lines 18 to 30 in 6ff031d

# Audio topic config

# The number of channels in audio data

self.n_channel = rospy.get_param('~n_channel', 1)

# Sampling rate of microphone (namely audio topic).

mic_sampling_rate = rospy.get_param('~mic_sampling_rate', 16000)

# Period[s] to sample audio data for one fft

fft_sampling_period = rospy.get_param('~fft_sampling_period', 0.3)

# Bits per one audio data

bitdepth = rospy.get_param('~bitdepth', 16)

if bitdepth == 16:

self.dtype = 'int16'

else:

rospy.logerr("'~bitdepth' {} is unsupported.".format(bitdepth))

I think these codes should be shared.

Could you create python class like AudioBuffer, which receives audio_common_msgs/AudioData and create self.audio_buffer buffer.

Thanks. Created AudioBuffer class.

708yamaguchi · 2022-01-07T01:50:33Z

audio_to_spectrogram/scripts/audio_amplitude_plot.py

+    def audio_cb(self, msg):
+        # Convert audio buffer to int array
+        data = msg.data
+        audio_buffer = np.frombuffer(data, dtype=self.dtype)
+        # Retreive one channel data
+        audio_buffer = audio_buffer[0::self.n_channel]
+        # Save audio msg to audio_buffer
+        with self.lock:
+            self.audio_buffer = np.append(
+                self.audio_buffer, audio_buffer)
+            self.audio_buffer = self.audio_buffer[
+                -self.audio_buffer_len:]


As I mentioned in another comment, could you create python class like AudioBuffer?

jsk_recognition/audio_to_spectrogram/scripts/audio_to_spectrum.py

Lines 64 to 74 in 6ff031d

def audio_cb(self, msg):

# Convert audio buffer to int array

data = msg.data

audio_buffer = np.frombuffer(data, dtype=self.dtype)

# Retreive one channel data

audio_buffer = audio_buffer[0::self.n_channel]

# Save audio msg to audio_buffer

self.audio_buffer = np.append(

self.audio_buffer, audio_buffer)

self.audio_buffer = self.audio_buffer[

-self.audio_buffer_len:]

Thanks. Created AudioBuffer class.

708yamaguchi · 2022-01-07T01:54:04Z

audio_to_spectrogram/scripts/audio_amplitude_plot.py

+        self.ax.set_ylim((-self.maximum_amplitude, self.maximum_amplitude))
+
+        self.ax.legend(loc='upper right')
+        if self.pub_img.get_num_connections() > 0:


Could you use ConnectionBasedTransport?

In addition, if possible, could you create python class like AudioPlot, which configures matplotlib and can be used in both audio_amplitude_plot.py and spectrum_plot.py

k-okada · 2022-01-07T03:26:57Z

why did you create custom plot node instead of existing plotting tools (rqt / plotjuggler)

iory · 2022-01-07T04:07:33Z

@k-okada

how you created custom plot node instead of existing plotting tools (rqt / plotjuggler)

I'm sorry I don't understand well.
Do you suggest that we should make a plugin such as rqt_plot?

k-okada · 2022-01-07T04:52:05Z

@iory I want to know the reason why you created the nose just to visualize audio amplitude

iory · 2022-01-07T05:34:14Z

Since sound is invisible, it is nice for the user to have it visualized.
The sample rate, number of channels, and other parameters are given manually by the user.
Sometimes, users give these values wrong.
The simplicity of this node, which visualizes the amplitude of sound, makes it easy to notice these mistakes.
In other words, it is useful for debugging purposes.

k-okada · 2022-01-07T06:05:49Z

why you can not use rqt? or other plotting tools?

knorth55 · 2022-01-07T06:18:38Z

FYI:
we have audioinfo message to pass meta data of audio data.
ros-drivers/audio_common#152

iory · 2022-01-07T06:27:18Z

OK. I'll take another way.

iory · 2022-01-07T08:47:19Z

@k-okada

why you can not use rqt? or other plotting tools?

I'm sorry, I'd like to ask you one point.
How do you think visualizing audio_common_msgs/AudioData is a good way to do it?
Since audio_common_msgs/AudioData is array data, I think it is necessary to insert another node in order to visualize it with rqt_plot.

$ rosmsg show audio_common_msgs/AudioData
uint8[] data

Also, when it comes out as image data, we are happy to be able to correspond with another image because it has a timestamp.

k-okada · 2022-06-16T10:30:16Z

Since audio_common_msgs/AudioData is array data, I think it is necessary to insert another node in order to visualize it with rqt_plot.

@iory I see, LGTM

…tude_plot

… to 775

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

k-okada · 2022-12-14T02:41:25Z

@iory merged in #2755 , but I have manually fixed conflict so it might contain wrong code, please check that.

iory requested a review from 708yamaguchi January 7, 2022 01:20

iory force-pushed the visualize-audio-amplitude branch from 7c7a460 to 945ab02 Compare January 7, 2022 01:35

708yamaguchi reviewed Jan 7, 2022

View reviewed changes

iory closed this Jan 7, 2022

iory deleted the visualize-audio-amplitude branch January 7, 2022 06:28

iory restored the visualize-audio-amplitude branch January 7, 2022 08:36

iory reopened this Jan 7, 2022

k-okada added the PR/MergeOK label Jun 16, 2022

iory added enhancement document test sample labels Jun 16, 2022

iory force-pushed the visualize-audio-amplitude branch from 8de398b to cf705fc Compare June 17, 2022 11:47

iory added 8 commits June 26, 2022 20:31

[audio_to_spectrogram] Enable publishing frequency vs amplitude plot

aca42f6

[audio_to_spectrogram] Add docs for spectrum_plot.py

5e51ba7

[audio_to_spectrogram] Delete debug print

b5b60bc

[audio_to_spectrogram] Use ConnectionBasedTransport

30d7e3c

[audio_to_spectrogram] Fixed typo unsupported -> unsubscribe

fc70662

[audio_to_spectrogram] Move spectrum_plot.py node outside of gui option.

02eac51

[audio_to_spectogram] Add example spectrum image to docs

7095b97

[audio_to_spectogram] Add catkin_python_setup

a11e4f0

iory added 13 commits June 26, 2022 20:31

[audio_to_spectogram] convert_matplotlib_to_img as a library

fec0a44

[audio_to_spectogram] Add AudioAmplitudePlot cfg

74d315d

[audio_to_spectogram] Add AudioAmplitudePlot node

fdd1189

[audio_to_spectogram] Add AudioAmplitudePlot node to launch file

a8d35af

[audio_to_spectogram] Add AudioAmplitudePlot docs

067815c

[audio_to_spectogram] Add audio amplitude image to README

f371805

[audio_to_spectogram] Fixed size of audio amplitude image

60ba8f0

[audio_to_spectogram] Add AudioBuffer class

8fd8c5e

[audio_to_spectogram] Use ConnectionBasedTransport and AudioBuffer

e8b4807

[audio_to_spectrogram] Use AudioBuffer in audio_to_spectrum

c73528b

[audio_to_spectrogram] Set rosparam

3010192

[audio_to_spectrogram] Add docs for audio_to_spectrum and audio_ampli…

b9817c6

…tude_plot

[audio_to_spectrogram/cfg] Modified permission AudioAmplitudePlot.cfg…

7e09365

… to 775

iory force-pushed the visualize-audio-amplitude branch from 8f7f018 to 7e09365 Compare June 26, 2022 11:31

iory and others added 3 commits July 19, 2022 10:50

Merge branch 'master' into visualize-audio-amplitude

4320277

Merge branch 'master' into visualize-audio-amplitude

850e927

Merge branch 'master' into visualize-audio-amplitude

7636547

k-okada mentioned this pull request Dec 13, 2022

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657 #2755

Merged

k-okada added a commit that referenced this pull request Dec 14, 2022

Merge pull request #2755 from k-okada/visualize-audio-amplitude

c4df514

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

k-okada closed this Dec 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

iory commented Jan 7, 2022

708yamaguchi left a comment

708yamaguchi Jan 7, 2022

iory Jan 7, 2022

708yamaguchi Jan 7, 2022

iory Jan 7, 2022

708yamaguchi Jan 7, 2022

k-okada commented Jan 7, 2022 •

edited

iory commented Jan 7, 2022

k-okada commented Jan 7, 2022

iory commented Jan 7, 2022

k-okada commented Jan 7, 2022

knorth55 commented Jan 7, 2022

iory commented Jan 7, 2022

iory commented Jan 7, 2022 •

edited

k-okada commented Jun 16, 2022

k-okada commented Dec 14, 2022

	# Audio topic config
	# The number of channels in audio data
	self.n_channel = rospy.get_param('~n_channel', 1)
	# Sampling rate of microphone (namely audio topic).
	mic_sampling_rate = rospy.get_param('~mic_sampling_rate', 16000)
	# Period[s] to sample audio data for one fft
	fft_sampling_period = rospy.get_param('~fft_sampling_period', 0.3)
	# Bits per one audio data
	bitdepth = rospy.get_param('~bitdepth', 16)
	if bitdepth == 16:
	self.dtype = 'int16'
	else:
	rospy.logerr("'~bitdepth' {} is unsupported.".format(bitdepth))

	def audio_cb(self, msg):
	# Convert audio buffer to int array
	data = msg.data
	audio_buffer = np.frombuffer(data, dtype=self.dtype)
	# Retreive one channel data
	audio_buffer = audio_buffer[0::self.n_channel]
	# Save audio msg to audio_buffer
	self.audio_buffer = np.append(
	self.audio_buffer, audio_buffer)
	self.audio_buffer = self.audio_buffer[
	-self.audio_buffer_len:]

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

[audio_to_spectrogram] Add AudioAmplitudePlot node to visualize audio amplitude #2657

Conversation

iory commented Jan 7, 2022

What is this?

708yamaguchi left a comment

Choose a reason for hiding this comment

708yamaguchi Jan 7, 2022

Choose a reason for hiding this comment

iory Jan 7, 2022

Choose a reason for hiding this comment

708yamaguchi Jan 7, 2022

Choose a reason for hiding this comment

iory Jan 7, 2022

Choose a reason for hiding this comment

708yamaguchi Jan 7, 2022

Choose a reason for hiding this comment

k-okada commented Jan 7, 2022 • edited

iory commented Jan 7, 2022

k-okada commented Jan 7, 2022

iory commented Jan 7, 2022

k-okada commented Jan 7, 2022

knorth55 commented Jan 7, 2022

iory commented Jan 7, 2022

iory commented Jan 7, 2022 • edited

k-okada commented Jun 16, 2022

k-okada commented Dec 14, 2022

k-okada commented Jan 7, 2022 •

edited

iory commented Jan 7, 2022 •

edited