Fix sample rate issues #153

juanmc2005 · 2023-05-21T16:14:20Z

This PR addresses issue #152.

Changelog

* MicrophoneAudioSource now detects the sample rates supported by the input device and chooses the lowest one (higher than 16 kHz)
* MicrophoneAudioSource no longer receives a sample_rate argument
* Replace the block_size argument with block_duration in all audio sources
* Fix a bug in StreamingInference when resampling audio to the pipeline's sample rate
* Resample can now run on GPU by passing a device argument
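
As a rough illustration of the first three changelog items, here is a minimal sketch of the selection rule and of the duration-to-samples conversion. It is not the code from this PR: the helper names (`pick_sample_rate`, `block_size_from_duration`), the candidate list, and the `is_supported` callback are all hypothetical. With a real microphone the callback could wrap a device query that rejects unsupported rates. The sketch also assumes that 16 kHz itself is acceptable when the device supports it.

```python
from typing import Callable, Iterable, Optional


def pick_sample_rate(
    candidates: Iterable[int],
    is_supported: Callable[[int], bool],
    minimum: int = 16000,
) -> Optional[int]:
    """Return the lowest supported rate that is >= minimum, or None."""
    for rate in sorted(candidates):
        if rate >= minimum and is_supported(rate):
            return rate
    return None


def block_size_from_duration(block_duration: float, sample_rate: int) -> int:
    """Convert a block duration in seconds to a number of samples."""
    return int(round(block_duration * sample_rate))


# Hypothetical device that rejects 16 kHz and 32 kHz but accepts 44.1 and 48 kHz.
supported = {44100, 48000}
rate = pick_sample_rate([16000, 32000, 44100, 48000], lambda r: r in supported)
assert rate is not None  # this fake device supports at least one candidate

# A 0.5 s block keeps the same duration whatever rate was selected.
block_size = block_size_from_duration(0.5, rate)
```

Expressing the block as a duration means callers do not need to know which rate the device ended up with; the sample count follows from it.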

juanmc2005 · 2023-06-05T11:29:37Z

src/diart/inference.py

        # Dynamic resampling if the audio source isn't compatible
        if sample_rate != self.source.sample_rate:
            msg = f"Audio source has sample rate {self.source.sample_rate}, " \
                  f"but pipeline's is {sample_rate}. Will resample."
            logging.warning(msg)
            self.stream = self.stream.pipe(
-                ops.map(blocks.Resample(self.source.sample_rate, sample_rate))
+                ops.map(blocks.Resample(self.source.sample_rate, sample_rate, self.pipeline.config.device))


Resample before rearrange_audio_stream so the same audio is not resampled multiple times.
Because of how the first 5s buffer is filled at the beginning, Resample will actually be called more times this way. However, unless it is running on GPU, each call should also be faster because the chunk it receives is 10 times smaller (8k samples instead of 80k).

Let's track this for a future PR, see #180
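
The trade-off in this comment can be checked with some back-of-the-envelope arithmetic. The sketch below is only an assumption-laden model, not code from diart: `resample_workload` is a hypothetical helper, and it assumes 0.5 s input blocks, a 5 s window, a first chunk emitted once a full window is buffered, and one chunk per incoming block after that. The 16 kHz default is chosen to match the 80k and 8k figures above; the 10x ratio is the same at any rate.

```python
def resample_workload(
    total_seconds: float,
    source_rate: int = 16000,
    chunk_duration: float = 5.0,
    step_duration: float = 0.5,
) -> dict:
    """Count Resample calls and samples processed for both orderings."""
    block_samples = int(round(step_duration * source_rate))
    chunk_samples = int(round(chunk_duration * source_rate))
    n_blocks = int(total_seconds / step_duration)
    blocks_per_chunk = int(round(chunk_duration / step_duration))
    # Assumption: the first chunk appears once a full window is buffered,
    # then one new (overlapping) chunk is emitted per incoming block.
    n_chunks = max(0, n_blocks - blocks_per_chunk + 1)
    return {
        # Resample applied to each incoming block, before the rearrangement.
        "before": {
            "calls": n_blocks,
            "samples_per_call": block_samples,
            "total_samples": n_blocks * block_samples,
        },
        # Resample applied to each overlapping chunk, after the rearrangement.
        "after": {
            "calls": n_chunks,
            "samples_per_call": chunk_samples,
            "total_samples": n_chunks * chunk_samples,
        },
    }


stats = resample_workload(60.0)
```

Under these assumptions, one minute of audio gives 120 calls on 8,000 samples each when resampling first, against 111 calls on 80,000 samples each when resampling after the rearrangement, so slightly more calls but roughly nine times fewer samples processed overall.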

* Add automatic sample rate detection in MicrophoneAudioSource. Fix resampling crash.
* Replace block_size by block_duration in audio source constructors

juanmc2005 added the bug Something isn't working label May 21, 2023

juanmc2005 added this to the Version 0.8 milestone May 21, 2023

juanmc2005 marked this pull request as ready for review May 27, 2023 15:07

juanmc2005 commented Jun 5, 2023


juanmc2005 mentioned this pull request Sep 19, 2023

Windows 10 - Exits with no errors or results #149

Open

juanmc2005 added 2 commits October 11, 2023 16:12

Add automatic sample rate detection in MicrophoneAudioSource. Fix resampling crash.

d3f008e

Replace block_size by block_duration in audio source constructors

9fd258f

juanmc2005 force-pushed the fix/samplerate branch from c878998 to 9fd258f Compare October 11, 2023 14:12

juanmc2005 merged commit 410ab89 into develop Oct 11, 2023

juanmc2005 deleted the fix/samplerate branch October 11, 2023 14:13

juanmc2005 mentioned this pull request Oct 11, 2023

Crash when 16khz sampling is not supported by input device #152

Closed

juanmc2005 mentioned this pull request Oct 26, 2023

Version 0.8 #192

Merged

juanmc2005 added a commit that referenced this pull request Oct 28, 2023

Fix sample rate issues (#153)

8299b70

