py-webrtcvad

This is a Python interface to the WebRTC Voice Activity Detector (VAD). It is compatible with Python 2 and Python 3.

A VAD classifies a piece of audio data as being voiced or unvoiced. It can be useful for telephony and speech recognition.

The VAD that Google developed for the WebRTC project is reportedly one of the best available, being fast, modern and free.

How to use it

Install the webrtcvad module:
```
pip install webrtcvad
```
Create a Vad object:
```
import webrtcvad
vad = webrtcvad.Vad()
```
Optionally, set its aggressiveness mode, which is an integer between 0 and 3. 0 is the least aggressive about filtering out non-speech, 3 is the most aggressive. You can also set the mode when you create the VAD, e.g. vad = webrtcvad.Vad(3). To set it on an existing Vad object:
```
vad.set_mode(1)
```

Give it a short segment ("frame") of audio. The WebRTC VAD only accepts 16-bit mono PCM audio, sampled at 8000, 16000, 32000 or 48000 Hz. A frame must be either 10, 20, or 30 ms in duration:

    # Run the VAD on 10 ms of silence. The result should be False.
    sample_rate = 16000
    frame_duration = 10  # ms
    # 16-bit mono PCM uses 2 bytes per sample.
    frame = b'\x00\x00' * int(sample_rate * frame_duration / 1000)
    print('Contains speech: %s' % vad.is_speech(frame, sample_rate))
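
The frame-size arithmetic above can be wrapped in a small helper. The following is a minimal sketch and is not part of webrtcvad: the function name `frame_byte_length` and the constants are hypothetical, chosen for illustration. It only encodes the constraints stated above (16-bit mono PCM, the four supported sample rates, and 10, 20, or 30 ms frames).

```python
# Hypothetical helper (not part of webrtcvad): compute how many bytes
# one frame of 16-bit mono PCM occupies, checking the documented limits.
VALID_SAMPLE_RATES = (8000, 16000, 32000, 48000)  # Hz
VALID_FRAME_DURATIONS = (10, 20, 30)  # ms
BYTES_PER_SAMPLE = 2  # 16-bit samples, one channel


def frame_byte_length(sample_rate, frame_duration_ms):
    """Return the size in bytes of one frame, or raise ValueError."""
    if sample_rate not in VALID_SAMPLE_RATES:
        raise ValueError('unsupported sample rate: %r' % (sample_rate,))
    if frame_duration_ms not in VALID_FRAME_DURATIONS:
        raise ValueError('unsupported frame duration: %r' % (frame_duration_ms,))
    samples = sample_rate * frame_duration_ms // 1000
    return samples * BYTES_PER_SAMPLE


# 10 ms of silence at 16000 Hz: 160 samples, 320 bytes.
silence = b'\x00' * frame_byte_length(16000, 10)
print(len(silence))
```

A buffer built this way can be passed to vad.is_speech(silence, 16000) in the same way as the frame in the example above.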

See example.py for a more detailed example that will process a .wav file, find the voiced segments, and write each one as a separate .wav.
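
As a rough illustration of the idea of finding voiced segments, the sketch below splits a PCM buffer into fixed-size frames and groups consecutive voiced frames into runs. It is a simplified stand-in, not the method used in example.py. The names `split_into_frames`, `voiced_runs`, and `fake_is_speech` are hypothetical, and the classifier is a placeholder so that the block runs without webrtcvad installed.

```python
def split_into_frames(pcm, frame_bytes):
    """Yield consecutive frames of frame_bytes bytes; drop any partial tail."""
    for start in range(0, len(pcm) - frame_bytes + 1, frame_bytes):
        yield pcm[start:start + frame_bytes]


def voiced_runs(frames, is_speech):
    """Return (first_index, last_index) pairs for runs of voiced frames."""
    runs = []
    run_start = None
    index = -1
    for index, frame in enumerate(frames):
        if is_speech(frame):
            if run_start is None:
                run_start = index  # a new voiced run begins here
        elif run_start is not None:
            runs.append((run_start, index - 1))  # the run ended one frame ago
            run_start = None
    if run_start is not None:
        runs.append((run_start, index))  # the buffer ended inside a run
    return runs


def fake_is_speech(frame):
    """Placeholder classifier (assumption): any non-zero byte counts as voiced."""
    return any(bytearray(frame))


frame_bytes = 320  # 10 ms of 16-bit mono PCM at 16000 Hz
silent = b'\x00' * frame_bytes
loud = b'\x01\x00' * (frame_bytes // 2)
# Two silent frames, three "voiced" frames, one silent frame, 5 stray bytes.
pcm = silent * 2 + loud * 3 + silent + b'\x00' * 5
frames = list(split_into_frames(pcm, frame_bytes))
print(voiced_runs(frames, fake_is_speech))
```

With the real detector, the placeholder could be replaced by a callable such as lambda f: vad.is_speech(f, 16000). Real audio usually needs some smoothing around segment boundaries, which this sketch leaves out.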

How to run unit tests

To run unit tests:

    pip install -e ".[dev]"
    python setup.py test

History

2.0.10

Fixed memory leak. Thank you, bond005!

2.0.9

Improved example code. Added WebRTC license.

2.0.8

Fixed Windows compilation errors. Thank you, xiongyihui!
