easyvad

Extremely simple implementation of a voice activity detection algorithm.

How does it work?

Basics

easyvad is a quick implementation of the algorithm described by the paper "Approach for Energy-Based Voice Detector with Adaptive Scaling Factor". It assumes that the audio is packetized, analyzes each packet and decides whether it is a relevant voice packet or a noise packet.

Implementation details

The function is built to work with 8-bit, signed int encoded samples, meaning that each audio sample can range in values from -128 to 127. This is easily modifiable by changing the int8_t variable type to intX_t, with X being the size of each sample. Since the algorithm uses packet signal energy as its sole analyzed quantity, the software can also work for unsigned (positive only) samples, as signal energy is a purely positive quantity. To use unsigned samples, simply add a "u" before the variable type, e.g. uint8_t.

Conclusion

This software is still in development, major tweaks are to be implemented in order for it to be easily usable.

Todo

Wrap external function values in a struct
Add how-to-use guide

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
vad.c		vad.c
vad.h		vad.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

easyvad

How does it work?

Basics

Implementation details

Conclusion

Todo

About

Releases

Packages

Languages

License

cirosilvano/easyvad

Folders and files

Latest commit

History

Repository files navigation

easyvad

How does it work?

Basics

Implementation details

Conclusion

Todo

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages