Speech-to-Intent Benchmark

Made in Vancouver, Canada by Picovoice

This framework benchmarks the accuracy of Picovoice's Speech-to-Intent engine, Rhino. It compares the accuracy of Rhino with:

Results

Command acceptance rate is the probability of an engine correctly understanding the spoken command. Below is the summary:

The figure below depicts the engines' performance at each signal-to-noise ratio (SNR):
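To make the metric concrete, here is a minimal sketch of how a command acceptance rate could be computed per SNR. It is not taken from `src/bench.py`. The function name `acceptance_rate_by_snr` and the `(snr_db, expected, predicted)` record format are assumptions made for illustration, and the sample records are toy values, not benchmark results.

```python
from collections import defaultdict


def acceptance_rate_by_snr(results):
    """Return {snr_db: fraction of commands understood correctly}.

    `results` is an iterable of (snr_db, expected, predicted) tuples.
    This record format is hypothetical; `predicted` is None when the
    engine did not understand the command at all.
    """
    totals = defaultdict(int)
    accepted = defaultdict(int)
    for snr_db, expected, predicted in results:
        totals[snr_db] += 1
        # A command counts as accepted only if the inferred intent
        # matches the expected one exactly.
        if predicted == expected:
            accepted[snr_db] += 1
    return {snr_db: accepted[snr_db] / totals[snr_db] for snr_db in totals}


# Toy records for illustration only.
toy_results = [
    (6, "orderDrink", "orderDrink"),
    (6, "orderDrink", None),
    (24, "orderDrink", "orderDrink"),
    (24, "cancelOrder", "cancelOrder"),
]
rates = acceptance_rate_by_snr(toy_results)
```

Exact matching is the strictest possible scoring rule. A benchmark that also compares slot values would need a richer equality check than the one shown here.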

Data

The speech data are crowd-sourced from more than 50 unique speakers. Each speaker contributed about ten different utterances. In total, 619 commands are used in this benchmark. We test the engines in noisy conditions to simulate real-world situations. The noise samples are from Freesound.
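This section does not describe how the noise is mixed into the speech. One common approach, sketched below as an assumption and not as the benchmark's actual procedure, is to scale the noise so that the ratio of speech power to noise power matches a target SNR in decibels. The function name `mix_at_snr` and the plain-list sample format are hypothetical.

```python
import math


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech` so that the result has the given SNR in dB.

    Both inputs are sequences of float samples (an assumed format).
    The noise is truncated to the length of the speech.
    """
    if not speech:
        raise ValueError("speech must not be empty")
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]

    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    if noise_power == 0:
        raise ValueError("noise must not be silent")

    # SNR_dB = 10 * log10(P_speech / (gain^2 * P_noise)), solved for gain.
    gain = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


# Synthetic signals for illustration only.
speech = [1.0, -1.0] * 100
noise = [0.5, 0.5, -0.5, -0.5] * 50
mixed = mix_at_snr(speech, noise, snr_db=6)
```

A lower `snr_db` yields a larger noise gain, so sweeping it over several values produces progressively harder test conditions.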

How to Reproduce?

Clone the repository:

git clone https://github.com/Picovoice/speech-to-intent-benchmark.git

Get the usage message:

python3 src/bench.py --help

Then run the script for each engine.
