Skip to content

corticph/bewer

Repository files navigation

BeWER

Beyond Word Error Rate → BeWER (/ˈbiːvər/) 🦫

Python Versions Coverage License

⚠️ Important: This project is not production ready and is still in early development. Breaking changes may occur, and backwards compatibility between alpha versions is not guaranteed.

BeWER is an evaluation and analysis framework for automatic speech recognition in Python. It defines a transparent YAML-based approach for configuring evaluation pipelines and makes it easy to inspect and analyze individual examples through a web-based interface. The built-in preprocessing pipeline and metrics collection are designed to cover all conventional use cases and then some, while still being fully extensible.

Contents | Installation | Quickstart |

pip install bewer

Quickstart

Create a Dataset

from bewer import Dataset

dataset = Dataset()

Add data

From a file:

dataset.load_csv(
    "data.csv",
    ref_col="reference",
    hyp_col="hypothesis",
)

Or manually:

for ref, hyp in iterator:
    dataset.add(ref=ref, hyp=hyp)

List available metrics

dataset.metrics.list_metrics()

Compute metrics lazily

print(f"WER: {dataset.metrics.wer().value:.2%}")

About

Evaluation and analysis framework for automatic speech recognition.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors