sherpa

sherpa is an open-source speech-text-text inference framework using PyTorch, focusing exclusively on end-to-end (E2E) models, namely transducer- and CTC-based models. It provides both C++ and Python APIs.

This project focuses on deployment, i.e., using pre-trained models to transcribe speech. If you are interested in how to train or fine-tune your own models, please refer to icefall.

We also have other similar projects that don't depend on PyTorch:

sherpa-onnx and sherpa-ncnn also support iOS, Android and embedded systems.

Installation and Usage

Please refer to the documentation at https://k2-fsa.github.io/sherpa/

Try it in your browser

Try sherpa from within your browser without installing anything: https://huggingface.co/spaces/k2-fsa/automatic-speech-recognition

Name		Name	Last commit message	Last commit date
Latest commit History 445 Commits
.github		.github
cmake		cmake
docs		docs
scripts		scripts
sherpa		sherpa
triton		triton
.clang-format		.clang-format
.flake8		.flake8
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
CMakeLists.txt		CMakeLists.txt
Dockerfile		Dockerfile
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
__init__.py		__init__.py
get_version.py		get_version.py
requirements.txt		requirements.txt
setup.py		setup.py

License

k2-fsa/sherpa

Folders and files

Latest commit

History

Repository files navigation

sherpa

Installation and Usage

Try it in your browser

About

Topics

Resources

License

Stars

Watchers

Forks

Languages