It's typescript based VAD that uses silero ai VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.
-
Updated
Aug 8, 2024 - TypeScript
It's typescript based VAD that uses silero ai VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.
Silero TTS Enhanced is a Python library that enhances the original Silero TTS project, providing a convenient way to synthesize speech from text using Silero TTS models. It offers a user-friendly interface for both standalone script usage and integration into Python projects, along with additional features
A web-app/library for transcribing speech
In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.
Automatically cuts out parts without speech from given video, making it shorter and more enjoyable to watch (look examples). Usage on google.collab in several clicks.
This is a simple server that uses Silero models to convert text to audio files over HTTP
Silero TTS backend service. Can be used with Home Assistant and Rhasspy.
Telegram bot with voice message recognition and generation. Speech to Text and Text to Speech
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Add a description, image, and links to the silero topic page so that developers can more easily learn about it.
To associate your repository with the silero topic, visit your repo's landing page and select "manage topics."