speech-recognition

Here are 4,633 public repositories matching this topic...

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Updated Jun 2, 2024
Python

inboxpraveen / LLM-Minutes-of-Meeting

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀

python nlp natural-language-processing web translation transformers web-application speech-recognition speech-to-text whisper meeting-minutes webapplication minutes-of-meeting huggingface huggingface-transformers wav2vec2 llm whisper-ai llm-inference

Updated Jun 2, 2024
CSS

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Jun 2, 2024
Python

thevickypedia / Jarvis

Fully Functional Voice Based Natural Language UI

Updated Jun 2, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 2, 2024
Python

compulim / web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

text-to-speech azure speech-synthesis speech-recognition speech-to-text cognitive-services

Updated Jun 2, 2024
JavaScript

matthiasn / lotti

Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.

windows macos ios journal health speech-recognition time-tracker speech-to-text android-app flutter linux-app fitness-app

Updated Jun 1, 2024
Dart

openvinotoolkit / openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Jun 1, 2024
C++

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated Jun 1, 2024
Python

Aunmag / speech-trainer

An automated speech trainer. Beeps a sound every time you pronounce an unwanted word

application app speech-recognition speech-to-text

Updated Jun 1, 2024
Python

botbahlul / crx-live-translate

Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE!

javascript chrome edge voice-recognition speech-recognition browser-extension speech-to-text google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated Jun 1, 2024
JavaScript

botbahlul / js-live-audio-video-translate

HTML Web template that can RECOGNIZE any live audio/video streaming (using Chrome webkitSpeechRecognition API) then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE

javascript html web voice-recognition speech-recognition google-translate web-template google-translate-api webkitspeechrecognition auto-caption auto-subtitle webkit-speech-recognition

Updated Jun 1, 2024
JavaScript

savbell / whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

speech-recognition openai speech-to-text dictation whisper typing-assistant openai-api openai-whisper faster-whisper

Updated Jun 1, 2024
Python

Chenyme / Chenyme-AAVT

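A fully automatic (audio) video translation project. It uses Whisper to recognize speech, a large AI model to translate the subtitles, and then merges the subtitles with the video to produce the translated video.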

speech-recognition whisper video-translation gpt-4 faster-whisper gpt-4o

Updated Jun 1, 2024
Python

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated Jun 1, 2024
Python

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

golang ui web ai subtitles webapp speech-recognition speech-to-text transcription stt whisper audio-to-text sveltekit web-whisper

Updated Jun 1, 2024
Svelte

Sritam-K-Behera / SER-WebApp

This project implements a Speech Emotion Recognition (SER) model using TensorFlow Lite, designed for deployment on microcontrollers such as the Arduino Nano 33 BLE. The model is trained on the RAVDESS dataset and can recognize seven emotions: Angry, Disgust, Fear, Happy, Neutral, Sad, and Surprise.

machine-learning speech-recognition streamlit

Updated Jun 1, 2024
Jupyter Notebook

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jun 1, 2024
Python

NVIDIA / DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

nlp translation computer-vision deep-learning mxnet tensorflow pytorch speech-synthesis speech-recognition forecasting drug-discovery recommender-systems paddlepaddle tensorflow2 large-language-models

Updated Jun 1, 2024
Jupyter Notebook

mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Jun 1, 2024
C++
