Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
-
Updated
Feb 2, 2024 - HTML
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
Rhasspy voice assistant for offline home automation
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Real-time transcription using faster-whisper
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @udacity.
Pytorch implementation of subband decomposition
StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.
Python platform for working with LLMs
VietGPT VoiceBot: Chatbot automatically recognizes Vietnamese voice and uses the ChatGPT API for natural language interaction.
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
A MATLAB implementation of CHiME4 baseline Beamformit
webpage for maintaining the list of openly available DL, ML, RL, Vision, NLP, Optimization courses
A mobile web application that helps you convert spoken words to sharable/editable text 🎊
This App allows users to convert their speech into text and send that text as a message. It records blobs in realtime! After every 10 seconds recorded blob is sent to server and there it is converted into text and send as a message to other user.
Babelin Speach, for voice recognition and real-time translation, services offered by web browsers
UrSR: Urbit Speech Recognition
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."