Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
-
Updated
Jun 9, 2021 - MATLAB
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
关于语音信号声源定位DOA估计所用的一些传统算法
Spectral Subtraction, Wiener Filtering, MMSE
Efficient voice activity detection algorithm using long-term speech information
ASPP: Binaural Speech Enhancement with Atomic Speech Presence Probability Estimation
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Python and Matlab code for segmentation of field recordings
An Automatic Centroid Tracking tool for analyzing vocal tract actions in MRI speech production data
A real-time analyzer to detect normal speech/abusive speech/noise
The FADE simulation framework adapted to predict SRTs for the Hurricance 2.0 challenge
Exemplary simulation toolchain combining FADE, TASCAR, and openMHA for aided speech recognition performance predictions in complex auditory scenes
Classifying sound signals as Links, Midden or Rechts using features computed using a Mel-Frequency filterbank, summing the power of the frequency-domain in the relevant filters. Dynamic Time Warping is used to find proper alignment between the unknown word and several labelled exemplars per word we are looking for. Then, k nearest neighbours tel…
This project is written by MATLAB R2020b for speech watermarking suitable for content authentication. Firstly, 4 folders are made by names of "original", "watermark", "extract" and "attack". Then 4 wav files are copied to "original" folder. Finally "final_1.m" can be run.
Digital Speech Recognition course with references and implementations for master's course of digital speech recognition at shahrood university of technology
Four utterances of 10 digits sampled at 8 kHz from each of 4 male speakers are provided. A template based digit recognition is developed with the help of k-means algorithm.
MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality.
Signal Processing design to improve SPEECH signal quality through spectrum by mainly using Filtration Techniques (IEEE Report & MATLAB code).
wSTMI: A speech intelligibility prediction algorithm for noisy and processed speech
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."