Build software better, together

ddbourgin / numpy-ml

Machine learning, in numpy

machine-learning reinforcement-learning word2vec lstm neural-networks gaussian-mixture-models vae topic-modeling attention resnet bayesian-inference wavenet mfcc knn gaussian-processes hidden-markov-models gradient-boosting wgan-gp good-turing-smoothing

Updated Oct 29, 2023
Python

x4nth055 / emotion-recognition-using-speech

Star

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

machine-learning deep-learning sklearn keras recurrent-neural-networks feature-extraction neural-networks support-vector-machine mfcc librosa emotion-detection gradient-boosting emotion-recognition kneighborsclassifier random-forest-classifier mlp-classifier speech-emotion-recognition emotion-recognizer

Updated Nov 3, 2023
Python

SuperKogito / spafe

Sponsor

Star

🔉 spafe: Simplified Python Audio Features Extraction

Updated Jun 19, 2024
Python

gionanide / Speech_Signal_Processing_and_Classification

Star

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can…

nlp classifier natural-language-processing feature-extraction nltk gaussian-mixture-models support-vector-machines mfcc principal-component-analysis speech-processing linear-discriminant-analysis isomap spectral-clustering long-short-term-memory kernel-pca spectral-embedding locally-linear-embedding linear-prediction-coefficients speech-utterance

Updated Mar 3, 2023
Python

jsingh811 / pyAudioProcessing

Star

Audio feature extraction and classification

classifier audio-files feature-extraction audio-data mfcc hyperparameter-tuning wav-files classify mfcc-features mfcc-extractor classify-audio gfcc gfcc-features gfcc-extractor spectral-features chroma-features classifier-options classify-audio-samples pyaudioprocessing

Updated Jul 6, 2023
Python

SuperKogito / Voice-based-gender-recognition

Sponsor

Star

🔉 👦 👧Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)

data-science machine-learning scikit-learn voice speech gaussian-mixture-models signal gender-recognition gender gmm mfcc speaker gender-classification vocal gender-recognition-by-voice gender-detection mel-frequencies scikit-learn-python

Updated Jul 6, 2023
Python

sp-nitech / diffsptk

Star

A differentiable version of SPTK

python deep-learning signal-processing dsp pytorch lpc stft digital-signal-processing mfcc plp lsp cqt sptk mdct ddsp cepstrum pqmf

Updated Nov 14, 2024
Python

tympanix / subsync

Star

Synchronize your subtitles using machine learning

machine-learning neural-network delay subtitles subtitle fix mfcc shift subsync speech-detection shift-subtitle

Updated Sep 18, 2023
Python

GauravWaghmare / Speaker-Identification

Star

A program for automatic speaker identification using deep learning techniques.

keras mfcc speaker-recognition speaker-verification

Updated Feb 28, 2017
Python

MycroftAI / sonopy

Star

A simple audio feature extraction library

library sound spectrogram mfcc audio-processing mel-spectrogram

Updated Jul 3, 2019
Python

ZitengWang / python_kaldi_features

Star

python codes to extract MFCC and FBANK speech features for Kaldi

kaldi mfcc

Updated Nov 28, 2018
Python

georgid / AlignmentDuration

Star

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

python music duration synchronization research deep-learning signal-processing lyrics decoding music-information-retrieval neural-networks alignment hidden-markov-model gmm mfcc upf htk

Updated Mar 9, 2020
Python

k-farruh / speech-accent-detection

Star

The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines accent based audio record. The result of the model could be used to determine accents and help decrease accents to English learning students and improve accents by training.

mfcc accent accent-detection native-speakers english-languages

Updated Nov 9, 2021
Python

SuperKogito / Voice-based-speaker-identification

Sponsor

Star

🔉 👦 👧 👩 👨 Speaker identification using voice MFCCs and GMM

machine-learning scikit-learn voice speech gaussian-mixture-models signal gmm mfcc speaker-recognition vocal mel-frequencies speaker-identification mel-frequency-cepstral-coefficients scikit-learn-python

Updated Dec 13, 2020
Python

supikiti / PNCC

Star

A implementation of Power Normalized Cepstral Coefficients: PNCC

deep-learning speech-recognition mfcc speech-processing robustness pncc speech-enhancement

Updated Aug 11, 2019
Python

zhengyima / DTW_Digital_Voice_Recognition

Star

基于DTW与MFCC特征进行数字0-9的语音识别，DTW，MFCC，语音识别，中英数据，端点检测，Digital Voice Recognition。

machine-learning dtw voice-recognition dynamic-programming digital-signal-processing mfcc

Updated Jul 29, 2021
Python

skaws2003 / pytorch-mfcc

Star

A pytorch implementation of MFCC.

pytorch mfcc

Updated Jun 1, 2022
Python

linksense / ConvolutionaNeuralNetworksToEnhanceCodedSpeech

Star

In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…

keras cnn generative-adversarial-network gan post-processing mfcc speech-processing wasserstein-gan speech-enhancement speech-reconstruction 1d-convolution

Updated Mar 8, 2020
Python

nipunmanral / Spoken-Language-Identification

Star

Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features

deep-learning keras lstm gru neural-networks mfcc language-identification

Updated Aug 2, 2024
Python

LexicalStressDetection / lexical-stress-detection

Star

Deep Learning model for lexical stress detection in spoken English

data-science deep-learning pronunciation speech-recognition mfcc convolutional-neural-network english-learning mel-frequency-cepstral-coefficients pronunciation-scoring lexical-stress-detection

Updated Mar 17, 2020
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mfcc

Here are 106 public repositories matching this topic...

ddbourgin / numpy-ml

x4nth055 / emotion-recognition-using-speech

SuperKogito / spafe

gionanide / Speech_Signal_Processing_and_Classification

jsingh811 / pyAudioProcessing

SuperKogito / Voice-based-gender-recognition

sp-nitech / diffsptk

tympanix / subsync

GauravWaghmare / Speaker-Identification

MycroftAI / sonopy

ZitengWang / python_kaldi_features

georgid / AlignmentDuration

k-farruh / speech-accent-detection

SuperKogito / Voice-based-speaker-identification

supikiti / PNCC

zhengyima / DTW_Digital_Voice_Recognition

skaws2003 / pytorch-mfcc

linksense / ConvolutionaNeuralNetworksToEnhanceCodedSpeech

nipunmanral / Spoken-Language-Identification

LexicalStressDetection / lexical-stress-detection

Improve this page

Add this topic to your repo