# wav2vec2

Here are 54 public repositories matching this topic...

aitor-alvarez / large-speech-models

Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper

whisper asr asr-model speech-recognition-model wav2vec2 arabic-speech-recognition large-speech-models finetuning-wav2vec finetuning-whisper

Updated Jan 23, 2024
Python

keshavbhandari / Audioneme

AI model for speech disorder detection

wav2vec2 speech-disorder speech-disorder-detection child-speech

Updated Sep 7, 2022
Python

nomnomnonono / SoundEffect-Search

Application to search for similar sound effects by voice and title.

python sound-effects machine-learning deep-learning scraping poetry pytorch artificial-intelligence gradio bert vector-search huggingface wav2vec2

Updated Apr 29, 2023
Python

seb5433 / wav2vec2-speaker-recognition

Speaker recognition using a wav2vec2 model.

speaker-recognition fine-tuning speaker-recognition-systems wav2vec2

Updated Apr 25, 2024
Python

viksit-siddhant / compare2023

SER and audio classification using a Wav2Vec2-based model, an ASR->BERT pipeline, and a multimodal late-fusion model.

transformers audio-classification bert asr speech-emotion-recognition multimodal wav2vec2

Updated Jul 4, 2023
Python

JingleCate / SpeechEmotionRecog

A simple Speech Emotion Recognition (SER) project based on Wav2Vec2.

audio classification wav2vec2

Updated Apr 22, 2024
Python

egorsmkv / wav2vec2-hidet

transformers pytorch wav2vec2 hidet

Updated Oct 4, 2023
Python

moncefbenaicha / SpokenNER

Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning

speech-recognition transfer-learning ner asr spoken-language-understanding wav2vec2 xlsr spoken-ner

Updated May 7, 2024
Python

wngh1187 / IPET

PyTorch implementation of "Integrated Parameter-Efficient Tuning for General-Purpose Audio Models"

transfer-learning wav2vec2 general-purpose-audio-model audio-spectrogram-transformer prompt-based-learning

Updated Sep 14, 2023
Python

appledora / wav2vec2_scripts

A modular codebase to process an audio dataset, generate a custom tokenizer, and fine-tune and run inference with a wav2vec2 model on a custom dataset.

end-to-end inference speech-to-text fine-tuning huggingface wav2vec2

Updated Nov 12, 2023
Python

dangrebenkin / wav2vec2_speech_markuper

Automatic generation of speech dataset markup using Wav2Vec2 ASR models

speech-recognition speech-to-text audio-segmentation forced-alignment wav2vec2

Updated Sep 20, 2023
Python

seanghay / kfa

A fast Khmer Forced Aligner powered by Wav2Vec2CTC and Phonetisaurus

alignment cambodia khmer forced-alignment wav2vec2

Updated May 2, 2024
Python

andrejanesic / Voice-Assistant

🤗 Voice assistant built in Python with NLP & wav2vec2.

python nlp wav2vec2

Updated Feb 14, 2023
Python

trinhtuanvubk / finetune-wav2vec2

docker speech-recognition fine-tuning ngram-language-model wav2vec2 wav2vec2-base-960h

Updated Jan 31, 2024
Python

ECNU-Cross-Innovation-Lab / ENT

[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition

automatic-speech-recognition speech-emotion-recognition wav2vec2

Updated Apr 11, 2024
Python

mead-ml / audio8

Deep audio modeling

audio deep-learning pytorch speech-recognition wav2vec wav2vec2

Updated Jan 27, 2022
Python

SanchezCris / SDR-Automatic-Speech-Recognition

FM signal capturing system and voice recognition for the assistance of individuals with hearing impairments.

python speech-recognition sdr automatic-speech-recognition speech-to-text gnuradio asr software-defined-radio wav2vec2

Updated Apr 17, 2023
Python

zhu00121 / Universal-representation-dynamics-of-deepfake-speech

This repo contains code used in the paper "Characterizing the temporal dynamics of universal speech representations for generalizable deepfake detection"

self-supervised deepfake-detection wav2vec2 wavlm modulation-transformation

Updated Oct 19, 2023
Python

Natalia-T / NeurIPS2021

ssl speech ast asr slu aer wav2vec wav2vec2 lebenchmark french-models

Updated Dec 11, 2021
Python

sotiriskar / audio-note

Python application for taking audio notes and creating meeting summaries.

python nlp machine-learning pytorch speech-to-text audio-processing huggingface wav2vec2

Updated Apr 2, 2023
Python
