Skip to content

Solvve/ml_speech2text_voice_denoiser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Speech2text & Denoise

License Python 3.7 scikit-learn 0.23.2 torch 0.23.2 Solvve

Description

Speech to text & Denoiser using Wav2Vec pretrained model. Denoiser using Dual-signal Transformation LSTM Network. Fine-Tune Wav2Vec2 model

We follow the next steps:

  1. Data preparation
  2. Data preprocessing
  3. Modeling with Wav2Vec2 model
  4. Modeling after denoise
  5. Fine-tune Wav2Vec multi-language ASR

From Wec2Vec2_Denoise.ipynb:

Levenshtein metrics Mean Median
Word Error Rate 0.26 0.20
Match Error Rate 0.25 0.2
Word Information Lost 0.40 0.36

Releases

No releases published

Packages

No packages published