This repo contains my experiments with audio data processing and deep learning for speech recognition.
I will be working with some of the well-known datasets used in audio machine learning, as well as a few lesser-known sets. The following datasets will be used:
- TensorFlow Speech Recognition Challenge
- VOiCES
- LibriSpeech 960h
- Switchboard 300h
- Mozilla's Common Voice Dataset
- Speech Accent Archive
- Audio, Speech, and Vision Processing Lab Emotional Sound database (ASVP-ESD)
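
Most experiments on these datasets start from a time–frequency representation of the raw waveform. As a minimal sketch of that preprocessing step (using only NumPy, with a synthetic tone standing in for a real clip; the function name and parameters here are illustrative, not from this repo):

```python
import numpy as np

def spectrogram(y, n_fft=512, hop=128):
    """Magnitude spectrogram via a Hann-windowed short-time FFT."""
    window = np.hanning(n_fft)
    frames = [y[i:i + n_fft] * window
              for i in range(0, len(y) - n_fft + 1, hop)]
    # One row per frequency bin, one column per frame.
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Synthetic 1-second, 16 kHz 440 Hz tone standing in for a dataset clip.
sr = 16000
t = np.arange(sr) / sr
y = np.sin(2 * np.pi * 440.0 * t)

S = spectrogram(y)
print(S.shape)  # (n_fft // 2 + 1 frequency bins, number of frames)
```

In practice the notebooks would swap the synthetic tone for audio loaded from one of the datasets above and likely use mel-scaled features, but the windowed-FFT idea is the same.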
```
├── Data            <- CSV data and nested subfolders of audio data
│   └── ...
├── images          <- saved images from notebooks
│   └── ...
├── notebooks       <- more in-depth notebooks
│   ├── .ipynb      <-
│   ├── .ipynb      <-
│   ├── .ipynb      <-
│   └── .ipynb      <-
├── .gitattributes  <- specifies files for Git LFS to track
├── .gitignore      <- specifies files/directories for Git to ignore
└── README.md       <- top-level README
```