LSTM-RNN Voice Activity Detection

REQUIRED PACKAGES

numpy, tensorflow, librosa, matplotlib

FILES

- dataset_utils.py
Dataset-related utilities: one-hot encoding, WAV file normalisation, TRS-to-CSV conversion, JSON-to-CSV conversion, YouTube WAV download for the Google AudioSet corpus, and LIBLINEAR data transformations

- metrics_utils.py
(NOT FINALISED) Metrics-related utilities for the baseline VAD methods

- feature_extractor.py
Feature extraction class for MFCCs, deltas, double deltas, and RSE

- vad_model.py
LSTM-RNN TensorFlow learning model

- _main_.py
The program's main entry point

- /checkpoint
TensorFlow checkpoint directory for saving and restoring learning models

- /parameters
LSTM-RNN learning model hyper-parameters, training parameters, and log/checkpoint directory names

- /notebook
Jupyter notebooks to test initial VAD prototypes
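
The one-hot encoding and WAV normalisation utilities listed for dataset_utils.py could look roughly like the sketch below. The function names `one_hot` and `peak_normalise`, the two-class labelling (0 = non-speech, 1 = speech), and the choice of peak normalisation are assumptions made for illustration; the repository's own helpers may differ.

```python
import numpy as np


def one_hot(labels, num_classes):
    """Return a (len(labels), num_classes) one-hot matrix for integer labels."""
    labels = np.asarray(labels, dtype=np.int64)
    encoded = np.zeros((labels.size, num_classes), dtype=np.float32)
    encoded[np.arange(labels.size), labels] = 1.0
    return encoded


def peak_normalise(samples, eps=1e-9):
    """Scale a waveform so that its largest absolute sample is 1.0."""
    samples = np.asarray(samples, dtype=np.float32)
    peak = np.max(np.abs(samples))
    # Leave an all-zero (silent) signal unchanged instead of dividing by zero.
    return samples if peak < eps else samples / peak


# Frame labels: 0 = non-speech, 1 = speech (an assumed labelling scheme).
frame_labels = [0, 1, 1, 0]
targets = one_hot(frame_labels, num_classes=2)
waveform = peak_normalise([0.1, -0.5, 0.25])
```

Here `targets` holds one row per frame and `waveform` is rescaled so that its loudest sample has magnitude 1.0.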
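
For orientation, a frame-level LSTM-RNN classifier of the kind vad_model.py describes might be sketched with `tf.keras` as below. The use of the Keras API, the single 64-unit LSTM layer, the 39-dimensional input, and the two-class softmax output are assumptions chosen for illustration; the actual architecture and hyper-parameters are defined by the repository's model and parameter files.

```python
import numpy as np
import tensorflow as tf


def build_vad_model(num_features, lstm_units=64):
    """Frame-level speech/non-speech classifier over variable-length sequences."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=(None, num_features)),
        # return_sequences=True keeps one output per input frame.
        tf.keras.layers.LSTM(lstm_units, return_sequences=True),
        tf.keras.layers.Dense(2, activation="softmax"),
    ])


model = build_vad_model(num_features=39)
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])

# Random data standing in for a batch of 4 utterances, 100 frames each.
batch = np.random.rand(4, 100, 39).astype(np.float32)
probabilities = model.predict(batch, verbose=0)
```

The output has one speech/non-speech probability pair per frame, so it can be compared directly against one-hot frame labels during training.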
