GTCRN

This repository is the official implementation of paper "GTCRN: A Speech Enhancement Model Requiring Ultralow Computational Resources". The paper has been accepted by ICASSP 2024.

Audio examples are available at Audio examples of GTCRN.

About GTCRN

Grouped Temporal Convolutional Recurrent Network (GTCRN) is a speech enhancement model requiring ultralow computational resources, featuring only 23.7 K parameters and 39.6 MFLOPs. Experimental results show that our proposed model not only surpasses RNNoise, a typical lightweight model with similar computational burden, but also achieves competitive performance when compared to recent baseline models with significantly higher computational resources requirements.

Pre-trained Models

Pre-trained models are provided in checkpoints, which were trained on DNS3 and VCTK-DEMAND datasets, respectively.

The inference procedure is presented in infer.py.

Related Repositories

SEtrain: A training code template for DNN-based speech enhancement.

TRT-SE: An example of how to convert a speech enhancement model into a streaming format and deploy it using ONNX or TensorRT.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
checkpoints		checkpoints
test_wavs		test_wavs
README.md		README.md
gtcrn.py		gtcrn.py
infer.py		infer.py
loss.py		loss.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

checkpoints

checkpoints

test_wavs

test_wavs

README.md

README.md

gtcrn.py

gtcrn.py

infer.py

infer.py

loss.py

loss.py

requirements.txt

requirements.txt

Repository files navigation

GTCRN

About GTCRN

Pre-trained Models

Related Repositories

About

Releases

Packages

Languages

SherryYu33/gtcrn

Folders and files

Latest commit

History

Repository files navigation

GTCRN

About GTCRN

Pre-trained Models

Related Repositories

About

Resources

Stars

Watchers

Forks

Languages