Sound Separation to Improve Sound Classification

Software requirements:

librosa, matplotlib, numpy, pandas, sklearn. To download the dependencies: pip3 install -r requirements.txt
MATLAB 2020b.

Dataset:

Audio: The raw audio selected from the FSDnoisy18K dataset for evaluating the proposed method. Download
One_STFT: This dataset consists of 20 classes where each audio per class is extracted to only 1 Short-Time Fourier Transform (STFT). Download
Separate_STFT_addNoise_Class: This dataset consists of 21 classes that are 20 classes selected from the original data and additional noisy class. Each audio per class is separated into multiple STFT frames and manually labelled as clean label (the original label of class) and noisy label (that is merged to construct the noisy class). Download

Note: All above datasets need to download and extract to the data folder.

Usage:

First, clone the repository locally:

git clone https://github.com/nhattruongpham/soundSepsound.git

Run TransferLearning.mlx to reproduce the experiments and results with pre-trained CNNs.
Run ReproduceFSDNoisy18k.m to reproduce the experiment and result with the proposed network in [1].
Run SFTF_Extractor.ipynb to extract STFT features (if any).

Citation

If you use this code/data or part of it, please cite the following paper:

@inproceedings{tran2021separate,
  title={Separate Sound into STFT Frames to Eliminate Sound Noise Frames in Sound Classification},
  author={Tran, Thanh and Huy, Kien Bui and Pham, Nhat Truong and Carrat{\`u}, Marco and Liguori, Consolatina and Lundgren, Jan},
  booktitle={2021 IEEE Symposium Series on Computational Intelligence (SSCI)},
  pages={1--7},
  year={2021},
  organization={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ReproduceFSDNoisy18k.m		ReproduceFSDNoisy18k.m
SFTF_Extractor.ipynb		SFTF_Extractor.ipynb
TransferLearning.mlx		TransferLearning.mlx
createLgraphUsingConnections.m		createLgraphUsingConnections.m
findLayersToReplace.m		findLayersToReplace.m
freezeWeights.m		freezeWeights.m
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

ReproduceFSDNoisy18k.m

ReproduceFSDNoisy18k.m

SFTF_Extractor.ipynb

SFTF_Extractor.ipynb

TransferLearning.mlx

TransferLearning.mlx

createLgraphUsingConnections.m

createLgraphUsingConnections.m

findLayersToReplace.m

findLayersToReplace.m

freezeWeights.m

freezeWeights.m

requirements.txt

requirements.txt

Repository files navigation

Sound Separation to Improve Sound Classification

Software requirements:

Dataset:

Note: All above datasets need to download and extract to the data folder.

Usage:

Citation

About

Releases

Packages

Languages

License

nhattruongpham/soundSepsound

Folders and files

Latest commit

History

Repository files navigation

Sound Separation to Improve Sound Classification

Software requirements:

Dataset:

Note: All above datasets need to download and extract to the data folder.

Usage:

Citation

About

Resources

License

Stars

Watchers

Forks

Languages