Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition

Introduction

The Code Repository for "Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition"

Getting Started

Environments

The codebase is developed with pytorch == 1.8.1, torch-lightning == 1.5.9 Install requirements as follows:

pip install -r requirements.txt

Download and Processing Datasets

config.py

change the varible "dataset_path" to your audioset address
change the variable "desed_folder" to your DESED address
change the classes_num to 527

AudioSet

./create_index.sh # 
// remember to change the pathes in the script
// more information about this script is in https://github.com/qiuqiangkong/audioset_tagging_cnn

python main.py save_idc 
// count the number of samples in each class and save the npy files

ESC-50

Open the jupyter notebook at esc-50/prep_esc50.ipynb and process it

Speech Command V2

Open the jupyter notebook at scv2/prep_scv2.ipynb and process it

DESED Dataset

python conver_desed.py 
// will produce the npy data files

Set the Configuration File: config.py

The script config.py contains all configurations you need to assign to run your code. Please read the introduction comments in the file and change your settings.

Training

TBD

Results

TBD

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
__init__.py		__init__.py
bal_idc.npy		bal_idc.npy
class_hier.json		class_hier.json
class_hier_map.npy		class_hier_map.npy
class_label_indice.csv		class_label_indice.csv
config.py		config.py
config_backup.py		config_backup.py
config_patch_size4.py		config_patch_size4.py
convert_desed.py		convert_desed.py
data_generator.py		data_generator.py
environment.yml		environment.yml
eval_idc.npy		eval_idc.npy
fl_evaluate.py		fl_evaluate.py
full_train_idc.npy		full_train_idc.npy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition

Introduction

Getting Started

Environments

Download and Processing Datasets

Set the Configuration File: config.py

Training

Results

About

Releases

Packages

Languages

diggerdu/AudioMamba

Folders and files

Latest commit

History

Repository files navigation

Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition

Introduction

Getting Started

Environments

Download and Processing Datasets

Set the Configuration File: config.py

Training

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages