The Code Repository for "Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition"
The codebase is developed with pytorch == 1.8.1 and torch-lightning == 1.5.9. Install the requirements as follows:
pip install -r requirements.txt
- config.py
change the variable "dataset_path" to your AudioSet path
change the variable "desed_folder" to your DESED path
change the classes_num to 527
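As a rough illustration of the three settings above, the relevant part of config.py might look like the fragment below. The paths are hypothetical placeholders, not values from this repository; only the variable names and the value 527 come from the instructions above.

```python
# Illustrative fragment of config.py (paths are placeholders, replace with your own).
dataset_path = "/path/to/audioset"   # assumed location of your AudioSet data
desed_folder = "/path/to/desed"      # assumed location of your DESED data
classes_num = 527                    # number of AudioSet classes, as stated above
```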
./create_index.sh
// remember to change the paths in the script
// more information about this script is in https://github.com/qiuqiangkong/audioset_tagging_cnn
python main.py save_idc
// count the number of samples in each class and save the npy files
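To make the counting step more concrete, here is a minimal sketch of how per-class sample counts and per-class sample indices can be computed from a multi-hot target matrix. It is not the implementation behind "save_idc"; the function names and the assumption that targets are stored as a (num_samples, classes_num) array are ours, for illustration only.

```python
import numpy as np

def count_samples_per_class(targets):
    """Return the number of samples that carry each class label.

    `targets` is assumed to be a multi-hot array of shape
    (num_samples, classes_num) with entries 0 or 1.
    """
    targets = np.asarray(targets)
    return targets.sum(axis=0).astype(np.int64)

def indices_per_class(targets):
    """Return, for each class, the indices of the samples labelled with it."""
    targets = np.asarray(targets)
    return [np.flatnonzero(targets[:, c]) for c in range(targets.shape[1])]
```

The resulting count array could then be written to disk with np.save, which matches the kind of npy output mentioned above.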
Open the Jupyter notebook at esc-50/prep_esc50.ipynb and run it
Open the Jupyter notebook at scv2/prep_scv2.ipynb and run it
python conver_desed.py
// will produce the npy data files
The script config.py contains all the settings you need to run the code. Please read the introductory comments in the file and adjust the settings to your setup.
TBD
TBD