
Revisiting Random Weight Perturbation for Efficiently Improving Generalization

This code is the official implementation of our TMLR 2024 paper Revisiting Random Weight Perturbation for Efficiently Improving Generalization. A short version appeared at the NeurIPS 2023 Workshop on Optimization for Machine Learning.

In this work, we enhance the generalization performance of random weight perturbation (RWP) from the perspectives of convergence and perturbation generation, and show that it can improve generalization more efficiently than the adversarial weight perturbation used in SAM, with comparable or even better performance.

Figure: Illustration of F-SAM

Dependencies

Install required dependencies:

pip install -r requirements.txt

How to run

Sample usage is provided in run_rwp.sh, run_mrwp.sh, and run_mrwp_ddp.sh.

For the 1x computational cost version, which incorporates our adaptive perturbation generation, run

bash run_rwp.sh
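
For reference, below is a minimal PyTorch-style sketch of one 1x-cost RWP step: it perturbs the weights with Gaussian noise, takes a single gradient step at the perturbed point, and restores the weights. The element-wise scaling of the noise by the parameter magnitude and the names (rwp_step, sigma) are illustrative assumptions, not the exact adaptive generation rule implemented in this repository.

import torch

def rwp_step(model, images, targets, criterion, optimizer, sigma=0.01):
    optimizer.zero_grad()
    # Perturb the weights with Gaussian noise scaled by the parameter magnitude
    # (a placeholder for the adaptive generation rule), remembering the noise.
    noise = []
    with torch.no_grad():
        for p in model.parameters():
            eps = sigma * p.abs() * torch.randn_like(p)
            p.add_(eps)
            noise.append(eps)
    # Single forward/backward pass at the perturbed point, the same cost as SGD.
    criterion(model(images), targets).backward()
    # Restore the original weights before applying the update.
    with torch.no_grad():
        for p, eps in zip(model.parameters(), noise):
            p.sub_(eps)
    optimizer.step()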

For the 2x computational cost version, which incorporates both our mixing loss objective and adaptive perturbation generation, run

bash run_mrwp.sh
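
A corresponding sketch of one mixed-loss step is given below: it combines the gradient of the clean loss with the gradient of the loss at randomly perturbed weights, L_m(w) = (1 - lambda) * L(w) + lambda * L(w + eps), at roughly twice the per-step cost. Again, the noise scaling and the names (mixed_rwp_step, lambda_mix) are illustrative assumptions rather than the repository's exact implementation.

import torch

def mixed_rwp_step(model, images, targets, criterion, optimizer,
                   sigma=0.01, lambda_mix=0.5):
    optimizer.zero_grad()
    # Gradient of the clean loss, weighted by (1 - lambda).
    ((1.0 - lambda_mix) * criterion(model(images), targets)).backward()
    # Perturb the weights (placeholder scaling, as above) and accumulate the
    # gradient of the perturbed loss, weighted by lambda.
    noise = []
    with torch.no_grad():
        for p in model.parameters():
            eps = sigma * p.abs() * torch.randn_like(p)
            p.add_(eps)
            noise.append(eps)
    (lambda_mix * criterion(model(images), targets)).backward()
    # Restore the weights and apply the combined update.
    with torch.no_grad():
        for p, eps in zip(model.parameters(), noise):
            p.sub_(eps)
    optimizer.step()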

If you have more than one GPU, training can be parallelized with

bash run_mrwp_ddp.sh
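
Because the two gradients in the mixed loss are independent, they can be computed on separate GPUs and merged with a single gradient reduction, so the 2x-cost step takes roughly the wall-clock time of one pass. The sketch below illustrates the idea with torch.distributed in a two-process setup; the rank assignment and names are assumptions and not necessarily how run_mrwp_ddp.sh organizes the computation.

import torch
import torch.distributed as dist

def mixed_rwp_step_ddp(model, images, targets, criterion, optimizer,
                       sigma=0.01, lambda_mix=0.5):
    # Assumes two processes with synchronized model copies: rank 0 handles the
    # clean loss, rank 1 the perturbed loss (placeholder noise scaling as above).
    rank = dist.get_rank()
    optimizer.zero_grad()
    noise = []
    if rank == 1:
        with torch.no_grad():
            for p in model.parameters():
                eps = sigma * p.abs() * torch.randn_like(p)
                p.add_(eps)
                noise.append(eps)
    weight = lambda_mix if rank == 1 else (1.0 - lambda_mix)
    (weight * criterion(model(images), targets)).backward()
    if rank == 1:
        with torch.no_grad():
            for p, eps in zip(model.parameters(), noise):
                p.sub_(eps)
    # Sum the weighted gradients across ranks: (1 - lambda) * g_clean + lambda * g_perturbed.
    for p in model.parameters():
        if p.grad is not None:
            dist.all_reduce(p.grad, op=dist.ReduceOp.SUM)
    optimizer.step()

After the reduction both ranks hold the same mixed gradient, so identical optimizer steps keep the two model copies in sync.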

Citation

If you find this work helpful, please cite:

@article{li2024revisiting,
  title={Revisiting Random Weight Perturbation for Efficiently Improving Generalization},
  author={Li, Tao and Tao, Qinghua and Yan, Weihao and Lei, Zehao and Wu, Yingwen and Fang, Kun and He, Mingzhen and Huang, Xiaolin},
  journal={Transactions on Machine Learning Research (TMLR)},
  year={2024}
}
