Recon [Paper Link]

Recon: Reducing Conflicting Gradients from the Root for Multi-Task Learning

Guangyuan Shi, Qimai Li, Wenlong Zhang, Jiaxin Chen and Xiao-Ming Wu

BibTeX

@article{shi2023recon,
  title={Recon: Reducing Conflicting Gradients from the Root for Multi-Task Learning},
  author={Shi, Guangyuan and Li, Qimai and Zhang, Wenlong and Chen, Jiaxin and Wu, Xiao-Ming},
  journal={arXiv preprint arXiv:2302.11289},
  year={2023}
}

Updates

✅ 2023-04-17: Release the first version of the paper at Arxiv.
✅ 2022-04-17: Release the first version of codes and configs of Recon (including the implementation of CAGrad, PCGrad, Graddrop and MGDA).
✅ 2022-04-19: Upload the training scripts of Single-Task Learning Baseline.
🚧 (To do) Upload the training codes and configs on dataset PASCAL-Context and CelebA.
🚧 (To do) Upload implementations of BMTAS and RotoGrad.

Overview

Dependencies and Installation

Clone repo

git clone https://github.com/moukamisama/FS-IL.git

Install wandb

Downloading the Datasets

Refer to the README file in dataset folder.

Training The Baselines

Refer to the ./exp/ folder for the bash scripts of all baseline models on different datasets. For example, to train CAGrad on MultiFashion+MNIST datasets

./exp/MultiFashion+MNIST/run_CAGrad.sh

Training Recon

We provide the bash scripts of Recon on different datasets in the ./exp/ folder.

For example, to train Recon on MultiFashion+MNIST datasets, first we need to run the following codes for calculating the cos similarity between each pair of shared layers:

./exp/MultiFashion+MNIST/run_Recon.sh

Then we need to run the following code for Calculating the S-conflict Score of each layer and obtain the layers permutation:

./exp/MultiFashion+MNIST/calculate_Sconflict.sh

Training the modified model: Pre-calculated layer permutations are provided in ./logs/. You can skip the first two steps and directly run the following command to train the modified model:

./exp/MultiFashion+MNIST/run_Recon_Final.sh

Evaluation results can be seen in the logger or wandb. In the paper, we repeat the experiments with 3 different seeds for each dataset, and the average results of the last iteration are reported.

For Different Datasets

Our modified model can be easily applied to other datasets. The layer permutations we obtained can sometimes be directly used for other datasets. Tuning the hyperparameters (e.g., topK in the third procedure) directly on different datasets can lead to better performance.
Generating specific models for different datasets leads to better performance.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
archs		archs
criterion		criterion
data		data
datasets		datasets
exp		exp
logs		logs
losses		losses
models		models
scripts		scripts
utils		utils
README.md		README.md
overview.png		overview.png
version.py		version.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Recon [Paper Link]

Recon: Reducing Conflicting Gradients from the Root for Multi-Task Learning

BibTeX

Updates

Overview

Dependencies and Installation

Downloading the Datasets

Training The Baselines

Training Recon

For Different Datasets

About

Releases

Packages

Languages

moukamisama/Recon

Folders and files

Latest commit

History

Repository files navigation

Recon [Paper Link]

Recon: Reducing Conflicting Gradients from the Root for Multi-Task Learning

BibTeX

Updates

Overview

Dependencies and Installation

Downloading the Datasets

Training The Baselines

Training Recon

For Different Datasets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages