PaMaL

Official implementation of Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models.


Installation

git clone https://github.com/nik-dim/pamal.git && cd pamal

# Method 1: Conda
conda create -n pamal python=3.10
conda activate pamal
pip install -r requirements.txt

# Method 2: Dockerfile
docker build -t pamal .
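
Once the image is built, experiments can be run inside the container. The command below is an illustrative sketch, not a documented invocation; adjust the GPU flags and any volume mounts to your setup:

# run an experiment inside the container (illustrative; flags depend on your setup)
docker run --gpus all -it pamal python multimnist.py method=pamal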

Running Experiments

The experiments are configured with Hydra, which composes an experiment from a hierarchy of configuration files. See the Hydra documentation for more details.

The configuration files live in the configs/ directory and are split into two subdirectories. The first, configs/general/, houses the definitions of methods, optimizers, datasets, etc., while the second, configs/experiment/, uses these definitions to configure the experiments for each dataset. The supported methods and datasets are listed in the Supported Methods and Supported Datasets sections below.
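
To inspect how Hydra composes these files, a minimal entry point along the following lines prints the fully resolved configuration. This is a sketch: the config_path and config_name values are illustrative assumptions, and the repository's actual entry points are the per-dataset scripts such as multimnist.py.

import hydra
from omegaconf import DictConfig, OmegaConf

# Illustrative sketch: point config_path/config_name at the actual experiment config.
@hydra.main(config_path="configs", config_name="experiment/multimnist", version_base=None)
def main(cfg: DictConfig) -> None:
    # Print the composed configuration: general definitions plus experiment overrides.
    print(OmegaConf.to_yaml(cfg))

if __name__ == "__main__":
    main()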

For example, we can run experiments on the MultiMNIST dataset as follows:

# run experiment on MultiMNIST using the proposed method (PaMaL) 
python multimnist.py method=pamal

# ...modify the `inner_method` parameter specific to pamal
python multimnist.py method=pamal method.inner_method=ls # or gradnorm or rwa

# run the same experiment for a baseline
python multimnist.py method=ls # or mgda or phn etc

To launch multiple experiments at once (e.g., ablation studies), use the --multirun (-m) flag. Note that this flag can be problematic in conjunction with Weights & Biases logging. For example, to run the ablation study for the proposed method:

# launch the previous experiment for multiple seeds
python multimnist.py -m method=pamal seed=0,1,2
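
Hydra's sweep syntax accepts comma-separated values for any parameter, so the seed sweep can be combined with, e.g., a sweep over pamal's inner_method:

# sweep jointly over seeds and the pamal-specific inner_method (3x3 = 9 runs)
python multimnist.py -m method=pamal method.inner_method=ls,gradnorm,rwa seed=0,1,2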

Alternatively, check the run.sh script for launching multiple experiments.

Supported Methods

This repository supports the methods outlined in the table below. Apart from the proposed method pamal, we support the Pareto Front Approximation methods phn and cosmos, as well as single-solution methods such as rlw, graddrop, autol, rotograd, cagrad, nashmtl, pcgrad, dwa, si, mgda, uw, ls, and stl.

To select a baseline, set method=XXX, where XXX is the ticker listed below. For example, to run the ls baseline, run python census.py method=ls.

Ticker     Paper
stlX       Single-Task Learning for task X, X = 0, 1, 2, ...
ls         Fixed Weighting (e.g., equal weights)
uw         Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
mgda       Multi-Task Learning as Multi-Objective Optimization
si         Scale-Invariant baseline; see Multi-Task Learning as a Bargaining Game for details
dwa        End-to-End Multi-Task Learning with Attention
pcgrad     Gradient Surgery for Multi-Task Learning
imtl       Towards Impartial Multi-task Learning
cagrad     Conflict-Averse Gradient Descent for Multi-task Learning
nashmtl    Multi-Task Learning as a Bargaining Game
rlw        A Closer Look at Loss Weighting in Multi-Task Learning
graddrop   Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout
autol      Auto-Lambda: Disentangling Dynamic Task Relationships
rotograd   RotoGrad: Gradient Homogenization in Multitask Learning
phn        Learning the Pareto Front with Hypernetworks
cosmos     Scalable Pareto Front Approximation for Deep Multi-Objective Learning
pamal      Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models

Supported Datasets

The repository supports the datasets in the table below. By default, the datasets are downloaded to the ~/benchmarks/data directory; the root path can be modified in src/datasets/base_data_module.py (see the sketch after the table).

Dataset       Notes
MultiMNIST    needs to be created
Census        needs to be downloaded
MultiMNIST3   needs to be created
UtkFace       needs to be downloaded
Cityscapes    needs to be downloaded
CelebA        needs to be downloaded
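
To illustrate the root-path change, the default might be defined along the following lines; the variable name below is a hypothetical sketch, not the actual code in src/datasets/base_data_module.py:

from pathlib import Path

# Hypothetical sketch of the default data root (the real definition lives in
# src/datasets/base_data_module.py); edit the path there to relocate datasets.
DEFAULT_DATA_ROOT = Path.home() / "benchmarks" / "data"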

Extending the codebase

The code is structured similarly to PyTorch Lightning. The baselines are adapted from several existing codebases, so the API is not consistent across all of them. Most baselines live in src/methods/weight_methods.py; however, graddrop, autol, rotograd, phn, and cosmos are each implemented in their own way. To add a new single-solution method, start from src/methods/weight_methods.py, which collects multiple loss- and gradient-balancing methods; for multi-solution methods, see src/trainer/multi_solution_trainer.py.
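
For instance, a new loss-balancing baseline could look roughly like the sketch below. The class name and the backward signature are assumptions for illustration, not the repository's actual API; match them to the pattern used by the methods in src/methods/weight_methods.py.

import torch

# Hypothetical single-solution baseline (NOT the repository's actual API):
# weight each task loss by the inverse of its magnitude, normalizing away
# loss scale so every task contributes comparably to the update.
class InverseLossWeighting:
    def backward(self, losses: torch.Tensor, **kwargs) -> torch.Tensor:
        # losses: shape (num_tasks,), one differentiable scalar loss per task
        weights = 1.0 / losses.detach().clamp_min(1e-8)
        weights = weights / weights.sum()   # normalize weights to sum to 1
        total = (weights * losses).sum()    # scalarize the multi-task loss
        total.backward()                    # single backward pass
        return total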

We plan to merge and unify the baselines using method-specific callbacks; for the moment, only Auto-$\lambda$ follows this pattern (see src/utils/callbacks/auto_lambda_callback.py). If you have any questions, please contact me.

References

The repository makes heavy use of previously developed codebases. Specifically, we use:

  1. https://github.com/lorenmt/mtan
  2. https://github.com/AvivNavon/nash-mtl
  3. https://github.com/AvivNavon/pareto-hypernetworks
  4. https://github.com/ruchtem/cosmos
  5. https://github.com/adrianjav/rotograd
  6. https://github.com/intel-isl/MultiObjectiveOptimization

Citation

If you find our work useful, please consider citing our paper:

@inproceedings{dimitriadis2023pareto,
    title={Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models},
    author={Dimitriadis, Nikolaos and Frossard, Pascal and Fleuret, Fran{\c{c}}ois},
    booktitle={International Conference on Machine Learning},
    year={2023},
    url={https://arxiv.org/abs/2210.09759}
}

Contact

If you have any questions, please contact nikolaos.dimitriadis@epfl.ch.
