Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning

Code for "Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning", AAMAS 2025 paper.

Introduction

we propose the Salience-Invariant Consistent Policy Learning(SCPL) algorithm, an efficient framework for zero-shot generalization in visual reinforcement learning. SCPL utilizes a novel value consistency module to encourage the encoder and value function to capture task-relevant pixels in original and perturbed observations. Meanwhile, a dynamics module is proposed to generate dynamic and reward relevant representations for both observations. Furthermore, SCPL regularizes the policy network using a KL divergence constraint between the policies for original and augmented observations, enabling agents to make consistent decisions in test environments. Experimental results demonstrate SCPL's superior performance over state-of-the-art baselines.

Quick Start

Setting up repo

git clone https://github.com/bofusun/SCPL

Install Dependencies

conda create -n SCPL python=3.8
conda activate SCPL
cd SCPL
pip install -r requirements.txt

Train

(1) train scpl without dynamic module with random convolution augmentation

python src/my_train_all.py --domain_name walker --task_name walk --algorithm scpl0r --seed 0

(2) train scpl without dynamic module with overlay augmentation

python src/my_train_all.py --domain_name walker --task_name walk --algorithm scpl0 --seed 0

(3) train scpl with random convolution augmentation

python src/my_train_all.py --domain_name walker --task_name walk --algorithm scplr --seed 0

(4) train scpl without dynamic module with overlay augmentation

python src/my_train_all.py --domain_name walker --task_name walk --algorithm scpl --seed 0

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
figures		figures
scripts		scripts
setup		setup
src		src
LICENSE		LICENSE
README.md		README.md
READMES.md		READMES.md
command.sh		command.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning

Introduction

Quick Start

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning

Introduction

Quick Start

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages