Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward

HxLyn3/Diaster

Diaster: Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward

This is the code for the paper "Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward".

Installation instructions

Set up the Python environment with:

conda create -n diaster python=3.9 -y
conda activate diaster
conda install pytorch cudatoolkit=11.3 -c pytorch -y
pip install -r ./requirements.txt
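After the commands above finish, it can help to confirm that the diaster environment is the active one and that PyTorch can see a GPU. The snippet below is a minimal sketch, not part of the repository: the helper name describe_environment is hypothetical, and it only relies on the standard library plus torch.__version__ and torch.cuda.is_available().

```python
import importlib.util
import sys


def describe_environment():
    """Collect basic facts about the active Python / PyTorch install.

    Hypothetical helper for illustration; it is not shipped with Diaster.
    """
    info = {
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        "torch_installed": importlib.util.find_spec("torch") is not None,
    }
    if info["torch_installed"]:
        import torch  # imported lazily so the check still runs without it

        info["torch_version"] = torch.__version__
        info["cuda_available"] = torch.cuda.is_available()
    return info


for key, value in describe_environment().items():
    print(f"{key}: {value}")
```

With the environment created as above, the reported Python version should be 3.9. If cuda_available is False, training would presumably fall back to the CPU or fail, depending on how main.py selects its device.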

Run an experiment

python main.py --env-name=[Env name]

Each task has a config file that supplies its default settings.

All config files are located in config/. The --env-name argument selects one of them by name; the available names include Hopper-v3, Walker2d-v3, Swimmer-v3, Humanoid-v3, and HumanoidStandup-v2.

All results will be stored in the result folder.
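The layout inside the result folder is not described here, so the following is only a generic sketch for inspecting whatever a run has written. The function name list_result_files is hypothetical, and the default path "result" assumes the script is run from the repository root.

```python
from pathlib import Path


def list_result_files(root="result"):
    """Return the relative paths of all files under `root`, sorted.

    Hypothetical helper; it makes no assumption about file names or formats.
    """
    root_path = Path(root)
    if not root_path.is_dir():
        return []  # nothing has been written yet
    return sorted(
        str(p.relative_to(root_path)) for p in root_path.rglob("*") if p.is_file()
    )


for name in list_result_files():
    print(name)
```

Before any experiment has been run the folder may not exist, in which case the loop prints nothing.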

For example, to run Diaster on Hopper:

python main.py --env-name=Hopper-v3
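To run every environment named in this README one after another, the single command above can be wrapped in a small driver. This is a sketch under stated assumptions: ENV_NAMES, build_command, and run_all are hypothetical names that do not exist in the repository, the list contains only the environments mentioned here, and the script is assumed to be started from the repository root with the diaster environment active. It uses only the documented --env-name flag.

```python
import subprocess
import sys

# Environment names mentioned in this README; each is assumed to match a
# config file in config/. The real folder may contain others.
ENV_NAMES = [
    "Hopper-v3",
    "Walker2d-v3",
    "Swimmer-v3",
    "Humanoid-v3",
    "HumanoidStandup-v2",
]


def build_command(env_name):
    """Build the documented command line for one environment."""
    return [sys.executable, "main.py", f"--env-name={env_name}"]


def run_all(env_names=ENV_NAMES, dry_run=True):
    """Print, and unless dry_run is set also execute, one run per environment."""
    commands = [build_command(name) for name in env_names]
    for command in commands:
        print(" ".join(command))
        if not dry_run:
            # check=True stops the batch at the first failing run.
            subprocess.run(command, check=True)
    return commands


run_all(dry_run=True)  # switch to dry_run=False to actually launch the runs
```

The dry-run default only prints the commands, which makes it easy to check them before committing to several long training runs. Using sys.executable keeps each run inside the same interpreter, and therefore the same conda environment, that started the driver.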
