This folder contains the code for Self Reward Design with Fine-grained Interpretability.
In this project, we attempt to solve reinforcement learning problems using an artificial neural network (NN) designed to achieve interpretability in an extreme way: each neuron in the NN is defined through purposeful human design.
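To make the idea concrete, here is a minimal illustrative sketch of a hand-designed neuron; the function name, weighting, and range are assumptions for illustration, not the repository's actual classes. The point is that the designer fixes the weights so the activation has a direct human-readable meaning, instead of learning it end to end.

```python
def food_ahead_neuron(food_distance, max_range=3.0):
    """Fires (value in [0, 1]) when food is within range.

    The slope and range are chosen by the designer so the output
    reads directly as "how strongly to move toward the food".
    """
    # Hand-picked linear response: 1.0 at distance 0, 0.0 at max_range.
    activation = 1.0 - food_distance / max_range
    return max(0.0, min(1.0, activation))  # clamp, like a bounded ReLU

print(food_ahead_neuron(0.0))  # food right here -> 1.0
print(food_ahead_neuron(3.0))  # at the edge of range -> 0.0
```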
All commands used to execute our experiments can be found in misc/commands.txt and misc/commands_mujoco.txt. The full results can be found on our Google Drive.
We use a conda environment; env.yml is available. Some manual installation is still necessary. We use PyTorch 1.12.1; please perform the installation appropriate for your machine (refer to PyTorch's main website).
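A typical setup might look like the following; the environment name `srd` is an assumption (use the name declared inside env.yml), and the final line is only the CPU-only example, so substitute the CUDA-specific command from the PyTorch website for your machine.

```shell
# Create and activate the environment from the provided file.
# "srd" is an assumed name; env.yml may declare its own.
conda env create -f env.yml -n srd
conda activate srd

# PyTorch 1.12.1: pick the command matching your CUDA/CPU setup
# from pytorch.org; this is the plain CPU-only variant.
pip install torch==1.12.1
```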
In this scenario, traditional RL is not a suitable choice since interpretability is crucial. Fig. (C) is our main result, while Fig. (D) shows a result in which the lack of interpretability sabotages the outcome.
In this scenario, we use SRD to control the half-cheetah's motion.
In the following example, we show movement with inhibitor=2, i.e. we allow the user to give a "stop" instruction.
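A hypothetical sketch of the inhibitor idea follows; the function name, signature, and the meaning assigned to each level are illustrative assumptions, not the repository's API. It shows the mechanism in the simplest form: a user-issued "stop" signal gates the policy's action output to zero.

```python
def apply_inhibitor(action, inhibitor_level, stop_requested):
    """Suppress the action when the user asks the agent to stop.

    Assumed convention: level 2 is the mode that accepts a user
    "stop" instruction; level 0 ignores the user entirely.
    """
    if inhibitor_level >= 2 and stop_requested:
        return [0.0 for _ in action]  # fully suppress motion
    return action  # otherwise pass the policy's action through

print(apply_inhibitor([0.4, -0.7], inhibitor_level=2, stop_requested=True))
# -> [0.0, 0.0]
```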
To-do:
- remove ROOT_DIR argument that is not used.
- add more mujoco examples.
All code for version 1 has been moved into legacy/v1.
Quick start: refer to the _quick_start folder.
Existing results can be found at the Google Drive link.
A simple toy world where a fish either moves or eats food, while trying to stay alive.
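The fish world's dynamics can be sketched roughly as below; this is a simplified illustration under assumed rules (eating restores energy, every step costs one unit), and the repository's implementation may differ.

```python
def step(position, energy, action, food_at):
    """One transition in a 1-D toy world.

    action is "move" (advance one cell) or "eat"; the fish is
    considered dead once energy reaches zero (assumed rule).
    """
    if action == "eat" and position in food_at:
        return position, energy + 5      # eating restores energy
    if action == "move":
        return position + 1, energy - 1  # moving costs energy
    return position, energy - 1          # idling still burns energy

pos, en = 0, 3
pos, en = step(pos, en, "move", food_at={1})
pos, en = step(pos, en, "eat", food_at={1})
print(pos, en)  # -> 1 7
```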
A grid world where a robot tries to reach the target tile (yellow).
The project features uncertainty avoidance, where the robot tries to avoid lava tiles at all costs.
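One way to express "avoid lava at all costs" in a self-designed reward is to make the lava penalty dominate any achievable return; the sketch below is illustrative, and the constants are assumptions rather than the repository's values.

```python
# Assumed constants: the lava penalty is chosen so large that no
# path through lava can ever look attractive to the agent.
LAVA_PENALTY = -100.0
TARGET_REWARD = 10.0   # reaching the yellow target tile
STEP_COST = -0.1       # mild pressure to reach the target quickly

def reward(tile):
    if tile == "lava":
        return LAVA_PENALTY
    if tile == "target":
        return TARGET_REWARD
    return STEP_COST  # ordinary floor tile

print(reward("lava") < reward("floor") < reward("target"))  # -> True
```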