Implementation of Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores.
Dependencies:
We use hydra to manage our configs:

```
pip install hydra-core --upgrade
```

We use pyqt5 to create our scoring GUI:

```
pip install pyqt5
pip install pyqt5-tools
```
This repo only supports conducting real-human experiments in Metaworld environments. pyqt5 seems to conflict with RLBench (CoppeliaSim), so if you want to try a real-human experiment you may need to delete the qt5 Python package and remove `export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so` (used for saving the reference videos) from your `~/.bashrc` file.
Other requirements are listed in `requirements.txt`.
We use hydra to manage our hyperparameters. The config files for the different experiments are listed in the `config/` folder. To run one, just specify the CONFIG_NAME, for example `--config-name mujoco-HalfCheetah-fb250` or `--config-name metaworld-ButtonPress-fb500`:
```
python main.py [--config-name CONFIG_NAME]
```
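For orientation, a hydra entry point looks roughly like the sketch below. This is not the repo's actual `main.py`; only the `config/` folder and the config name come from this README:

```python
# Minimal hydra entry point (illustrative sketch, not the repo's main.py).
# Assumes the config files live in config/, as described above.
import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="config", config_name="mujoco-HalfCheetah-fb250",
            version_base=None)
def main(cfg: DictConfig) -> None:
    # Hydra composes the named .yaml config and passes it in as a DictConfig.
    print(OmegaConf.to_yaml(cfg))

if __name__ == "__main__":
    main()
```

Hydra also lets you override individual hyperparameters from the command line, e.g. `python main.py --config-name mujoco-HalfCheetah-fb250 seed=1` (assuming the config defines a `seed` key).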
If you want to try your own configuration, you can follow these example configs and write your own `.yaml` configuration file; a rough sketch of the format follows below. The meaning of every hyperparameter is noted in the config files.
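A custom config is plain YAML parsed by OmegaConf (installed with hydra-core). Every key in the snippet below is hypothetical; copy the real keys from one of the existing configs instead:

```python
# Illustrative only: all keys in this YAML snippet are hypothetical.
from omegaconf import OmegaConf

yaml_text = """
env_name: HalfCheetah-v3   # hypothetical key: which environment to run
seed: 0                    # hypothetical key: random seed
feedback_num: 250          # hypothetical key: total score feedback budget
wand_log: false            # wandb logging switch mentioned below
"""
cfg = OmegaConf.create(yaml_text)
print(cfg.env_name, cfg.feedback_num)
```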
We use wandb to log our experiment data. If you want to use it, please set the hyperparameter `wand_log` to `true`.
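A typical way such a switch is wired up looks like the sketch below; apart from the `wand_log` flag, everything (project name, function names) is an assumption, not the repo's actual code:

```python
# Illustrative sketch of gating wandb logging behind the wand_log flag.
import wandb

def setup_logging(cfg_dict):
    # cfg_dict: the experiment hyperparameters as a plain dict.
    if cfg_dict.get("wand_log"):
        # The project name here is hypothetical.
        wandb.init(project="learning-from-scores", config=cfg_dict)

def log_return(cfg_dict, step, episode_return):
    if cfg_dict.get("wand_log"):
        wandb.log({"episode_return": episode_return}, step=step)
```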
We use slightly customized environments for the experiments in the RLBench simulator. For the usage of RLBench, you can check https://github.com/SSKKai/RLBench-SAC-with-DenseReward
We designed a scoring GUI for real-human experiments. You can try it by running:

```
python main.py --config-name metaworld-ButtonPress-realhuman
```
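The GUI itself ships with the repo; for a feel of what a pyqt5 scoring widget involves, here is a self-contained sketch (the window title, score range, and layout are assumptions, not the repo's actual GUI):

```python
# Illustrative pyqt5 scoring widget: a slider plus a submit button.
import sys
from PyQt5.QtCore import Qt
from PyQt5.QtWidgets import (QApplication, QLabel, QPushButton,
                             QSlider, QVBoxLayout, QWidget)

class ScoreWindow(QWidget):
    def __init__(self):
        super().__init__()
        self.setWindowTitle("Trajectory Scoring")  # hypothetical title
        self.slider = QSlider(Qt.Horizontal)
        self.slider.setRange(0, 10)                # score scale is an assumption
        self.label = QLabel("Score: 0")
        self.slider.valueChanged.connect(
            lambda v: self.label.setText(f"Score: {v}"))
        submit = QPushButton("Submit score")
        submit.clicked.connect(self.submit)
        layout = QVBoxLayout(self)
        layout.addWidget(self.label)
        layout.addWidget(self.slider)
        layout.addWidget(submit)

    def submit(self):
        # In the real pipeline the score would be handed back to the learner.
        print(f"submitted score: {self.slider.value()}")

if __name__ == "__main__":
    app = QApplication(sys.argv)
    win = ScoreWindow()
    win.show()
    sys.exit(app.exec_())
```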
We use wandb to log our experiment data; you can view it via the following links:
- Metaworld ButtonPress with Scoring Noise and 0.5 Scoring Precision
- Mujoco HalfCheetah with Scoring Noise and 0.5 Scoring Precision
- Metaworld ButtonPress Comparison Experiments (including real human experiments)
- Metaworld SweepInto Comparison Experiments
- Mujoco HalfCheetah Comparison Experiments
- Mujoco Ant Comparison Experiments
This repo is based on pytorch-soft-actor-critic.
We use the similarity_measures repo to compute the DTW trajectory distance.
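For reference, computing a DTW distance between two trajectories with that package (published on PyPI as `similaritymeasures`) looks like the sketch below; the toy trajectories are made up for illustration:

```python
# Computing a DTW distance with the similaritymeasures package
# (pip install similaritymeasures). Trajectories here are toy data.
import numpy as np
import similaritymeasures

# Two trajectories as (T, D) arrays: T timesteps, D state dimensions.
traj_a = np.column_stack([np.linspace(0, 1, 50),
                          np.sin(np.linspace(0, 3, 50))])
traj_b = np.column_stack([np.linspace(0, 1, 60),
                          np.sin(np.linspace(0, 3, 60) + 0.2)])

# dtw returns the DTW distance and the accumulated cost matrix.
distance, cost_matrix = similaritymeasures.dtw(traj_a, traj_b)
print(f"DTW distance: {distance:.4f}")
```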