Skip to content

Stanford-ILIAD/multimodal-rewards-from-rankings

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Companion code to CoRL 2021 paper:
Vivek Myers, Erdem Bıyık, Nima Anari, Dorsa Sadigh. "Learning Multimodal Rewards from Rankings". 5th Conference on Robot Learning (CoRL), London, UK, Nov. 2021.

This code actively learns multimodal reward functions from rankings in various tasks with respect to an information gain acquisition function and compares it to random querying.

The codes for the interface of the user studies are excluded, but the environments can still be simulated with the given trajectory datasets.

Dependencies

You need to have the following libraries with Python3:

Running

You simply run:

	python run.py [task_name]

where [task_name] is either of the following: lunar, fetch, synthetic. The output is a PNG file in the main directory that compares the two querying methods.

About

Companion code to CoRL 2021 paper "Learning Multimodal Rewards from Rankings"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages