Reward Design with Language Models

The code in this repository is based on the paper Reward Design with Language Models. This repository contains the prompts that we used for each domain as well as code to train an RL agent with an LLM in the loop using those prompts. Each domain (Ultimatum Game, Matrix Games, DealOrNoDeal) has a separate directory and will need a seperate conda/virtual environment. Please check out the READMEs in each directory for more information on how to run things.

Using GPT3

We use GPT3 for our experiments. You will need to have an API key from them saved in your ~/.bashrc or ~/.zshrc under the variable OPENAI_API_KEY.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
matrix		matrix
negotiation		negotiation
ultimatum		ultimatum
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

matrix

matrix

negotiation

negotiation

ultimatum

ultimatum

README.md

README.md

Repository files navigation

Reward Design with Language Models

Using GPT3

About

Releases

Packages

Contributors 2

Languages

minaek/reward_design_with_llms

Folders and files

Latest commit

History

Repository files navigation

Reward Design with Language Models

Using GPT3

About

Resources

Stars

Watchers

Forks

Languages