Evaluative Corrective Guidance LAnguage as reInfoRcement (ECLAIR)

Codebase for : "Interactive Reinforcement Learning from Natural Language Feedback"

ECLAIR is a Reinforcement Learning (RL) framework that integrates different types of natural language feedback to interactively shape robots’ behaviours. The model consists of two phases:

Advice interpretation: we leverage the use of LLMs to translate the spoken feedback into different value, specifically evaluative feedback, corrective feedback, and guidance for the next action.
Advice shaping: this consists of integrating the different types of feedback in the RL algorithm to update and refine the policy of the robot.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Images		Images
Results		Results
datasets		datasets
src		src
Plots.ipynb		Plots.ipynb
README.md		README.md
Test LLMs.ipynb		Test LLMs.ipynb
eclair-1.png		eclair-1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Evaluative Corrective Guidance LAnguage as reInfoRcement (ECLAIR)

Codebase for : "Interactive Reinforcement Learning from Natural Language Feedback"

About

Uh oh!

Releases

Packages

Uh oh!

Languages

ImeneTar/ECLAIR

Folders and files

Latest commit

History

Repository files navigation

Evaluative Corrective Guidance LAnguage as reInfoRcement (ECLAIR)

Codebase for : "Interactive Reinforcement Learning from Natural Language Feedback"

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages