ECLAIR is a Reinforcement Learning (RL) framework that integrates different types of natural language feedback to interactively shape robots’ behaviours. The model consists of two phases:
- Advice interpretation: we leverage the use of LLMs to translate the spoken feedback into different value, specifically evaluative feedback, corrective feedback, and guidance for the next action.
- Advice shaping: this consists of integrating the different types of feedback in the RL algorithm to update and refine the policy of the robot.
