[RLlib] Hope RLlib can support DQfD & POfD #25058
Labels
enhancement
Request for new feature and/or capability
P2
Important issue, but not time-critical
rllib
RLlib related issues
rllib-contrib
Issues related to algorithms in rllib-contrib
Description
The DQfD algorithm is mentioned in the latest feature overview, but it is not yet supported in the code. I hope RLlib can support DQfD and POfD!
Offline RL and imitation learning/behavior cloning: You don’t have a simulator for your particular problem, but tons of historic data recorded by a legacy (maybe non-RL/ML) system? This branch of reinforcement learning is for you! RLlib comes with several offline RL algorithms (CQL, MARWIL, and DQfD), allowing you to either purely behavior-clone your existing system or learn how to further improve over it.
RLlib: Industry-Grade Reinforcement Learning — Ray 1.12.1
Use case
Classical imitation learning algorithms.