[RLlib] Hope RLlib can support DQfD & POfD #25058

mahuangxu · 2022-05-21T04:21:01Z

Description

DQfD algorithm has mentioned in the latest feature-overview, but it is not yet supported in the code. Hope RLlib can support DQfD & POfD!

Offline RL and imitation learning/behavior cloning: You don’t have a simulator for your particular problem, but tons of historic data recorded by a legacy (maybe non-RL/ML) system? This branch of reinforcement learning is for you! RLlib’s comes with several offline RL algorithms (CQL, MARWIL, and DQfD), allowing you to either purely behavior-clone your existing system or learn how to further improve over it.)

RLlib: Industry-Grade Reinforcement Learning — Ray 1.12.1

Use case

Classical Imitation learning Algorithm

mahuangxu added the enhancement Request for new feature and/or capability label May 21, 2022

sven1977 added the rllib RLlib related issues label May 23, 2022

kouroshHakha added the P2 Important issue, but not time-critical label May 24, 2022

Rohan138 added the rllib-contrib Issues related to algorithms in rllib-contrib label Jul 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Hope RLlib can support DQfD & POfD #25058

[RLlib] Hope RLlib can support DQfD & POfD #25058

mahuangxu commented May 21, 2022

[RLlib] Hope RLlib can support DQfD & POfD #25058

[RLlib] Hope RLlib can support DQfD & POfD #25058

Comments

mahuangxu commented May 21, 2022

Description

Use case