Skip to content

Latest commit

 

History

History
12 lines (10 loc) · 253 Bytes

READNE.md

File metadata and controls

12 lines (10 loc) · 253 Bytes

DOPS:

DOPS (Diverse Optimal Policy Sampling) mitigates out-of-distribution (OOD) actions by minimizing the Q-value of the Q-function according to the learned policy.

Setup

pip install -r requirements.txt

Run

run main.py