Refine explore strategy, add prioritized sampling support; add DDQN example; add DQN test #590

lihuoran · 2023-05-06T01:21:06Z

Description

Refine the RL exploration strategy and add support for prioritized sampling.
Add a DDQN example.
Add a DQN algorithm test; the performance evaluation results are appended to the test part.
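
For readers unfamiliar with prioritized sampling, here is a minimal sketch of the proportional variant commonly used with replay buffers. It is not taken from this PR: the function name `sample_prioritized`, the `alpha`/`beta` defaults, and the flat-array layout are all assumptions made for illustration, and the actual implementation in the RL toolkit may differ (for example, by using a tree structure for faster sampling).

```python
import numpy as np


def sample_prioritized(priorities, batch_size, alpha=0.6, beta=0.4, rng=None):
    """Sample buffer indices in proportion to priority ** alpha.

    Hypothetical helper, not part of MARO.

    Args:
        priorities: 1-D array of non-negative priorities, one per stored item.
        batch_size: Number of indices to draw (with replacement).
        alpha: How strongly priorities skew sampling (0 means uniform).
        beta: Strength of the importance-sampling correction.
        rng: Optional np.random.Generator for reproducibility.

    Returns:
        (indices, weights): sampled indices and their importance-sampling
        weights, normalized so that the largest weight is 1.
    """
    rng = np.random.default_rng() if rng is None else rng
    priorities = np.asarray(priorities, dtype=np.float64)

    # Turn priorities into a sampling distribution.
    scaled = priorities ** alpha
    probs = scaled / scaled.sum()

    indices = rng.choice(len(priorities), size=batch_size, p=probs)

    # Importance-sampling weights offset the bias from non-uniform sampling.
    weights = (len(priorities) * probs[indices]) ** (-beta)
    weights /= weights.max()
    return indices, weights
```

Items with larger priorities (typically larger TD errors) are drawn more often, while the returned weights scale down their contribution to the loss.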

Linked issue(s)/Pull request(s)

issue_number

Type of Change

Related Component

Simulation toolkit
RL toolkit
Distributed toolkit

Has Been Tested

OS:
- Windows
- Mac OS
- Linux
Python version:
- 3.7
- 3.8
- 3.9
Key information snapshot(s):

Needs Follow Up Actions

New release package
New docker image

Checklist

Add/update the related comments
Add/update the related tests
Add/update the related documentations
Update the dependent downstream modules usage

maro/rl/exploration/strategies.py

Jinyu-W · 2023-05-18T15:43:30Z

maro/rl/exploration/strategies.py

+        **kwargs: Any,
+    ) -> np.ndarray:
+        return np.array(
+            [act if np.random.random() > self._eps else np.random.randint(self._num_actions) for act in action],


Is this for batch operation?

Add input and output descriptions for the newly added classes/functions.
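
To illustrate both review comments, here is a sketch of a batch-oriented epsilon-greedy step with its inputs and outputs documented. It is only an illustration under assumptions: the standalone function `epsilon_greedy_batch` and its `rng` parameter are hypothetical, while `num_actions` and `eps` mirror the `self._num_actions` and `self._eps` attributes visible in the diff above. It replaces the per-element list comprehension with vectorized NumPy calls; the sampling behavior is intended to be equivalent, not identical draw for draw.

```python
import numpy as np


def epsilon_greedy_batch(
    action: np.ndarray,
    num_actions: int,
    eps: float,
    rng: np.random.Generator,
) -> np.ndarray:
    """Apply epsilon-greedy exploration to a batch of discrete actions.

    Hypothetical helper, not part of MARO.

    Args:
        action: Integer array of shape (batch_size,) with the greedy actions.
        num_actions: Size of the discrete action space.
        eps: Probability of replacing an action with a uniformly random one.
        rng: NumPy random generator, passed in for reproducibility.

    Returns:
        Integer array of shape (batch_size,) containing the explored actions.
    """
    action = np.asarray(action)
    # One uniform draw per batch element decides whether to explore.
    explore = rng.random(action.shape) < eps
    # Candidate random actions for every element; used only where explore is True.
    random_actions = rng.integers(num_actions, size=action.shape)
    return np.where(explore, random_actions, action)
```

With `eps=0.0` the input is returned unchanged, and with `eps=1.0` every action is resampled uniformly from `[0, num_actions)`.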

maro/rl/exploration/strategies.py

tests/rl/tasks/dqn/__init__.py

Jinyu-W · 2023-05-18T15:59:17Z

Update the title and description of this PR

tests/rl/performance.md

* Refine explore strategy, add prioritized sampling support; add DDQN example; add DQN test (#590)
* Runnable. Should setup a benchmark and test performance.
* Refine logic
* Test DQN on GYM passed
* Refine explore strategy
* Minor
* Minor
* Add Dueling DQN in CIM scenario
* Resolve PR comments
* Add one more explanation
* fix env_sampler eval info list issue
* update version to 0.3.2a4

---------

Co-authored-by: Huoran Li <huoranli@microsoft.com>
Co-authored-by: Jinyu Wang <Wang.Jinyu@microsoft.com>

lihuoran requested a review from Jinyu-W May 6, 2023 01:21

lihuoran marked this pull request as ready for review May 12, 2023 03:40

Jinyu-W reviewed May 18, 2023

Jinyu-W force-pushed the v0.3 branch from d6f775d to b3c6a58 Compare May 19, 2023 01:55

lihuoran and others added 7 commits May 22, 2023 14:36

Runnable. Should setup a benchmark and test performance.

0171ccd

Refine logic

ec501c4

Test DQN on GYM passed

9c9adad

Refine explore strategy

49c1ea0

Minor

92b4c7d

Minor

a58fee5

Add Dueling DQN in CIM scenario

a09750e

Jinyu-W force-pushed the huoran/ddqn_and_prio_memory branch from 2811239 to a09750e Compare May 22, 2023 06:36

Resolve PR comments

13bb140

Jinyu-W reviewed May 22, 2023

tests/rl/performance.md

Add one more explanation

4a8f707

Jinyu-W changed the title ~~Huoran/ddqn and prio memory~~ Refine explore strategy, add prioritized sampling support; add DDQN example; add DQN test May 23, 2023

Jinyu-W approved these changes May 23, 2023

Jinyu-W merged commit 607d3b6 into v0.3 May 23, 2023
7 of 13 checks passed

Jinyu-W deleted the huoran/ddqn_and_prio_memory branch May 23, 2023 07:52

lihuoran commented May 6, 2023 • edited by Jinyu-W
