Qlib RL framework (stage 2) - trainer #1125

ultmaster · 2022-06-13T13:15:20Z

Description

This PR introduces a trainer implementation for training a policy in the Qlib RL framework.

This includes the following components:

TrainingVessel: a bundle containing all the elements and logic used in training.
Trainer: implements a loop to drive the vessel.

The design mimics pytorch-lightning. The key difference is that it is tailored to the RL paradigm, in which data comes from online collection.
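To make the vessel/trainer split concrete, here is a minimal sketch of the idea. It is not the code in this PR: `ToyVessel`, `ToyTrainer`, `constant_collector`, and every method and parameter name below are hypothetical, and the "policy update" is a placeholder. It only illustrates that the vessel owns the training logic and collects data online in each iteration, while the trainer merely drives the loop.

```python
from typing import Callable, Dict, List


class ToyVessel:
    """Hypothetical stand-in for a training vessel (not the Qlib API).

    It bundles everything one training iteration needs: how data is
    collected and how the policy is updated.
    """

    def __init__(self, collect: Callable[[int], List[float]], episodes_per_iter: int = 4) -> None:
        self.collect = collect
        self.episodes_per_iter = episodes_per_iter
        self.weight = 0.0  # stand-in for policy parameters

    def train(self) -> Dict[str, float]:
        # Data is collected online in every iteration, not read from a fixed dataset.
        rewards = self.collect(self.episodes_per_iter)
        mean_reward = sum(rewards) / len(rewards)
        # Placeholder update: move the parameter halfway toward the mean reward.
        self.weight += 0.5 * (mean_reward - self.weight)
        return {"reward": mean_reward, "weight": self.weight}


class ToyTrainer:
    """Hypothetical trainer: implements only the loop that drives the vessel."""

    def __init__(self, max_iters: int) -> None:
        self.max_iters = max_iters
        self.history: List[Dict[str, float]] = []

    def fit(self, vessel: ToyVessel) -> None:
        for _ in range(self.max_iters):
            self.history.append(vessel.train())


def constant_collector(n_episodes: int) -> List[float]:
    # Assumed toy environment: every episode yields a reward of 1.0.
    return [1.0] * n_episodes


vessel = ToyVessel(constant_collector)
trainer = ToyTrainer(max_iters=3)
trainer.fit(vessel)
```

Keeping the loop in the trainer and the logic in the vessel means the same loop could drive a different vessel without modification, which is the same division of labor pytorch-lightning uses between its trainer and module.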

The trainer relies on a full logger implementation to be fully functional. The logger will be refactored and implemented in the next stage.

Motivation and Context

It's an immediate follow-up of #1076.

How Has This Been Tested?

Pass the test by running: pytest qlib/tests/test_all_pipeline.py from the parent directory of qlib.
If you are adding a new feature, test on your own test scripts.

Screenshots of Test Results (if appropriate):

Pipeline test:
Your own tests:

Types of changes

Fix bugs
Add new feature
Update documentation


qlib/rl/trainer/trainer.py

qlib/rl/trainer/vessel.py

qlib/rl/order_execution/reward.py

qlib/rl/trainer/callbacks.py

you-n-g · 2022-06-28T02:24:57Z

Please merge main branch to run the new version of CI

you-n-g · 2022-06-28T11:53:00Z

It looks great now!

* checkpoint (cherry picked from commit 1a8e0bd) * Not a workable version (cherry picked from commit 3498e18) * vessel * ckpt * . * vessel * . * . * checkpoint callback * . * cleanup * logger * . * test * . * add test * . * . * . * . * New reward * Add train API * fix mypy * fix lint * More comment * 3.7 compat * fix test * fix test * . * Resolve comments * fix typehint

ultmaster added 30 commits June 1, 2022 02:27

checkpoint

542d295

(cherry picked from commit 1a8e0bd)

Not a workable version

d5e15ac

(cherry picked from commit 3498e18)

vessel

4acb0c2

ckpt

2d1d8cb

.

1f85487

vessel

ea40fdf

.

6816c7e

.

319766f

checkpoint callback

f2c02e0

.

4db8567

cleanup

163bc2a

logger

b76b810

.

647dc76

test

fc7eb9a

.

38ecc21

add test

67a53fb

.

6b391de

.

c73ec3a

.

5980e45

.

85710ef

New reward

26883c8

Add train API

30afc6c

fix mypy

68716e4

fix lint

cbf2577

Merge branch 'main' of https://github.com/microsoft/qlib into rl-trai…

8e479c2

…ner-2

More comment

4aa421f

3.7 compat

54d2342

fix test

f432525

fix test

5344846

.

3123f1a

matluster mentioned this pull request Jun 14, 2022

[Proposal] Systematic RL support in qlib #1011

Open

ultmaster requested review from lihuoran and you-n-g June 14, 2022 03:37

you-n-g reviewed Jun 24, 2022

Resolve comments

1bb307d

ultmaster added 3 commits June 28, 2022 03:46

Merge branch 'main' of https://github.com/microsoft/qlib into rl-trai…

e8c6f4f

…ner-2

fix typehint

5fe8bff

Merge branch 'main' of https://github.com/microsoft/qlib into rl-trai…

4b5fcb0

…ner-2

you-n-g merged commit 25ecb11 into microsoft:main Jun 28, 2022

you-n-g added the enhancement New feature or request label Dec 9, 2022
