-
Notifications
You must be signed in to change notification settings - Fork 152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refine RL workflow & tune RL models under GYM #577
Merged
Merged
Changes from 43 commits
Commits
Show all changes
49 commits
Select commit
Hold shift + click to select a range
6bc7f0b
PPO, SAC, DDPG passed
lihuoran b3f5aef
Explore in SAC
lihuoran 5dab711
Test GYM on server
lihuoran 211c06f
Sync server changes
lihuoran f92f7f1
Merge branch 'v0.3' into rl_benchmark_debug
lihuoran 514250a
pre-commit
lihuoran fc0c02d
Ready to try on server
lihuoran 9fcdf42
.
lihuoran 01b5a94
.
lihuoran dd27eed
.
lihuoran 1c8f258
.
lihuoran 1aa1085
.
lihuoran 148af38
Performance OK
lihuoran 99ff7b9
Move to tests
lihuoran 65ba1a1
Remove old versions
lihuoran f4a85b8
PPO done
lihuoran 2349191
Start to test AC
lihuoran f6f7dae
Start to test SAC
lihuoran 110fec4
SAC test passed
lihuoran 2a1ccd5
Multiple round in evaluation
lihuoran c371220
Modify config.yml
lihuoran a65d902
Add Callbacks
lihuoran aa484f8
[wip] SAC performance not good
lihuoran 84ec6e6
[wip] still not good
lihuoran 0ceaac4
update for some PR comments; Add a MARKDOWN file (#576)
Jinyu-W aad41d9
Use FullyConnected to replace mlp
lihuoran 8884231
Update action bound
lihuoran 0a01fb1
Merge branch 'rl_benchmark_debug' into rl_workflow_refine
lihuoran 0bd25ca
???
lihuoran 8781dd6
Change gym env wrapper metrics logci
lihuoran 7b9b698
Change gym env wrapper metrics logci
lihuoran 52b4d1d
refine env_sampler.sample under step mode
lihuoran a3fea0d
Add DDPG. Performance not good...
lihuoran 23f39d1
Add DDPG. Performance not good...
lihuoran 9da8b90
wip
lihuoran fb11c31
Sounds like sac works
lihuoran d7d3282
Refactor file structure
lihuoran ea26275
Refactor file structure
lihuoran 8881a1c
Refactor file structure
lihuoran b4db842
Pre-commit
lihuoran 8874a65
Merge branch 'rl_benchmark_debug' into rl_workflow_refine
lihuoran 2a7334b
Merge branch 'v0.3' into rl_workflow_refine
lihuoran eb7ae9b
Pre commit
lihuoran 627b7d1
Minor refinement of CIM RL
lihuoran 8386312
Jinyu/rl workflow refine (#578)
Jinyu-W b05c849
Resolve PR comments
lihuoran ab5e675
Compare PPO with spinning up (#579)
lihuoran e180f10
SAC Test parameters update (#580)
Jinyu-W 9371949
Episode truncation & early stopping (#581)
lihuoran File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -29,3 +29,5 @@ htmlcov/ | |
.coveragerc | ||
.tmp/ | ||
.xmake/ | ||
outputs/ | ||
tests/rl_log/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TODO: runtime error