Experimental support for stable_baselines3 by blumu · Pull Request #86 · microsoft/CyberBattleSim

blumu · 2022-09-22T03:45:30Z

No description provided.

- Observation fields can now be optionally padded to the shape expected by their corresponding gym space. (Requires more memory but is needed to train with stable-baseline agents) - Gym wrappers to flatten the Action and Observation spaces from CyberBattleSim - Hack CyberBattleSim environment to allow invalid moves and return negative reward instead - Flatten multi-dimensioned `MultiBinary` spaces * works with spaces.MultiBinary([list]) and spaces.MultiBinary(number) * working with `nodes_privilegelevel` * works with `leaked_credentials` * works with `credential_cache_matrix` * works with `discovered_nodes_properties` - Add a `stable-baseline` test notebook (Requires custom patch of stable-baseline3 from bug DLR-RM/stable-baselines3#1073) - Fix python 3.8 warnings

blumu added the Hackathon label Sep 22, 2022

blumu force-pushed the wiblum/sb branch 4 times, most recently from b715508 to db7d639 Compare September 23, 2022 20:35

blumu force-pushed the wiblum/sb branch from db7d639 to c595d18 Compare September 23, 2022 21:10

blumu merged commit 4fd228b into main Sep 24, 2022

blumu deleted the wiblum/sb branch September 30, 2022 18:00

Screamer-Y mentioned this pull request Nov 2, 2022

Play the environment with other RL algorithms #91

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experimental support for stable_baselines3#86

Experimental support for stable_baselines3#86
blumu merged 1 commit intomainfrom
wiblum/sb

blumu commented Sep 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

blumu commented Sep 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant