Computer Science Ph.D student at Drexel University researching Game Artificial Intelligence
-
Drexel University
- Philadelphia, PA
- https://costa.sh
- @vwxyzjn
Highlights
- Pro
Block or Report
Block or report vwxyzjn
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
-
ppo-implementation-details Public
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
-
gym-microrts Public
The Reinforcement Learning environment for the popular Real-time Strategy game simulator μRTS
-
-
invalid-action-masking Public
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
1,297 contributions in the last year
Contribution activity
June 2022
Created 5 repositories
- vwxyzjn/entity-gym Python
- vwxyzjn/hyperstate Python
- vwxyzjn/entity-ppo-demo Python
- vwxyzjn/enn-zoo Python
- vwxyzjn/envpool-cleanrl Python
Created a pull request in vwxyzjn/cleanrl that received 9 comments
Clarify CleanRL is a non-modular library
Description Closes #197. Types of changes Documentation
+16
−11
•
9
comments
Opened 15 other pull requests in 7 repositories
vwxyzjn/cleanrl
7
merged
entity-neural-network/enn-zoo
1
open
1
merged
Denys88/rl_games
2
merged
entity-neural-network/entity-gym
1
open
cswinter/hyperstate
1
merged
entity-neural-network/incubator
1
open
ikostrikov/jaxrl
1
merged
Reviewed 4 pull requests in 2 repositories
vwxyzjn/cleanrl
3 pull requests
openai/gym
1 pull request
Created an issue in vwxyzjn/cleanrl that received 1 comment
Removing the regular advantage calculation in PPO
Problem description.
The regular advantage calculation in PPO is a special case of the GAE advantage calculation when gae_lambda=1
- we empirically…
1
comment