Refactor Unity Wrapper #25

Open
StepNeverStop opened this issue Dec 28, 2020 · 0 comments
Labels: enhancement (New feature or request), optimization (Better performance or solution)
@StepNeverStop (Owner)

  1. When connecting to Unity, the Python side should automatically send variables such as the number of parallel environments, the agents' radar detection density, and whether to force-reset the environment
  2. When initializing the training environment, the Python side should specify whether stacked state input is needed, so that no separate StackWrapper has to be written (a sketch follows)
  3. ...
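A rough sketch of what this could look like on the Python side; every name here (`FrameStack`, `make_unity_env`, `initialize_config`, `stack_frames`) is an illustrative assumption, not the actual RLs API:

```python
# Illustrative sketch only: parameter names are assumptions.
from collections import deque

import numpy as np


class FrameStack:
    """Python-side frame stacking, so no separate StackWrapper is needed."""

    def __init__(self, env, k: int):
        self.env, self.k = env, k
        self.frames = deque(maxlen=k)

    def reset(self):
        obs = self.env.reset()
        self.frames.extend([obs] * self.k)  # fill the window with the first obs
        return np.stack(self.frames)

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.frames.append(obs)
        return np.stack(self.frames), reward, done, info


def make_unity_env(env_ctor, initialize_config=None, stack_frames=1):
    """`initialize_config` would be sent to Unity on connect (e.g. number of
    parallel environments, radar density, force-reset flag)."""
    env = env_ctor(initialize_config or {})
    return FrameStack(env, stack_frames) if stack_frames > 1 else env
```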
@StepNeverStop StepNeverStop created this issue from a note in Tasks (To Do) Dec 28, 2020
@StepNeverStop StepNeverStop self-assigned this Dec 28, 2020
@StepNeverStop StepNeverStop added the enhancement New feature or request label Dec 28, 2020
StepNeverStop added a commit that referenced this issue Dec 31, 2020
- add `initialize_config` in config.yaml (example below)
- add `reset_config` and `step_config` for unity training
- remove `GrayVisualWrapper`, `ResizeVisualWrapper`, `StackVisualWrapper`
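A minimal sketch of how these three config groups might be laid out and consumed; the keys are illustrative, and the real schema in config.yaml may differ:

```python
# Illustrative only: key names below are assumptions, not the real schema.
import yaml

CONFIG_YAML = """
initialize_config:   # sent to Unity once, when the connection is made
  env_copies: 4
  force_reset: true
reset_config:        # forwarded on every environment reset
  seed: 0
step_config:         # forwarded on every environment step
  time_scale: 20
"""

config = yaml.safe_load(CONFIG_YAML)
print(config["initialize_config"]["env_copies"])  # -> 4
```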
@StepNeverStop StepNeverStop moved this from To Do to In Progress in Tasks Dec 31, 2020
StepNeverStop added a commit that referenced this issue Dec 31, 2020
… command line when using unity training agents (#25)
StepNeverStop added a commit that referenced this issue Jan 1, 2021
- add `UnitySingleBehaviorInfo` in indexs.py (sketched below)
- remove BasicActionWrapper
- remove redundant identifiers
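A plausible shape for `UnitySingleBehaviorInfo`; the real definition in indexs.py may differ:

```python
# Hypothetical sketch: a lightweight record describing one Unity behavior.
from typing import List, NamedTuple, Tuple


class UnitySingleBehaviorInfo(NamedTuple):
    behavior_name: str
    n_agents: int
    vector_dims: List[int]                   # one entry per vector observation
    visual_dims: List[Tuple[int, int, int]]  # one (H, W, C) per visual observation
    is_continuous: bool


info = UnitySingleBehaviorInfo("3DBall?team=0", 12, [8], [], True)
```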
StepNeverStop added a commit that referenced this issue Jan 5, 2021
- support multi-vector and multi-visual input
- optimize `gym` and `unity` wrappers
- fix `ActorCriticValueCts`
- tag 2.0.0
- add `ObsSpec` (sketched below)
- refactor `SingleAgentEnvArgs` and `MultiAgentEnvArgs`
- remove `self.s_dim`, use `self.concat_vector_dim` instead
- temporarily stop using vector input normalization
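A sketch of an `ObsSpec`-style container that supports multi-vector and multi-visual input; apart from `concat_vector_dim`, the field names are assumptions:

```python
# Sketch only: fields are assumed, but `concat_vector_dim` mirrors the
# commit's replacement for the old scalar `self.s_dim`.
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class ObsSpec:
    vector_dims: List[int] = field(default_factory=list)
    visual_dims: List[Tuple[int, int, int]] = field(default_factory=list)

    @property
    def concat_vector_dim(self) -> int:
        # all vector parts concatenated into one flat dimension
        return sum(self.vector_dims)


spec = ObsSpec(vector_dims=[8, 4], visual_dims=[(84, 84, 3)])
assert spec.concat_vector_dim == 12
```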
@StepNeverStop StepNeverStop added the optimization Better performance or solution label Jan 6, 2021
StepNeverStop added a commit that referenced this issue Jul 2, 2021
…training. (#41,#25,#31)

1. changed variable name from `is_lg_batch_size` to `can_sample` (sketched below)
2. optimized unity wrapper
3. optimized multi-agent replay buffers
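The rename is easy to picture as a buffer property; a minimal sketch, assuming a simple uniform replay buffer (internals are not from the repo):

```python
# Minimal sketch: `can_sample` reads better than `is_lg_batch_size`
# ("is larger than batch size"). Buffer internals here are assumptions.
import random
from collections import deque


class ReplayBuffer:
    def __init__(self, capacity: int, batch_size: int):
        self.buffer = deque(maxlen=capacity)
        self.batch_size = batch_size

    def add(self, transition):
        self.buffer.append(transition)

    @property
    def can_sample(self) -> bool:
        # enough transitions stored to draw one full batch
        return len(self.buffer) >= self.batch_size

    def sample(self):
        return random.sample(self.buffer, self.batch_size)
```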
StepNeverStop added a commit that referenced this issue Jul 4, 2021
1. fixed the n-step replay buffer (see the return sketch after this list)
2. reconstructed the representation net
3. removed `use_stack`
4. implemented multi-agent algorithms with shared parameters
5. optimized the agent network
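The usual pitfall an n-step buffer must get right is the discounted window return with truncation at episode ends; a minimal sketch, not the repo's actual implementation:

```python
# Sketch of the n-step return: rewards are discounted within the window,
# and accumulation stops when an episode ends inside the window.
def n_step_return(rewards, dones, gamma=0.99):
    g, discount = 0.0, 1.0
    for r, d in zip(rewards, dones):
        g += discount * r
        if d:  # episode ended: later rewards belong to the next episode
            break
        discount *= gamma
    return g


assert n_step_return([1.0, 1.0, 1.0], [False, False, False], gamma=0.5) == 1.75
assert n_step_return([1.0, 1.0, 1.0], [False, True, False], gamma=0.5) == 1.5
```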
StepNeverStop added a commit that referenced this issue Jul 28, 2021
1. removed SARL off-policy algorithm `pd_ddpg`, because it is not mainstream
2. updated README
3. removed `iql` and added script `IndependentMA.py` instead to implement independent multi-agent algorithms (sketched after this list)
4. optimized summary writing
5. moved `NamedDict` from `rls.common.config` to `rls.common.specs`
6. updated example config
7. updated `.gitignore`
8. added property `is_multi` to identify whether the training task is SARL or MARL, for both unity and gym
9. reconstructed inheritance relationships between algorithms and their superclasses
10. replaced `1.e+18` in yaml files with a large integer literal, because we want a large integer rather than a float
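The idea behind `IndependentMA.py` can be sketched as a thin wrapper that runs one single-agent algorithm per agent; the interface below is an assumption, not the actual script:

```python
# Hypothetical sketch of independent multi-agent training: each agent
# owns its own single-agent model and its own experience stream.
class IndependentMA:
    def __init__(self, algo_cls, n_agents: int, **algo_kwargs):
        self.models = [algo_cls(**algo_kwargs) for _ in range(n_agents)]

    def select_actions(self, observations):
        # every agent acts from its own observation, independently
        return [m.select_action(o) for m, o in zip(self.models, observations)]

    def learn(self, experiences):
        # every agent trains only on its own experience
        for m, exp in zip(self.models, experiences):
            m.learn(exp)
```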
StepNeverStop added a commit that referenced this issue Jul 29, 2021
1. added `test.yaml` for quickly verifying RLs
2. changed folder name from `algos` to `algorithms` for better readability
3. removed the single-agent recorder; all algorithms (SARL & MARL) now use `SimpleMovingAverageRecoder`
4. removed `GymVectorizedType` in `common/specs.py`
5. removed `common/train/*` and implemented a unified training interface in `rls/train`
6. reconstructed the `make_env` function in `rls/envs/make_env`
7. optimized function `load_config`
8. moved `off_policy_buffer.yaml` to `rls/configs/buffer`
9. removed configurations like `eval_while_train`, `add_noise2buffer`, etc.
10. optimized environments' configuration files
11. optimized environment wrappers and implemented a unified env interface for `gym` and `unity`; see `env_base.py` (sketched after this list)
12. updated dockerfiles
13. updated README
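A sketch of what a unified interface in the spirit of `env_base.py` might require from both backends; the actual abstract methods may differ:

```python
# Hypothetical sketch: one contract that both `gym` and `unity` backends
# implement, so training code never branches on the backend type.
from abc import ABC, abstractmethod


class EnvBase(ABC):
    @abstractmethod
    def reset(self, **kwargs):
        """Start a new episode and return the initial observation(s)."""

    @abstractmethod
    def step(self, actions, **kwargs):
        """Apply actions; return observation(s), reward(s), done(s), info."""

    @abstractmethod
    def render(self, **kwargs):
        """Visualize the environment."""

    @property
    @abstractmethod
    def is_multi(self) -> bool:
        """Whether the training task is MARL (True) or SARL (False)."""
```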
StepNeverStop added a commit that referenced this issue Jul 29, 2021
…ng. (#34, #25)

1. updated `setup.py`
2. removed redundant packages
3. fixed bugs in the unity wrapper
4. fixed bugs in agent models that occurred in continuous-action training tasks
5. fixed bugs in class `MLP`
StepNeverStop added a commit that referenced this issue Sep 4, 2021
*. redefined version to v0.0.1
1. removed package `supersuit`
2. implemented class `MPIEnv`
3. implemented class `VECEnv`
4. optimized env wrappers; implemented a `render` method for the `gyms` environments.
5. reconstructed some of the returns of `env.step`, splitting `obs` into `obs_fa` and `obs_fs` (see the sketch after this list).
  - `obs_fa` is the observation the agent/policy chooses actions from. At the crossing point of episodes i and i+1, `obs_fa` represents $observation_{i+1}^{0}$; otherwise it is identical to `obs_fs`, which represents $observation_{i}^{t}$.
  - `obs_fs` is the observation stored in the buffer. At the crossing point of episodes i and i+1, `obs_fs` represents $observation_{i}^{T}$; otherwise it is identical to `obs_fa`.
6. optimized `rssm`-related logic based on the aforementioned `obs_fs`.
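A sketch of the boundary case that motivates the split; the auto-reset behavior below is an assumption about how the wrapper works:

```python
# At the step that ends episode i, the env returns the terminal
# observation o_i^T (stored as obs_fs) while the policy must act on the
# first observation of episode i+1, o_{i+1}^0 (returned as obs_fa).
def step_env(env, action):
    obs, reward, done, info = env.step(action)
    if done:
        obs_fs = obs          # o_i^T: goes into the replay buffer
        obs_fa = env.reset()  # o_{i+1}^0: what the policy acts on next
    else:
        obs_fa = obs_fs = obs  # o_i^t: identical away from episode boundaries
    return obs_fa, obs_fs, reward, done, info
```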