Humanoid -V2

Objective

Learn & Apply reinforcement learning techniques on complex continuous control domain to achieve maximum rewards. In the continuous control domain, where actions are continuous and often high-dimensional such as OpenAI-Gym environment Humanoid-V2. The Humanoid environment has 377 Observation dimensions and 17 action dimensions. This problem requires temporal difference learning compared to supervised learning since it has so many moving parts that are hard to debug, and they require substantial efforts in tuning in order to get good results. Also, in supervised learning problems, progress has been driven by large labeled datasets like ImageNet. In Reinforcement Learning, the closest equivalent would be a large and diverse collection of environments.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
src		src
README.md		README.md
humanoid-v2.ipynb		humanoid-v2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

README.md

README.md

humanoid-v2.ipynb

humanoid-v2.ipynb

Repository files navigation

Humanoid -V2

Objective

About

Releases

Packages

Languages

raphaelsc19/MSDS696

Folders and files

Latest commit

History

Repository files navigation

Humanoid -V2

Objective

About

Resources

Stars

Watchers

Forks

Languages