Data Satanist, Reinforcement learning researcher
-
ARVI Lab
- Kyiv
- https://www.linkedin.com/in/poddiachyi/
- @poddiachyi
Pinned Loading
-
ppo-for-young-researcherry
ppo-for-young-researcherry PublicProximal Policy Optimization. Basic example for YoungResearcherry community.
Python 2
-
alpha-zero-tic-tac-toe
alpha-zero-tic-tac-toe PublicAlphaZero. Tic-tac-toe example for YoungResearcherry community.
-
hierarchical-dqn
hierarchical-dqn PublicHierarchical-DQN in PyTorch on MountainCar environment
-
-
meta-reinforcement-learning
meta-reinforcement-learning PublicMeta Reinforcement Learning with DeepMind's Alchemy
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.