PhD student at UCL, Interested in Offline Reinforcement Learning (RL), Data-Efficient RL and Neuro-Symbolic Methods for RL.
-
University College London
- London, UK
- zhengyaojiang.github.io
- @zhengyaojiang
Pinned Loading
-
latentplan
latentplan PublicCode release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
-
Farama-Foundation/chatarena
Farama-Foundation/chatarena PublicChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
-
graphbackup
graphbackup PublicCode release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824
-
PGPortfolio
PGPortfolio PublicPGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.