ZhengyaoJiang

Follow

Zhengyao Jiang ZhengyaoJiang

Follow

PhD student at UCL, Interested in Offline Reinforcement Learning (RL), Data-Efficient RL and Neuro-Symbolic Methods for RL.

462 followers · 30 following

University College London
London, UK
zhengyaojiang.github.io
@zhengyaojiang

Achievements

Achievements

Highlights

Pro

Organizations

Block or Report

Block or report ZhengyaoJiang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

latentplan latentplan Public

Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.

Python 88 10
Farama-Foundation/chatarena Farama-Foundation/chatarena Public

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1.3k 126
graphbackup graphbackup Public

Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824

Python 5 1
GTG GTG Public

Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).

Python 27 7
NLRL NLRL Public

Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)

Python 74 27
PGPortfolio PGPortfolio Public

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Python 1.7k 745