Skip to content
View ZhengyaoJiang's full-sized avatar

Highlights

  • Pro

Organizations

@uclnlp @ucl-dark
Block or Report

Block or report ZhengyaoJiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. latentplan latentplan Public

    Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.

    Python 86 9

  2. Farama-Foundation/chatarena Farama-Foundation/chatarena Public

    ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

    Python 1.2k 125

  3. graphbackup graphbackup Public

    Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824

    Python 5 1

  4. GTG GTG Public

    Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).

    Python 27 7

  5. NLRL NLRL Public

    Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)

    Python 73 27

  6. PGPortfolio PGPortfolio Public

    PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

    Python 1.7k 745