Skip to content
View ruizhaogit's full-sized avatar
🎯
Focus
🎯
Focus
Block or Report

Block or report ruizhaogit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. music music Public

    Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)

    Python 35 4

  2. EnergyBasedPrioritization EnergyBasedPrioritization Public

    Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)

    Python 32 9

  3. mep mep Public

    Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)

    Python 23 6

  4. maximum_entropy_population_based_training maximum_entropy_population_based_training Public

    Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination

    Python 23 4

  5. GuessWhat-TemperedPolicyGradient GuessWhat-TemperedPolicyGradient Public

    Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient (SLT 2018) (IJCAIw 2018)

    Lua 8 2

  6. MNIST-GuessNumber MNIST-GuessNumber Public

    Efficient Dialog Policy Learning via Positive Memory Retention (SLT 2018) (NIPSw 2018)

    Python 6 1