Organizations

@HumanCompatibleAI

Pinned

  1. (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards

    Python 10 2

  2. The Firmament cluster scheduling platform

    C++ 371 71

  3. Adam Gleave's Cambridge Part II Project

    C++ 1

  4. Find best-response to a fixed policy in multi-agent RL

    Python 58 12

  5. Forked from openai/baselines

    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

    Python 1.9k 372

621 contributions in the last year

Contribution activity

April 1, 2020

AdamGleave has no activity yet for this period.

March 2020

Created a pull request in HumanCompatibleAI/adversarial-policies that received 1 comment

Add paper hyperparameters as config to train

Currently, the only way to replicate the paper is via aprl.multi.train, but this is heavyweight for many purposes. Add the paper hyperparameters to a named …

+23 −13 1 comment
