Skip to content
View Yuhao-Wan's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report Yuhao-Wan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. time-varying-discount time-varying-discount Public

    A practical method to reduce discounting-induced bias during training in deeep Q-networks.

    Python

  2. deep-reinforcement-learning deep-reinforcement-learning Public

    Implementations of deep reinforcement learning algorithms in Tensorflow

    Python 2 1

  3. Pairwise-combinatorial-learner Pairwise-combinatorial-learner Public

    Python implementation of algorithm in Learning Combinatorial Functions from Pairwise Comparisons

    Python 1 1

  4. Gaussian-processes Gaussian-processes Public

    Python implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"

    Jupyter Notebook 1