Skip to content
View dyth's full-sized avatar

Block or report dyth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dyth/README.md

David Yu-Tung Hui, 許宇同

I am an independent researcher interested in Deep Reinforcement Learning. My research focuses on increasing the optimization stability of off-policy gradient-based Q -learning algorithms over a range of tasks and hyperparameters. I'm especially interested in developing algorithms to solve continuous control tasks.

I've written two works along this research direction:

  1. Stabilizing Q-Learning for Continuous Control
    David Yu-Tung Hui
    MSc Thesis, University of Montreal, 2022
    I showed that using LayerNorm in the critic of DDPG prevented divergence during training in MuJoCo and DeepMind Control continuous control environments, enabling non-trivial behaviors to be learned in the dog-run task of DeepMind Control.
    [.pdf] [Errata]

  2. Double Gumbel Q-Learning
    David Yu-Tung Hui, Aaron Courville, Pierre-Luc Bacon
    Spotlight at NeurIPS 2023
    We modeled noise introduced by a function approximator in Q -learning as a heteroscedastic Gumbel distribution and derived a loss function from this noise model that was effective in off-policy continuous control -- our resultant algorithm achieved ~2x the aggregate performance of SAC after 1M training timesteps.
    [.pdf] [Reviews] [Poster (.png)] [5-min talk] [1-hour seminar] [Code (GitHub)] [Errata]

In 2023, I graduated with an MSc from Mila, University of Montreal. I'm looking for opportunities where I can continue my research.

For more information about me, see my Google Scholar.

Pinned Loading

  1. doublegum Public

    NeurIPS 2023 Spotlight

    Python 10 3

792 contributions in the last year

Contribution Graph
Day of Week April May June July August September October November December January February March
Sunday
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More

Activity overview

Contributed to dyth/dyth, dyth/causal-entropic-forces, dyth/doublegum and 1 other repository
Loading A graph representing dyth's contributions from April 07, 2024 to April 07, 2025. The contributions are 98% commits, 2% issues, 0% pull requests, 0% code review.   Code review 2% Issues   Pull requests 98% Commits

Contribution activity

April 2025

dyth has no activity yet for this period.
Loading