Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[S-02-1] Understanding HAC & UOF theory and sources #10

Closed
Tracked by #8
CUN-bjy opened this issue Aug 25, 2021 · 1 comment
Closed
Tracked by #8

[S-02-1] Understanding HAC & UOF theory and sources #10

CUN-bjy opened this issue Aug 25, 2021 · 1 comment
Assignees
Labels
documentation Improvements or additions to documentation
Milestone

Comments

@CUN-bjy
Copy link
Member

CUN-bjy commented Aug 25, 2021

Ref)

  • HAC
  • UOF,
    • Yang, Xintong, et al. "Hierarchical Reinforcement Learning With Universal Policies for Multistep Robotic Manipulation." IEEE Transactions on Neural Networks and Learning Systems (2021).
    • code : UOF-paper-code
@CUN-bjy CUN-bjy changed the title Understanding HAC & UOF theory and sources - 1week Understanding HAC & UOF theory and sources Aug 28, 2021
@CUN-bjy CUN-bjy changed the title Understanding HAC & UOF theory and sources [S-02-1]Understanding HAC & UOF theory and sources Aug 28, 2021
@CUN-bjy CUN-bjy changed the title [S-02-1]Understanding HAC & UOF theory and sources [S-02-1] Understanding HAC & UOF theory and sources Aug 28, 2021
@CUN-bjy CUN-bjy added the documentation Improvements or additions to documentation label Aug 28, 2021
@CUN-bjy CUN-bjy added this to Stage 2 in project-sandwich-man Aug 28, 2021
@CUN-bjy CUN-bjy added this to the Stage 2 milestone Aug 28, 2021
@CUN-bjy
Copy link
Member Author

CUN-bjy commented Aug 30, 2021

1. HAC(Hierarchical Actor-Critic)

: Learning multi-level hierarchies with hindsight

Features

  • DDPG + HER for each-level policy
  • Nested Policy Structure
  • parallel training for multi-level policy
    • hindsight action transition
    • hindsight goal transition
    • subgoal testing transition

2. UOF(Universal Option Framework)

: Hierarchical Reinforcement Learning With Universal Policies for Multistep Robotic Manipulation

Features

  • DIOL(Deep Intraoption learning) for high-level policy update
  • DDPG(using double critics) + HER for low-level policy
  • AAES(Auto-Adjusting Exploration Strategy) for stabilizing low-level policy in parallel training
  • Abstract Demonstration to accelerate

@CUN-bjy CUN-bjy closed this as completed Sep 1, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
Development

No branches or pull requests

2 participants