Problems with the exact gradient method in original paper #2

V-Enzo · 2020-05-27T12:47:35Z

Hi xiangyu, sorry to bother you. I have recently been trying to implement the exact gradient method (gradient descent for LQR with a gradient oracle), as derived in Lemma 1. P_K can be computed iteratively, starting from P = Q, but \Sigma_K is hard to compute, since it sums x_t x_t^T from t = 0 to infinity. In my code I truncate the infinite horizon at 1000 steps and run 20000 epochs with learning rate 1e-3, state_dim 100, and action_dim 20. However, the result does not seem to converge as shown on page 38 of the original paper. Since I am new to this and working on the problem alone, I would really appreciate any insights you can give me. Thank you!
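A minimal sketch of the computation described above, under stated assumptions rather than the repository's actual code. It assumes the control convention u = -K x, the gradient expression 2((R + B^T P_K B) K - B^T P_K A) \Sigma_K, and that \Sigma_K satisfies \Sigma_K = \Sigma_0 + (A - BK) \Sigma_K (A - BK)^T, so the infinite sum can be obtained from a linear solve instead of a truncated rollout. The helper names (`solve_dlyap`, `lqr_cost_and_grad`) and the 2-state, 1-input system are hypothetical and chosen only for illustration.

```python
import numpy as np

def solve_dlyap(M, W):
    """Solve X = M X M^T + W by vectorization (only practical for small n)."""
    n = M.shape[0]
    lhs = np.eye(n * n) - np.kron(M, M)
    return np.linalg.solve(lhs, W.reshape(-1)).reshape(n, n)

def lqr_cost_and_grad(K, A, B, Q, R, Sigma0):
    """Cost C(K) = tr(P_K Sigma0) and its gradient, assuming u = -K x."""
    Acl = A - B @ K  # closed-loop dynamics
    if np.max(np.abs(np.linalg.eigvals(Acl))) >= 1.0:
        raise ValueError("K is not stabilizing; cost is infinite")
    P = solve_dlyap(Acl.T, Q + K.T @ R @ K)   # P_K
    Sigma = solve_dlyap(Acl, Sigma0)          # Sigma_K, no truncation
    E = (R + B.T @ P @ B) @ K - B.T @ P @ A
    return np.trace(P @ Sigma0), 2.0 * E @ Sigma

# Toy problem (made-up matrices, not the 100 x 20 setup from the question).
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Q, R, Sigma0 = np.eye(2), np.eye(1), np.eye(2)

K = np.zeros((1, 2))  # stabilizing start, since A itself is stable
cost0, _ = lqr_cost_and_grad(K, A, B, Q, R, Sigma0)
for _ in range(500):
    cost, grad = lqr_cost_and_grad(K, A, B, Q, R, Sigma0)
    K = K - 1e-3 * grad  # plain gradient descent step
cost, grad = lqr_cost_and_grad(K, A, B, Q, R, Sigma0)
print(cost0, cost, np.linalg.norm(grad))
```

The Kronecker-based solve builds an n^2 by n^2 matrix, so for a 100-dimensional state a dedicated routine such as `scipy.linalg.solve_discrete_lyapunov` would likely be the better choice. The stability check matters because both sums diverge once A - BK has an eigenvalue on or outside the unit circle.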
