Course projects of CS395T Numerical Optimization, UT Austin
Updated Dec 6, 2017 - Python
Trust Region Policy Optimization (TRPO) in pure TensorFlow
PyTorch implementation of Trust Region Policy Optimization
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
Python implementation of some numerical (optimization) methods
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
A PyTorch implementation of TRPO
A collection of Reinforcement Learning implementations with PyTorch