RL_research :

-- About : Mainly focusses on augmenting sample efficiency in conventional RL algorithms and designing of new ones, using novel techniques based on deep generative models, optimization, machine learning techniques etc.

sac_ipns -- Implements a novel intrinsic reward generation technique termed IPNS, augmenting exploration of Soft actor critic (SAC) algorithm and improving its performance.
ddpg_td3_ipns -- Pairing of IPNS artifacts with conventional DDPG and TD3 algorithms.
sac_isac -- implementation of my work named ISAC: Improved Soft Actor-Critic, a later version of the shared code was used to generate the plots in the paper. In our proposed improved SAC (ISAC), we first introduce a new prioritization scheme for selecting better samples from the experience replay (ER) buffer. Second we use a mixture of the prioritized off-policy data with the latest on-policy data for training the policy and value function networks.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
ddpg_td3_ipns		ddpg_td3_ipns
sac_ipns		sac_ipns
sac_isac		sac_isac
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL_research :

About

Releases

Packages

Languages

License

cbanerji/Sample_efficient_RL.

Folders and files

Latest commit

History

Repository files navigation

RL_research :

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages