DFS: Distillation From State-based Policy

A project under active development and exploration.

Structure

configs: YAML config files for launching an experiment.

Agent: An entity that can act or learn in an environment, like a living creature. Put the components of an agent (its "organs") in module.py. Currently available: state-based SAC, visual-based SAC, and a vanilla BC (behavior cloning) agent.

Algorithm: A blueprint that describes how to train an agent. Currently only vanilla state-based and visual-based SAC are available.

Environment: OpenAI Gym-style environments with methods such as step() and reset(). Currently available: dm-control and distracting cs. Note that you need to install the dm-control package first: pip install dm-control

utils.py: Other reusable tools and components, in particular ReplayBuffer and ContrastBuffer.
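
To make the role of a replay buffer concrete, here is a minimal sketch of a fixed-capacity buffer with uniform sampling. It is an illustration only: the class below is hypothetical and does not reproduce the ReplayBuffer in utils.py, whose fields and method names may differ.

```python
import random


class ReplayBuffer:
    """Hypothetical ring buffer; not the repo's actual implementation."""

    def __init__(self, capacity, seed=None):
        self.capacity = capacity
        self._storage = []
        self._next = 0  # slot that the next overwrite will use
        self._rng = random.Random(seed)

    def __len__(self):
        return len(self._storage)

    def add(self, obs, action, reward, next_obs, done):
        item = (obs, action, reward, next_obs, done)
        if len(self._storage) < self.capacity:
            self._storage.append(item)
        else:
            # Buffer is full: overwrite the oldest transition.
            self._storage[self._next] = item
        self._next = (self._next + 1) % self.capacity

    def sample(self, batch_size):
        # Uniform sampling with replacement over stored transitions.
        batch = self._rng.choices(self._storage, k=batch_size)
        obs, actions, rewards, next_obs, dones = zip(*batch)
        return obs, actions, rewards, next_obs, dones


buffer = ReplayBuffer(capacity=3, seed=0)
for t in range(5):
    buffer.add([t], 0.0, 1.0, [t + 1], False)
batch = buffer.sample(4)
```

After five insertions into a buffer of capacity three, only the three most recent transitions remain, so every sampled observation comes from steps 2 to 4.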

TODO

Agent:

Enable the BC agent to update from representations; see https://github.com/HobbitLong/RepDistiller. You may want to complete utils.py first. Do not use the AliasMethod from that repo; just sample uniformly. -- Done!
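
As an illustration of the "just sample uniformly" choice, the sketch below draws negative indices uniformly from a buffer while skipping the anchor (positive) index. The function name and its arguments are hypothetical, and whether ContrastBuffer excludes the anchor in this way is an assumption.

```python
import random


def sample_negative_indices(buffer_size, anchor_idx, num_negatives, rng=random):
    """Draw indices uniformly (with replacement), never returning the anchor."""
    if buffer_size < 2:
        raise ValueError("need at least two entries to draw a negative")
    negatives = []
    for _ in range(num_negatives):
        # Draw from buffer_size - 1 slots, then shift past the anchor.
        idx = rng.randrange(buffer_size - 1)
        if idx >= anchor_idx:
            idx += 1
        negatives.append(idx)
    return negatives


negs = sample_negative_indices(100, anchor_idx=7, num_negatives=16,
                               rng=random.Random(0))
```

Because every non-anchor index is equally likely, no precomputed probability table is needed.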

Algorithm:

(Ours) First train a state-based SAC agent, then use DAgger (http://arxiv.org/abs/1011.0686) and contrastive representation distillation (CRD) to train a visual-based actor. You can think of it as using crd_loss + bc_loss in the update step of DAgger. -- Distill the actor but not the critic (which still needs state information during training).
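
The crd_loss + bc_loss combination can be sketched for a single sample as follows. This is a simplified illustration, not the repo's code: the contrastive term is an InfoNCE-style stand-in rather than the exact CRD objective, the BC term is assumed to be a mean squared error, and crd_weight, temperature, and all function names are hypothetical. In a DAgger update, expert_action would be the state-based SAC actor's label for a state visited by the visual student.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def bc_loss(student_action, expert_action):
    """Mean squared error between student and expert actions (assumed form)."""
    n = len(student_action)
    return sum((s - e) ** 2 for s, e in zip(student_action, expert_action)) / n


def contrastive_loss(student_feat, teacher_feat, negative_feats, temperature=0.1):
    """InfoNCE-style loss: the matching teacher feature is the positive."""
    logits = [dot(student_feat, teacher_feat) / temperature]
    logits += [dot(student_feat, neg) / temperature for neg in negative_feats]
    # Numerically stable log-sum-exp over the positive and the negatives.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[0]


def distill_loss(student_action, expert_action, student_feat, teacher_feat,
                 negative_feats, crd_weight=1.0):
    """Per-sample total: bc_loss + crd_weight * contrastive term."""
    return (bc_loss(student_action, expert_action)
            + crd_weight * contrastive_loss(student_feat, teacher_feat,
                                            negative_feats))


# Student feature aligned with the teacher feature: small loss.
aligned = distill_loss([0.1], [0.1], [1.0, 0.0], [1.0, 0.0],
                       [[0.0, 1.0], [-1.0, 0.0]])
# Student feature aligned with a negative instead: large loss.
misaligned = distill_loss([0.1], [0.1], [1.0, 0.0], [0.0, 1.0],
                          [[1.0, 0.0], [-1.0, 0.0]])
```

Only the actor's features and actions enter this loss, which matches the note above that the critic is not distilled.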

Others:

ManiSkill2 env
CRD in both the critic and the actor, so that we can continue training from observations alone?

