Note: please check out the URLB codebase for the most up-to-date and improved implementation. This repo is no longer actively maintained.
Author's code for reproducing the experiments in Behavior From the Void: Unsupervised Active Pre-Training. It consists of unsupervised pretraining in a single Atari environment, followed by adaptation to a downstream reward function.
The code was ported out with readability in mind; it will hopefully serve as a simple starting point for future research.
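The intrinsic reward in APT is a particle-based (k-nearest-neighbor) estimate of state entropy: each state's reward grows with the average distance from its learned representation to its k nearest neighbors within a batch. The sketch below only illustrates that idea; the function name, the choice of k, and the `1 +` constant are illustrative, the representations `z` are assumed to come from the learned encoder/projection (cf. `--proj_dim` in the command below), and this is not the repository's exact code.

```python
import torch

def knn_intrinsic_reward(z: torch.Tensor, k: int = 12) -> torch.Tensor:
    """Sketch of a particle-based entropy reward.

    z: (batch, proj_dim) projected state representations from the same batch.
    Returns one intrinsic reward per state.
    """
    dists = torch.cdist(z, z, p=2)                           # pairwise L2 distances, (B, B)
    knn_dists, _ = dists.topk(k + 1, dim=1, largest=False)   # k nearest neighbors plus self
    knn_dists = knn_dists[:, 1:]                             # drop the zero distance to self
    return torch.log(1.0 + knn_dists.mean(dim=1))            # larger when neighbors are far away
```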
Run `python train.py --env_name breakout --batch_size 64 --rainbow_conv --aug --enable_cudnn --n_step 20 --start_timestep 1600 --reward_clip 1 --knn_rms --id run --proj_dim 256`
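The `--knn_rms` flag presumably rescales the k-NN reward by running statistics so its magnitude stays stable over training. Below is a minimal sketch of such a normalizer; the class name, the tracked statistics, and the update rule are assumptions rather than the repository's actual implementation. A typical use would be to call `update` and then `normalize` on each batch of intrinsic rewards.

```python
import torch

class RunningMeanStd:
    """Hypothetical running mean/variance tracker for normalizing intrinsic rewards."""

    def __init__(self, eps: float = 1e-4):
        self.mean, self.var, self.count = 0.0, 1.0, eps

    def update(self, x: torch.Tensor) -> None:
        # Parallel (Chan's) update of the running mean/variance with a new batch.
        batch_mean = x.mean().item()
        batch_var = x.var(unbiased=False).item()
        batch_count = x.numel()
        delta = batch_mean - self.mean
        total = self.count + batch_count
        m2 = (self.var * self.count + batch_var * batch_count
              + delta ** 2 * self.count * batch_count / total)
        self.mean = self.mean + delta * batch_count / total
        self.var = m2 / total
        self.count = total

    def normalize(self, x: torch.Tensor) -> torch.Tensor:
        # Divide by the running standard deviation; the mean is not subtracted,
        # so the sign and ordering of the rewards are preserved.
        return x / (self.var ** 0.5 + 1e-8)
```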
- Include hyperparameter settings as a config file (see the sketch after this list)
- Add a requirements file
- Add a reward plotter
- Port out the code for Atari
- Port out the code for DMC
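Until the config file lands, the flags from the command above can be read as a flat hyperparameter set; the illustrative Python dict below simply mirrors them (the inline comments are my reading of the flag names, not documentation, and this file does not exist in the repo).

```python
# Illustrative hyperparameter config mirroring the command-line flags above.
CONFIG = {
    "env_name": "breakout",
    "batch_size": 64,
    "rainbow_conv": True,    # presumably: use the Rainbow-style convolutional encoder
    "aug": True,             # presumably: enable image augmentation
    "enable_cudnn": True,
    "n_step": 20,            # n-step return length
    "start_timestep": 1600,
    "reward_clip": 1,
    "knn_rms": True,         # presumably: normalize the k-NN reward with running stats
    "id": "run",
    "proj_dim": 256,         # dimensionality of the projected representation
}
```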