Skip to content

forhaoliu/mini_apt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Note: please check out URLB codebase for most up-to-date and improved implementation. This repo is no longer actively maintained.


Author's code for reproducing experiments in Behavior from the void: unsupervised active pre-training. It consists of unsupervised pretraining in single Atari environment, and subsequently adapt to downstream reward function.

The code is ported out with readability in mind, it can hopeful serve as a simple starting point for future research.

Model

APT model diagram

Usage

Run python train.py --env_name breakout --batch_size 64 --rainbow_conv --aug --enable_cudnn --n_step 20 --start_timestep 1600 --reward_clip 1 --knn_rms --id run --proj_dim 256

Todo

  • include hyperparameter setting as config file
  • requirement file
  • reward plotter
  • port out code on Atari
  • port out code on DMC

Contact

hao.liu@cs.berkeley.edu

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages