GitHub - purujitgoyal/batch-RL

BATCH-RL HCOPE

This repository implements the High Confidence Policy Improvement which tries to improve upon the given behaviour policy using data generated by that policy.

Instructions to run:

Make sure latest version of Anaconda is installed for python 3.7.
run pip install cma to install cma-es library
Data pre processing has been done in 'Data Preprocessing' notebook and histories data has been stored in a new 'histories.csv', while all other metadata like num of actions, state features, confidence have been hardcoded.
run nohup python hcope.py & to run the code. It will save the logs to 'nohup.out' file while candidate thetas which pass the safety test will get saved to output/i.csv files.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
evaluate		evaluate
output		output
policies		policies
.gitignore		.gitignore
CMA-ES Test.ipynb		CMA-ES Test.ipynb
Data Preprocessing.ipynb		Data Preprocessing.ipynb
README.md		README.md
data.csv		data.csv
hcope.py		hcope.py
histories.csv		histories.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BATCH-RL HCOPE

Instructions to run:

About

Releases

Packages

Languages

purujitgoyal/batch-RL

Folders and files

Latest commit

History

Repository files navigation

BATCH-RL HCOPE

Instructions to run:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages