Safe Value Functions: Learned Critics as Hard Safety Constraints

This paper was awarded Outstanding Paper at WFVML 2023

This is the code for "Safe Value Functions: Learned Critics as Hard Safety Constraints"

This codebase is implemented using PyTorch, building on CleanRL.

Experiments

Instructions to reproduce the experiments in the paper can be found here

Citing

If you find our work useful, please consider citing:

@article{
    tan2023value,
    title={Safe Value Functions: Learned Critics as Hard Safety Constraints}, 
    author={Daniel C. H. Tan and Fernando Acero and Robert McCarthy and Andromachi Maria Delfaki and Zhibin Li and Dimitrios Kanoulas},
    year={2024},
    eprint={2306.04026},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
experiments		experiments
requirements		requirements
rl_cbf		rl_cbf
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
License		License
Makefile		Makefile
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Safe Value Functions: Learned Critics as Hard Safety Constraints

Experiments

Citing

About

Releases

Packages

Contributors 2

Languages

License

dtch1997/rl_cbf

Folders and files

Latest commit

History

Repository files navigation

Safe Value Functions: Learned Critics as Hard Safety Constraints

Experiments

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages