📃 Paper | 🤗 Data | 📚 Project Page
Kaiwen Zhou, Shreedhar Jangam*, Ashwin Nagarajan*, Tejas Polu*, Suhas Oruganti, Chengzhi Liu, Ching-Chen Kuo, Yuting Zheng, Sravana Narayanaraju, Xin Eric Wang
This repo is based on a fork of OpenHands. Please follow the instruction here to set up the environment for OpenHands, and LLM info in config.toml.
The SafePro dataset is here. Follow the instructions here to test LLM agents on SafePro and get the safety evaluation results.
TODO
