List of resources about programming practices for writing safety-critical software.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Open-source vulnerability disclosure and bug bounty program database
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
Research on evaluating and aligning the values of Chinese large language models
A collaborative collection of open-source safe GPT-3 prompts that work well
Official datasets and PyTorch implementation repository of SQuARe and KoSBi (ACL 2023)
Safe reinforcement learning with stability guarantees
Safe Bayesian Optimization
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel are detected beneath these hazards. Dataset sourced from Taiwan's construction industry.
[arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker"
Safe Exploration with MPC and Gaussian process models
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
A repository dedicated to terminal escape injections.
Security audit Python project dependencies against security advisory databases.
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
🔥 Datasets and env wrappers for offline safe reinforcement learning
Safety Verification of Deep Neural Networks