ai-fail-safe
Popular repositories Loading
-
safe-reward
safe-reward Publica prototype for an AI safety library that allows an agent to maximize its reward by solving a puzzle in order to prevent the worst-case outcomes of perverse instantiation
Python 8
-
gene-drive
gene-drive Publica project to ensure that all child processes created by an agent "inherit" the agent's safety controls
Repositories
- safe-reward Public
a prototype for an AI safety library that allows an agent to maximize its reward by solving a puzzle in order to prevent the worst-case outcomes of perverse instantiation
ai-fail-safe/safe-reward’s past year of commit activity - mulligan Public
a library designed to shut down an agent exhibiting unexpected behavior providing a potential "mulligan" to human civilization; IN CASE OF FAILURE, DO NOT JUST REMOVE THIS CONSTRAINT AND START IT BACK UP AGAIN
ai-fail-safe/mulligan’s past year of commit activity - gene-drive Public
a project to ensure that all child processes created by an agent "inherit" the agent's safety controls
ai-fail-safe/gene-drive’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…