This is the official repository for "Self-Evaluation as a Defense Against Adversarial Attacks on LLMs" by Hannah Brown, Leon Lin, Kenji Kawaguchi, and Michael Shieh.