Stars
Evaluation
2 repositories
TruthfulQA: Measuring How Models Imitate Human Falsehoods
Set of tools to assess and improve LLM security.