Popular repositories

  1. xstest Public

    Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"

    Jupyter Notebook · 116 stars · 11 forks

  2. hatecheck-data Public

    Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data

    59 stars · 11 forks

  3. msts-multimodal-safety Public

    Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"

    Jupyter Notebook · 16 stars · 2 forks

  4. issuebench Public

    Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"

    Jupyter Notebook · 16 stars · 1 fork

  5. hatecheck-experiments Public

    Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code

    Jupyter Notebook · 11 stars · 3 forks

  6. llm-values-pct Public

    Röttger et al. (ACL 2024): "Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models"

    Jupyter Notebook · 11 stars · 1 fork