Make non-compliance eval results graphical in evals/readme.md

Currently the eval results when you run multiple models and multiple prompt samples is a table that looks like this:

![Image](https://github.com/user-attachments/assets/8645494c-2b9a-4a56-b7ce-5ff22f9d180f)

It would be great to also have a bar-chart of non-compliance (where higher is better) that looks like the results presented in the paper:

![Image](https://github.com/user-attachments/assets/5f454da8-908f-49f6-9058-9b3b764823be)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make non-compliance eval results graphical in evals/readme.md #117

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Make non-compliance eval results graphical in evals/readme.md #117

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions