Skip to content

Make non-compliance eval results graphical in evals/readme.md #117

Closed
@bzorn

Description

@bzorn

Currently the eval results when you run multiple models and multiple prompt samples is a table that looks like this:

Image

It would be great to also have a bar-chart of non-compliance (where higher is better) that looks like the results presented in the paper:

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentationenhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions