Skip to content

Pinned Loading

  1. MicroEvals MicroEvals Public

    Individual evaluations of agent-generated code.

    Python 3

  2. LeWitt-Bench LeWitt-Bench Public

    Forked from maximalmargin/lewitt_instructions

    A small dataset for Sol LeWitt’s instruction-based art.

    Jupyter Notebook 3

  3. agent-runner agent-runner Public

    Turn any model into an agent. Model-agnostic, framework agnostic agent harness.

    Python 31 5

Repositories

Showing 3 of 3 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…