Bring in the current blog post and its information, especially the blog post image, to showcase the evaluation
README extension: The nice thing about generating tests is that it is easy to automatically check whether the result is correct: the tests need to compile and provide 100% coverage. But one can only write such tests if one understands the source, so implicitly we are evaluating the language understanding of the LLM.
Log Maven commands because they can be faulty (remember that the "surefire" plugin needs a fixed version because GitHub's is too old). With that it is easier to debug (symflower v36847)
Think about excluding the "perplexity" models because they have a "per request" cost, and they are the only ones that do that.
Snowflake against Databricks would be a nice comparison since they align company-wise and are new
Include more models (We have the main problem that there are multiple models coming out every day. We should not wait for a "new version" of the eval; we should test these models right away and compare them. Big problem: how do we promote findings?)
Figure out the "perfect" coverage score so we can display percentage of coverage reached
Make coverage metric fair
Save the descriptions of the models as well: https://openrouter.ai/api/v1/models The reason is that these can change over time, and we need to know after a while what they were, e.g. right now I would like to know if mistral-7b-instruct for the last evaluation was v0.1 or not
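A minimal sketch of how such a snapshot could be archived per evaluation run, assuming the public models endpoint needs no authentication and that a local `model-descriptions` directory is an acceptable place for the snapshots:

```go
package main

import (
	"io"
	"log"
	"net/http"
	"os"
	"path/filepath"
	"time"
)

func main() {
	// Fetch the current model list with descriptions and metadata.
	response, err := http.Get("https://openrouter.ai/api/v1/models")
	if err != nil {
		log.Fatal(err)
	}
	defer response.Body.Close()

	data, err := io.ReadAll(response.Body)
	if err != nil {
		log.Fatal(err)
	}

	// Store one snapshot per evaluation run so we can later check which
	// model version (e.g. mistral-7b-instruct v0.1) was actually evaluated.
	if err := os.MkdirAll("model-descriptions", 0o755); err != nil {
		log.Fatal(err)
	}
	file := filepath.Join("model-descriptions", time.Now().UTC().Format("2006-01-02T15-04-05")+".json")
	if err := os.WriteFile(file, data, 0o644); err != nil {
		log.Fatal(err)
	}
	log.Println("saved model descriptions to", file)
}
```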
Bar charts should have their value on the bar. The axis values do not work that well.
Pick an example or several examples per category: the goal is to find interesting results automatically, because it will get harder and harder to go through results manually.
Charts to showcase data
Total-scores vs costs scatter plot. Result is an upper-left-corner sweet spot: cheap and good results.
Pie chart of the whole evaluation's costs: for each LLM, show how much it costs. Result is to see which LLMs cost the most to run the eval.
Reporting and documentation on writing deep-dives
What are results that align with expectations? What are results against expectations? E.g. are there small LLMs that are better than big ones?
Are there big LLMs that totally fail?
Are there small LLMs that are surprisingly good?
What about LLMs where the community doesn't know that much yet: e.g. Snowflake, DBRX, ...
Order models by open-weight, allows commercial use, closed, and price(!) and size: e.g. https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1 is great because it is open-weight and Apache 2.0, so commercial use is allowed. It should be better rated than GPT-4.
Distinguish between latency (time-to-first-token) and throughput (tokens generated per second)
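A minimal sketch of how the two metrics differ, assuming we can record a timestamp for every streamed token (the `tokenTimes` input is hypothetical, not an existing API of the eval):

```go
package metrics

import "time"

// LatencyAndThroughput computes the time-to-first-token and the generation
// throughput in tokens per second from the time the request was sent and the
// timestamps of all received tokens.
func LatencyAndThroughput(requestSent time.Time, tokenTimes []time.Time) (latency time.Duration, tokensPerSecond float64) {
	if len(tokenTimes) == 0 {
		return 0, 0
	}

	latency = tokenTimes[0].Sub(requestSent)

	if generationTime := tokenTimes[len(tokenTimes)-1].Sub(tokenTimes[0]); generationTime > 0 {
		tokensPerSecond = float64(len(tokenTimes)-1) / generationTime.Seconds()
	}

	return latency, tokensPerSecond
}
```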
Documentation
Clean up and extend README
Better examples for contributions
Overhaul explanation of "why" we need evaluation, i.e. why is it good to evaluate for an empty function that does nothing.
Write down a playbook for evaluations, e.g. one thing that should happen is that we let the benchmark play 5 times and then sum up points, but ... the runs should have at least one hour break in between to not run into cached responses.
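A minimal sketch of that playbook step, assuming the evaluation binary is called `eval-dev-quality` with an `evaluate` subcommand (illustrative names; summing up the points would happen over the produced result files afterwards):

```go
package main

import (
	"log"
	"os"
	"os/exec"
	"time"
)

func main() {
	const runs = 5
	for i := 1; i <= runs; i++ {
		cmd := exec.Command("eval-dev-quality", "evaluate")
		cmd.Stdout = os.Stdout
		cmd.Stderr = os.Stderr
		if err := cmd.Run(); err != nil {
			log.Printf("run %d failed: %v", i, err)
		}

		if i < runs {
			// Wait at least one hour between runs to avoid cached responses.
			time.Sleep(time.Hour)
		}
	}
}
```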
Write Tutorial for using Ollama
YouTube video for using Ollama
Tooling & Installation
Rescore existing models / evals with fixes, e.g. when we build a better code repair tool, the LLM answer did not change, so we should rescore right away with the new version of the tool over a whole result of an eval.
Automatic tool installation with fixed version
Go
Java
Ensure that non-critical CLI input validation (such as unavailable models) does not panic
Take a look at current leaderboards and evals to know what could be interesting. Current popular code leaderboards are [LiveCodeBench](https://huggingface.co/spaces/livecodebench/leaderboard), the [BigCode models leaderboard](https://huggingface.co/spaces/bigcode/bigcode-models-leaderboard), [CyberSecEval](https://huggingface.co/spaces/facebook/CyberSecEval) and [CanAICode](https://huggingface.co/spaces/mike-ravkine/can-ai-code-results)
Let the Java test case for "No test files" actually identify and report an error that there are no test files (needs to be implemented in symflower test)
LLM
Log request and response in their own files, so both can be used 1:1 (character for character) directly for debugging them
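A minimal sketch, assuming the provider's HTTP client can be wired up with a custom `RoundTripper` (the file naming is illustrative):

```go
package logging

import (
	"fmt"
	"net/http"
	"net/http/httputil"
	"os"
	"time"
)

// dumpTransport writes every request and its response verbatim into separate
// files so they can be replayed character for character when debugging.
type dumpTransport struct {
	next http.RoundTripper
}

func (t dumpTransport) RoundTrip(request *http.Request) (*http.Response, error) {
	id := time.Now().UnixNano()

	if dump, err := httputil.DumpRequestOut(request, true); err == nil {
		os.WriteFile(fmt.Sprintf("request-%d.log", id), dump, 0o644)
	}

	response, err := t.next.RoundTrip(request)
	if err != nil {
		return nil, err
	}

	if dump, err := httputil.DumpResponse(response, true); err == nil {
		os.WriteFile(fmt.Sprintf("response-%d.log", id), dump, 0o644)
	}

	return response, nil
}
```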
Improve LLM prompt
Add an app-name to the requests so people know we are the eval. https://openrouter.ai/docs#quick-start shows that other openapi packages implement custom headers, but the one Go package we are using does not implement that. So do a PR to contribute.
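Until that contribution lands, a minimal sketch of injecting the attribution headers ourselves via a custom transport; the header names follow the OpenRouter quick-start documentation, and the referer/title values are illustrative:

```go
package openrouter

import "net/http"

// appNameTransport adds the OpenRouter attribution headers to every request.
type appNameTransport struct {
	next http.RoundTripper
}

func (t appNameTransport) RoundTrip(request *http.Request) (*http.Response, error) {
	request = request.Clone(request.Context())
	request.Header.Set("HTTP-Referer", "https://github.com/symflower/eval-dev-quality")
	request.Header.Set("X-Title", "DevQualityEval")

	return t.next.RoundTrip(request)
}

// Client returns an HTTP client that attributes all requests to the eval.
func Client() *http.Client {
	return &http.Client{
		Transport: appNameTransport{next: http.DefaultTransport},
	}
}
```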
Prepare language and evaluation logic for multiple files:
Use symflower symbols to receive files
Sandboxed execution (Sandbox execution #17), e.g. with Docker as its first implementation
Timeout for test execution (we've seen tests that take > 15 minutes to execute in some benchmarks)
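A minimal sketch, assuming tests are executed via an external command (using `symflower test` as an illustration) in the repository directory:

```go
package execute

import (
	"context"
	"os/exec"
	"time"
)

// TestWithTimeout runs the test command and kills it when the timeout is
// exceeded, so a single hanging test run cannot stall the whole benchmark.
func TestWithTimeout(repositoryPath string, timeout time.Duration) ([]byte, error) {
	ctx, cancel := context.WithTimeout(context.Background(), timeout)
	defer cancel()

	cmd := exec.CommandContext(ctx, "symflower", "test")
	cmd.Dir = repositoryPath

	return cmd.CombinedOutput()
}
```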
Do an evaluation with different temperatures
Failing tests should receive a score penalty
Evaluation tasks
Introduce the interface for doing "evaluation tasks" so we can easily add them
Add evaluation task for "querying the relative test file path of a relative implementation file path" e.g. "What is the test relative file path for some/implementation/file.go" ... it is "some/implementation/file_test.go" for most cases.
Add evaluation task for transpilation Go->Java and Java->Go
Scoring, Categorization, Bar Charts split by language.
Check determinism of models, e.g. execute each plain repository X times, and then check if the results are stable.
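A minimal sketch of such a stability check, assuming a hypothetical `queryModel` callback that returns the raw model response for a plain repository:

```go
package determinism

import "crypto/sha256"

// IsStable queries the model "runs" times with an identical prompt and
// reports whether all responses are byte-for-byte identical.
func IsStable(queryModel func() (string, error), runs int) (bool, error) {
	seen := map[[32]byte]bool{}
	for i := 0; i < runs; i++ {
		response, err := queryModel()
		if err != nil {
			return false, err
		}
		seen[sha256.Sum256([]byte(response))] = true
	}

	return len(seen) == 1, nil
}
```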
Code repair
Own task category
0-shot, 1-shot, ...
With LLM repair
With tool repair
Do test file paths through symflower symbols
Task for models
Query REAL costs of all the testing of a model: the reason this is interesting is that some models have HUGE outputs, and since more output means more costs, this should be addressed in the score.
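A minimal sketch of how such a sum could be computed from the token usage reported per request, assuming the per-token prices in USD come from the saved model descriptions (see the models endpoint above):

```go
package cost

// Usage is the token usage reported for a single request.
type Usage struct {
	PromptTokens     int
	CompletionTokens int
}

// Total sums the cost in USD of all requests made while testing one model, so
// that models with huge outputs are charged accordingly in the score.
func Total(usages []Usage, promptPriceUSD float64, completionPriceUSD float64) (totalUSD float64) {
	for _, usage := range usages {
		totalUSD += float64(usage.PromptTokens)*promptPriceUSD + float64(usage.CompletionTokens)*completionPriceUSD
	}

	return totalUSD
}
```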
Move towards generated cases so models cannot integrate fixed cases to always have 100% score
Think about adding more training data generation features: this will also help with dynamic cases
Heard that Snowflake Arctic is very open with how they gathered training data... so we can see what LLM creators think and want of training data
Think about a commercial effort for the eval, so that we can balance some of the costs that go into maintaining this eval
The v0.5.0 is mainly meant for introducing more variety. There are three main goals.
Tasks:
symflower test with a deeper execution coverage export (requires at least symflower v36800, see Require at least symflower v36800 #144)
TODO: sort and sort out
Model and Provider to be in the same package (Preload Ollama models before inference and unload afterwards #121 (comment))
https://www.reddit.com/r/LocalLLaMA/comments/1cihrdt/comment/l2d4im0/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button