ABBA Eval

This repo showcases a simple (and silly?) LLM eval:

Can the LLM generate a four line poem that follows the ABBA rhyme scheme?

Results

Background

The ABBA rhyme scheme is a four-line poem where the first and last lines rhyme, and the second and third lines rhyme. For example:

In the realm of code and collaboration, (A)
Where developers unite and innovation thrives, (B)
GitHub stands tall, a platform that survives, (B)
Fostering creativity and inspiration. (A)

The Challenge

Can you use LLMs to generate ABBA poems? If so, how? Out of the box today only GPT-4o does an excellent job out of the box. But there are prompting tricks you can apply to make smaller models generate ABBA poems as well.

Repo

Notebooks:

Lets you generate the poems in various ways
Lets you label the results
Lets you compare the results

Package only contains the labeling code from PigeonXT, had to copy it in to make a minor change to the printing of the results.

Contributing

If you'd like to contribute, for example by adding more:

Models
Evaluation Criteria
Prompting Strategies

Feel free to create an issue so we can discuss it!

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
notebooks		notebooks
output		output
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

notebooks

notebooks

output

output

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

poetry.lock

poetry.lock

pyproject.toml

pyproject.toml

requirements.txt

requirements.txt

Repository files navigation

ABBA Eval

Results

Background

The Challenge

Repo

Contributing

About

Releases

Packages

Languages

License

RensDimmendaal/abba_eval

Folders and files

Latest commit

History

Repository files navigation

ABBA Eval

Results

Background

The Challenge

Repo

Contributing

About

Resources

License

Stars

Watchers

Forks

Languages