This repository benchmarks the prompt optimization libraries DSPy, AdalFlow, and Promptolution on SST-5 and GSM8K. It accompanies the paper *Promptolution: A Unified, Modular Framework for Prompt Optimization* (Link: TBD).
This project uses uv for fast and reproducible Python dependency management.
- Python ≥ 3.12
- uv (install via `curl -LsSf https://astral.sh/uv/install.sh | sh`)
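
A quick way to confirm both prerequisites before syncing (assuming `uv` and `python3` are on your `PATH`):

```bash
# Check tool versions; Python must report 3.12 or newer.
uv --version
python3 --version
```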
```bash
git clone https://github.com/finitearth/prompt-optimization-framework-comparison.git
cd prompt-optimization-framework-comparison
uv sync
source .venv/bin/activate  # Windows: .venv\Scripts\activate
```

To run the benchmarking experiments, configure your API provider and the LLM in `configs/model_config.yaml`. Additionally, add your API token to a `token.txt` file at the root of this repository.
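
For example, assuming `token.txt` simply holds the raw token on a single line (check your provider's documentation for the exact token format), you can create it like this:

```bash
# Write your API token to token.txt at the repository root.
# Assumption: the file contains only the raw token string.
echo "<YOUR_API_TOKEN>" > token.txt
```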
Run a benchmark for a specific task:

```bash
uv run -m src.run_experiment --optimizer capo --task_config configs/datasets/gsm8k.yaml --token_limit 1000000
```
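
To run the same optimizer on both tasks, you can loop over the two task configs referenced in this README; this is just a convenience sketch, not a script shipped with the repository:

```bash
# Run the capo optimizer on both benchmark tasks under the same token budget.
for task in configs/datasets/gsm8k.yaml configs/datasets/sst5.yaml; do
    uv run -m src.run_experiment --optimizer capo --task_config "$task" --token_limit 1000000
done
```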
To evaluate the resulting prompt, run:

```bash
uv run -m src.run_evaluation --model_config configs/model_config.yaml --task_config configs/datasets/sst5.yaml --system_prompt "<YOUR SYSTEM PROMPT>" --prompt_template "<YOUR PROMPT>"
```
Your prompt template must include an `{input}` placeholder so that the data points can be passed to the LLM.
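
For example, a concrete SST-5 evaluation call might look like this (the system prompt and prompt template are purely illustrative, not prompts produced by the benchmark):

```bash
uv run -m src.run_evaluation \
    --model_config configs/model_config.yaml \
    --task_config configs/datasets/sst5.yaml \
    --system_prompt "You are a careful sentiment analyst." \
    --prompt_template "Classify the sentiment of the following review as very negative, negative, neutral, positive, or very positive: {input}"
```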
If you use this benchmark or the associated framework, please cite:
*Promptolution: A Unified, Modular Framework for Prompt Optimization* (full citation and link: TBD)
```bibtex
@misc{promptolution2025,
  title  = {Promptolution: A Unified, Modular Framework for Prompt Optimization},
  author = {Zehle, Tom and Hei{\ss}, Timo and Schlager, Moritz and A{\ss}enmacher, Matthias and Feurer, Matthias},
  year   = {2025},
  note   = {Link: TBD},
}
```

For questions, issues, or collaboration ideas related to this benchmark or Promptolution, feel free to reach out:
Tom Zehle – tom.zehle@tue.ellis.eu
