Text-to-SQL systems provide a natural language interface that can enable even laymen to access information stored in databases. However, existing Large Language Models (LLMs) struggle with SQL generation from natural-language instructions due to large schema sizes and complex reasoning. Prior work often focuses on complex, somewhat impractical pipelines built on flagship models, while smaller, efficient models remain overlooked. In this work, we explore three multi-agent LLM pipelines, with systematic performance benchmarking across a range of small to large open-source models: (1) a Multi-Agent Discussion pipeline, where agents iteratively critique and refine SQL queries and a judge synthesizes the final answer; (2) a Planner-Coder pipeline, where a planner based on a reasoning ("thinking") model generates stepwise SQL generation plans and a coder synthesizes the queries; and (3) a Coder-Aggregator pipeline, where multiple coders independently generate SQL queries and a reasoning agent selects the best one.
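
As a rough illustration of the third setup, the sketch below outlines a Coder-Aggregator loop in plain Python. It is a minimal sketch only: the function names, prompts, and placeholder agents are ours, not the repository's implementation, and real coder/aggregator agents would wrap open-source LLM calls (e.g., served with vLLM).

# Minimal sketch of a Coder-Aggregator pipeline (illustrative only; the prompts,
# names, and placeholder agents below are NOT the repository's implementation).
from typing import Callable, List

def coder_aggregator(question: str, schema: str,
                     coders: List[Callable[[str], str]],
                     aggregator: Callable[[str], str]) -> str:
    """Each coder drafts a SQL query independently; an aggregator selects the best."""
    draft_prompt = (
        f"Database schema:\n{schema}\n\n"
        f"Question: {question}\n"
        "Write a single SQL query that answers the question."
    )
    candidates = [coder(draft_prompt) for coder in coders]

    numbered = "\n".join(f"[{i}] {sql}" for i, sql in enumerate(candidates))
    select_prompt = (
        f"Question: {question}\nSchema:\n{schema}\n"
        f"Candidate SQL queries:\n{numbered}\n"
        "Reason about correctness and return only the best query."
    )
    return aggregator(select_prompt)

if __name__ == "__main__":
    # Placeholder agents; in practice each would wrap an LLM call (e.g. via vLLM).
    dummy_coder = lambda prompt: "SELECT name FROM singer WHERE age > 30;"
    dummy_judge = lambda prompt: prompt.splitlines()[-2].split("] ", 1)[-1]
    print(coder_aggregator("List singers older than 30.",
                           "singer(name, age)",
                           [dummy_coder, dummy_coder],
                           dummy_judge))

The Planner-Coder and Multi-Agent Discussion pipelines can be sketched analogously, replacing the selection step with plan generation or with iterative critique rounds.
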
git clone https://github.com/treeDweller98/bappa-sql.git
cd bappa-sql
python3.13 -m venv .venv
source .venv/bin/activate
pip install pandas wandb func_timeout gdown vllm==v0.8.5.post1
# Download datasets and place them in data/
...
# Add your WANDB_API key and HF_TOKEN to secrets.env
...
# Modify configurations in run_exp.sh
...
bash run_exp.sh
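
Generated queries in Text-to-SQL benchmarks are commonly scored by execution accuracy, i.e., whether the predicted query returns the same rows as the gold query. The snippet below is a toy illustration of that idea only; it is not the repository's evaluation code, and the table and queries are made up.

# Toy execution-based check for a generated SQL query (illustrative only).
import sqlite3

def same_execution_result(conn: sqlite3.Connection, predicted_sql: str, gold_sql: str) -> bool:
    """Return True if both queries execute and yield the same rows (order-insensitive)."""
    try:
        pred_rows = conn.execute(predicted_sql).fetchall()
        gold_rows = conn.execute(gold_sql).fetchall()
    except sqlite3.Error:
        return False  # queries that fail to execute count as incorrect
    return sorted(map(repr, pred_rows)) == sorted(map(repr, gold_rows))

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # stand-in for a benchmark database file
    conn.executescript(
        "CREATE TABLE singer(name TEXT, age INTEGER);"
        "INSERT INTO singer VALUES ('Alice', 34), ('Bob', 25);"
    )
    print(same_execution_result(
        conn,
        "SELECT name FROM singer WHERE age > 30;",   # predicted
        "SELECT name FROM singer WHERE age >= 31;",  # gold
    ))  # -> True
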
Please cite our work using:

@misc{ahmed2025bappabenchmarkingagentsplans,
  title={BAPPA: Benchmarking Agents, Plans, and Pipelines for Automated Text-to-SQL Generation},
  author={Fahim Ahmed and Md Mubtasim Ahasan and Jahir Sadik Monon and Muntasir Wahed and M Ashraful Amin and A K M Mahbubur Rahman and Amin Ahsan Ali},
  year={2025},
  eprint={2511.04153},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2511.04153},
}

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
