Improve wording for queries and DB descriptions#1

Merged
Ruiqi-Chen-0216 merged 1 commit into ucbepic:main from shreyashankar:main
Jul 8, 2025

Conversation

@shreyashankar
Collaborator

The queries are awesome!!

This PR addresses grammar, clarity, and ambiguity issues in query files and database descriptions across all query
datasets.

Key changes:

  • Made queries unambiguous with clear instructions
  • Fixed grammar issues and standardized terminology in database descriptions
  • Centralized duplicate db_description.txt files into single files per dataset
  • Improved hints with clearer explanations

Outstanding issues to address:

  1. Vague hints in the Yelp dataset: The hint "The datasets contain five tables total. Carefully identify which tables and columns contain the information required to answer the query" should specify which tables are needed for each query type.
  2. MongoDB symbol definitions: The stockmarket_symboldefinition/SymbolDirectoryDefinitions.bson file should be converted to text format and either included directly in the hint or provided as a proper database connection, like the other databases.
  3. Missing ground truth validation: Although ground truth query scripts exist, we should also have a file of all query answers plus validation functions that check whether an agent's predicted answer is correct. For example, if the answer is a list of companies and quantities, the function should check that each company and quantity appears in the agent's string response.
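To make item 3 concrete, here is a minimal sketch of what such a validation function could look like. This assumes ground-truth answers are stored as (company, quantity) pairs; the function name and signature are illustrative, not an existing API in this repository.

```python
def validate_answer(expected, response):
    """Return True if every expected company name and quantity
    appears somewhere in the agent's free-text response.

    expected: list of (company, quantity) tuples from the ground truth
    response: the agent's raw string answer
    """
    text = response.lower()
    for company, quantity in expected:
        # Match company names case-insensitively.
        if company.lower() not in text:
            return False
        # Quantities are compared as substrings of the response.
        if str(quantity) not in text:
            return False
    return True
```

A per-query variant of this check, plus an answers file keyed by query ID, would let `common_scaffold/validate/validate.py` score submissions automatically.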

@Ruiqi-Chen-0216 Ruiqi-Chen-0216 merged commit e5cf7c8 into ucbepic:main Jul 8, 2025
Ruiying-Ma pushed a commit that referenced this pull request Dec 5, 2025
Improve wording for queries and DB descriptions
NuryeNigusMekonen pushed a commit to NuryeNigusMekonen/DataAgentBench that referenced this pull request Apr 22, 2026
…, Cohere (ucbepic#38) to leaderboard

Verified Pass@1 numbers were re-computed from the raw submission JSONs
using common_scaffold/validate/validate.py:

  PR ucbepic#31  Pi Coding Agent + Claude Opus 4.6      → 0.5603 (ucbepic#1)
  PR ucbepic#32  Oracle Forge (Tenacious) + Sonnet 4.6  → 0.4554 (ucbepic#4)
  PR ucbepic#38  Oracle Forge (Cohere) + Gemini 2.0 F.  → 0.128  (ucbepic#10)

Adds a Submission column on both the README table and the website
leaderboard linking each submission to its PR.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>