This repository provides an implementation of ArbGraph, a framework for improving the reliability of long-form retrieval-augmented generation (RAG) via pre-generation evidence arbitration.
The code accompanies our paper and is released for research use and partial reproducibility.
ArbGraph addresses a key limitation of long-form RAG systems: handling noisy and contradictory evidence.
Instead of resolving conflicts during generation, ArbGraph performs pre-generation arbitration by:
- decomposing documents into atomic claims,
- modeling support and contradiction relations,
- estimating claim credibility via conflict-aware arbitration,
- generating outputs from a validated evidence set.
- Retrieval
- Atomic Claim Extraction (
atomization.py) - Claim Alignment (
claim_alignment.py) - Evidence Graph Construction (
evidence_graph.py) - Conflict Arbitration (
conflict_arbitration.py) - Generation (
longform_generation.py)
python run_arbgraph.pyInstall dependencies:
pip install -r requirements.txt- Default backbone: Qwen3-4B-Instruct
- Retrieval based on Wikipedia
- This is a research prototype and may require GPU for efficient execution