GitHub - TuojingAI/CounterScene: Bridging generative world models and causal reasoning for realistic yet adversarial safety-critical scenario generation.

CounterScene: Counterfactual Causal Reasoning in Generative World Models for Safety-Critical Closed-Loop Evaluation

Bowen Jing, Ruiyang Hao, Weitao Zhou, Haibao Yu

Paper | Overview | Main Results | Citation

Overview

CounterScene studies safety-critical scenario generation through counterfactual causal reasoning in generative world models.

Instead of applying arbitrary perturbations, CounterScene identifies the agent whose behavior is causally maintaining safety and performs a minimal intervention that transforms a safe scene into a realistic safety-critical interaction while preserving coherent multi-agent dynamics.

Counterfactual causal reasoning for safety-critical generative BEV world model. Given an observed traffic scene where the adversarial agent waits and the ego vehicle passes safely, CounterScene asks: what if the adversarial agent did not wait? By intervening on the agent trajectory, it constructs a counterfactual world that induces safety-critical interactions while preserving realistic traffic dynamics.

Key Components

Causal adversarial agent selection identifies the agent whose current behavior suppresses the greatest latent risk.
Causal Interaction Graph (CIG) models conflict-aware dependencies among agents.
Counterfactual guidance perturbs only the selected agent trajectory during diffusion.
Closed-loop rollout allows all other agents to react naturally, preserving realistic multi-agent dynamics.

Overview of CounterScene. The framework consists of four modules: (1) causal adversarial agent selection, (2) a Causal Interaction Graph (CIG), (3) a diffusion-based interactive BEV world model with a SceneTransformer denoiser, and (4) counterfactual guidance that perturbs only the adversarial agent trajectory during denoising.

Notes: We will release the full codebase, trained checkpoints, and evaluation scripts upon acceptance of the paper.

Main Results

CounterScene achieves stronger realism-effectiveness trade-offs than prior baselines on nuScenes, with lower trajectory error and substantially higher collision rates, especially at longer horizons.

Horizon	Method	ADE	FDE	ORR	HBR	CR
Short (1-4s)	CCDiff	0.380	0.908	0.4%	1.5%	1.3%
Short (1-4s)	CounterScene	0.288	0.703	0.5%	1.6%	3.3%
Mid (5-7s)	CCDiff	1.128	2.914	1.3%	1.5%	8.7%
Mid (5-7s)	CounterScene	0.982	2.639	1.3%	2.0%	14.7%
Long (8-10s)	CCDiff	2.092	5.898	2.3%	1.5%	12.3%
Long (8-10s)	CounterScene	1.877	5.141	1.9%	1.8%	22.7%

Metric note. ADE/FDE measure trajectory realism, while ORR, HBR, and CR evaluate off-road rate, hard-brake rate, and collision rate.

For additional qualitative comparisons, ablations, zero-shot transfer, and full per-horizon tables, please refer to the paper.

Qualitative Results

Scene 103 — Merging Cut-in

In the factual scene, the side-road vehicle yields and the ego passes safely. CounterScene identifies that yielding vehicle as the causal variable and compresses its spatiotemporal margin, producing an aggressive cut-in.

Scene 905 — Rear-End Collision

The trailing vehicle maintains a safe following margin in the factual scene. CounterScene compresses the margin, causing a realistic rear-end crash without distorting the rest of the scene.

Scene 913 — Head-On Collision

CounterScene applies a minimal spatial intervention that redirects the causally critical vehicle into the ego path, culminating in a realistic head-on collision.

Citation

If you find this work useful, please cite:

@article{jing2026counterscene,
  title={CounterScene: Counterfactual Causal Reasoning in Generative World Models for Safety-Critical Closed-Loop Evaluation},
  author={Jing, Bowen and Hao, Ruiyang and Zhou, Weitao and Yu, Haibao},
  journal={arXiv preprint arXiv:2603.21104},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CounterScene: Counterfactual Causal Reasoning in Generative World Models for Safety-Critical Closed-Loop Evaluation

Overview

Key Components

Main Results

Qualitative Results

Scene 103 — Merging Cut-in

Scene 905 — Rear-End Collision

Scene 913 — Head-On Collision

Citation

About

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

CounterScene: Counterfactual Causal Reasoning in Generative World Models for Safety-Critical Closed-Loop Evaluation

Overview

Key Components

Main Results

Qualitative Results

Scene 103 — Merging Cut-in

Scene 905 — Rear-End Collision

Scene 913 — Head-On Collision

Citation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!