📖 BookAgent

Orchestrating Safety-Aware Visual Narratives via Multi-Agent Cognitive Calibration

🚀 Overview

We introduce BookAgent, a safety-aware multi-agent framework for end-to-end visual storybook generation.
Unlike prior pipelines that decouple text and image generation, BookAgent jointly plans, generates, verifies, and repairs multi-modal narratives.

💡 Unlike stage-wise pipelines, BookAgent introduces a closed-loop cognitive generation paradigm for long-horizon multi-modal storytelling.

🧠 Framework

BookAgent is built upon a closed-loop multi-agent architecture with three key stages:

Value-Aligned Storyboarding (VAS)
Ensures safety and transforms raw drafts into structured story plans.
Iterative Cross-modal Refinement (ICR)
A generate–verify–revise loop that enforces text-image grounding and identity consistency.
Temporal Cognitive Calibration (TCC)
Performs global reasoning and selective repair to maintain long-horizon consistency.

🖥️ Demo Interface

We build a fully functional interactive system for storybook creation, supporting:

✏️ Story input and page control
🎨 Style customization
🔁 Iterative global refinement
🧩 Character consistency via reference sheets

🚀 Run Locally

Prerequisites

Node.js

Steps

Install dependencies:
```
npm install
```
Set your API key in .env.local:
```
GEMINI_API_KEY=your_api_key_here
```
Run the app:
```
npm run dev
```

✨ Key Features

🔁 Closed-loop generation (not one-shot)
🎭 Character identity consistency across pages
🧠 Multi-agent collaboration
🛡️ Child-safe content generation
📚 Long-horizon narrative reasoning

📊 Results

BookAgent significantly improves:

📖 Narrative coherence
🧍 Character consistency
🛡️ Safety compliance

compared to prior methods such as StoryGPT-V and MovieAgent.

🙏 Acknowledgements

We thank Google AI Studio for providing an intuitive platform for rapid prototyping and deployment of our interactive demo system.

📌 Citation

@article{gao2026bookagent,
  title={BookAgent: Orchestrating Safety-Aware Visual Narratives via Multi-Agent Cognitive Calibration},
  author={Gao, Bo and Liu, Chang and Miao, Yuyang and Ma, Siyuan and Lim, Ser-Nam},
  journal={ACL Findings},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
components		components
services		services
.gitignore		.gitignore
App.tsx		App.tsx
README.md		README.md
index.html		index.html
index.tsx		index.tsx
metadata.json		metadata.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
types.ts		types.ts
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📖 BookAgent

🚀 Overview

🧠 Framework

🖥️ Demo Interface

🚀 Run Locally

Prerequisites

Steps

✨ Key Features

📊 Results

🙏 Acknowledgements

📌 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📖 BookAgent

🚀 Overview

🧠 Framework

🖥️ Demo Interface

🚀 Run Locally

Prerequisites

Steps

✨ Key Features

📊 Results

🙏 Acknowledgements

📌 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages