hackertaco/novel-generator

Novel Generator

AI novel generation harness with information-theoretic quality evaluation.

A configurable pipeline that generates Korean web novels chapter-by-chapter, with deterministic quality scoring, parallel scene generation, and step-by-step orchestration.

Architecture

Genre ─→ stepPlots() ─→ stepSeed() ─→ stepPlan() ─→ stepChapters()
          3 plots        3-temp         plausibility    parallel scenes
          generated      evolution +    check +         + bridge stitch
                         crossover      master plan     + quality gate

Each step is managed by NovelHarness — a configurable runner that applies consistent model selection, evaluation criteria, and tracking across the entire pipeline. Users control progression between steps; the harness manages quality.

Chapter Generation Pipeline

WriterAgent        → Scene-by-scene or parallel generation
RuleGuardAgent     → Code-based rules (ending repetition, cliché, length)
ConstraintChecker  → Knowledge graph validation (dead characters, timeline paradox)
QualityLoop        → Deterministic gate → LLM critic (only for borderline cases)
PolisherAgent      → Final style pass
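
The agents above run as a sequential refinement chain: each takes the current draft and either revises it or reports violations. A minimal sketch of that shape, with illustrative interface names (the actual agent signatures are not shown in this README):

```typescript
// Illustrative pipeline shape: each agent revises the draft and/or
// accumulates violations for the quality gate downstream.
interface AgentResult { draft: string; violations: string[] }
type Agent = (draft: string) => Promise<AgentResult>;

async function runPipeline(draft: string, agents: Agent[]): Promise<AgentResult> {
  const violations: string[] = [];
  for (const agent of agents) {
    const result = await agent(draft);
    draft = result.draft;              // later agents see earlier revisions
    violations.push(...result.violations);
  }
  return { draft, violations };
}
```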

Deterministic Quality Scoring (10 dimensions, $0)

| Dimension | Weight | Method |
|---|---|---|
| Information theory | 21% | Shannon entropy, JSD pivot detection, arc correlation |
| Narrative | 15% | Entity density, causal connectors, tension escalation |
| Character voice | 12% | Speech pattern matching, formality consistency |
| Immersion | 12% | Concreteness ratio, scene grounding, psychic distance |
| Rhythm | 10% | Sentence length distribution, ending diversity |
| Hook ending | 8% | Last paragraph pattern analysis |
| Anti-repetition | 8% | Subject repetition, 4-gram frequency |
| Dialogue ratio | 6% | 30-60% optimal range |
| Sensory diversity | 4% | Coverage of the 5 senses |
| Length | 4% | Target character count adherence |

Gate logic: score > 0.85 → pass (skip the LLM). Score 0.70-0.85 → LLM critic. Score < 0.70 → escalate to the LLM critic with weak-dimension hints (not silently rejected, which would be an anti-pattern).
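
A minimal sketch of that gate, assuming per-dimension scores in 0..1 and weights that sum to 1 (type and function names are illustrative, not the repository's actual API):

```typescript
type GateDecision = "pass" | "llm_critic" | "llm_with_hints";

interface DimensionScore {
  name: string;
  score: number;   // 0..1 per-dimension score
  weight: number;  // fraction of the total; weights sum to 1
}

function weightedScore(dims: DimensionScore[]): number {
  return dims.reduce((sum, d) => sum + d.score * d.weight, 0);
}

function gate(dims: DimensionScore[]): { decision: GateDecision; hints: string[] } {
  const total = weightedScore(dims);
  if (total > 0.85) return { decision: "pass", hints: [] };
  if (total >= 0.70) return { decision: "llm_critic", hints: [] };
  // Below 0.70: surface the weakest dimensions so the critic knows where to look.
  const hints = dims
    .filter((d) => d.score < 0.5)
    .sort((a, b) => a.score - b.score)
    .map((d) => d.name);
  return { decision: "llm_with_hints", hints };
}
```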

Honest limitations: These are statistical proxies, not literary judgment. A text with many emotion keywords but no coherent story can score well on entropy. A minimalist Hemingway-style chapter may score low. The scorer catches "definitely bad" structural issues — it does not measure "genuinely good" writing. LLM critic remains necessary for the 30% of chapters in the middle zone.

Information Theory Scorer

Built on academic foundations:

  • Shannon Entropy — emotion distribution complexity per paragraph (dynamism detection)
  • Jensen-Shannon Divergence — surprise/pivot detection between consecutive segments
  • Pearson Correlation — blueprint tension curve vs actual sentiment curve alignment
  • Korean Emotion Lexicon — 300+ keywords with valence/arousal, 18 categories including web-novel body language

Sources: Narrative Information Theory, Fabula Entropy Indexing, Syuzhet
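
The two core measures are standard and compact. A self-contained sketch, where each distribution is a normalized emotion-category frequency vector for one segment (the lexicon lookup that produces these vectors is not shown):

```typescript
// Shannon entropy in bits; zero-probability terms contribute nothing.
function entropy(p: number[]): number {
  return -p.reduce((h, pi) => (pi > 0 ? h + pi * Math.log2(pi) : h), 0);
}

// Jensen-Shannon divergence between consecutive segments: symmetric and
// bounded by 1 bit (base-2 logs), so a fixed pivot threshold is meaningful.
function jsd(p: number[], q: number[]): number {
  const m = p.map((pi, i) => (pi + q[i]) / 2);
  const kl = (a: number[], b: number[]) =>
    a.reduce((s, ai, i) => (ai > 0 ? s + ai * Math.log2(ai / b[i]) : s), 0);
  return 0.5 * kl(p, m) + 0.5 * kl(q, m);
}
```

A flat emotion mix (`[0.5, 0.5]`) yields maximum entropy, while a large JSD between adjacent segments flags an emotional pivot.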

Tracking Systems (7)

  • HierarchicalMemory — short/mid/long-term chapter memory
  • CharacterTracker — emotion, relationships, location, secrets per chapter
  • ThreadTracker — open narrative threads with deadline reminders
  • EventTimeline — searchable event index by character/location/type
  • ToneManager — arc-level tone profiles and tension curves
  • ProgressMonitor — pacing feedback (too fast/slow)
  • FeedbackAccumulator — bottom-up correction planning

Known risk: Trackers depend on LLM extraction, which can hallucinate. A single missed state change (e.g., "dropped the sword" not tracked) snowballs across subsequent chapters. Mitigations: sanity checks reject suspicious bulk changes, and users can manually correct tracker state between chapters.
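
One way to implement the bulk-change sanity check described above is to count how many tracked fields an LLM-extracted update touches and reject updates above a threshold. A hypothetical sketch (the field names and threshold are assumptions, not the repository's schema):

```typescript
// Hypothetical tracked state for one character in one chapter.
interface CharacterState {
  location: string;
  mood: string;
  secrets: string[];
}

function countChangedFields(prev: CharacterState, next: CharacterState): number {
  let changed = 0;
  if (prev.location !== next.location) changed++;
  if (prev.mood !== next.mood) changed++;
  if (prev.secrets.length !== next.secrets.length) changed++;
  return changed;
}

// Reject suspicious bulk changes: a single chapter rarely rewrites everything.
function acceptUpdate(prev: CharacterState, next: CharacterState, maxChanges = 2): boolean {
  return countChangedFields(prev, next) <= maxChanges;
}
```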

Presets

| Preset | Model | Pipeline | Speed |
|---|---|---|---|
| Default | gpt-5.4 | Full (6 agents) | ~2 min/ch |
| Budget | gpt-4o | Full | ~1.5 min/ch |
| Fast | gpt-4o-mini | Writer + Guard + Constraint | ~15 s/ch (parallel) |

Parallel mode trade-off (Fast only): Scenes are generated simultaneously via Promise.all and stitched with bridge passes. This is 2-3x faster but sacrifices inter-scene causal coherence — a detail in Scene A won't influence Scene B. Use only when speed matters more than tight scene-to-scene continuity. Default and Budget presets use sequential generation.
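
The fan-out-then-stitch pattern can be sketched as follows; the `writeScene` and `bridge` signatures are illustrative stand-ins for the actual agent calls:

```typescript
// Fast-preset sketch: generate all scenes concurrently, then stitch with
// bridge passes. Because every scene starts from the outline alone,
// a detail invented in Scene A cannot influence Scene B.
async function generateChapterParallel(
  writeScene: (outline: string) => Promise<string>,
  bridge: (prevScene: string, nextScene: string) => Promise<string>,
  outlines: string[],
): Promise<string> {
  const scenes = await Promise.all(outlines.map(writeScene));
  const parts: string[] = [scenes[0]];
  for (let i = 1; i < scenes.length; i++) {
    // Each bridge pass smooths the seam between adjacent scenes.
    parts.push(await bridge(scenes[i - 1], scenes[i]), scenes[i]);
  }
  return parts.join("\n\n");
}
```

The bridge passes run sequentially, so the speedup comes from the scene fan-out, not the stitching.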

Usage

import { NovelHarness, getFastConfig } from './lib/harness';

const harness = new NovelHarness(getFastConfig());

// Step-by-step (UI); "판타지" is Korean for "fantasy"
for await (const event of harness.stepPlots("판타지")) { /* show plots */ }
for await (const event of harness.stepSeed("판타지", selectedPlot)) { /* show seed */ }
for await (const event of harness.stepPlan()) { /* show master plan */ }
for await (const event of harness.stepChapters(1, 10)) { /* stream chapters */ }

// Full auto (CLI)
for await (const event of harness.runFullPipeline("판타지", { endChapter: 10 })) {
  console.log(event.type);
}

Setup

cd web
npm install
cp .env.example .env.local  # Add your OpenAI API key
npm run dev

Tech Stack

  • Next.js (App Router)
  • TypeScript
  • OpenAI GPT-5.4 / GPT-4o (via configurable model selection)
  • Zod (schema validation)
  • Information Theory (Shannon entropy, JSD, Pearson correlation)

License

MIT
