SDXL XY Plot Generator

A flexible tool for generating XY comparison grids with Stable Diffusion XL models. Create visual comparisons of different parameters like checkpoints, LoRAs, prompts, and generation settings in a grid format.

Features

Flexible Axis Configuration: Plot any two parameters against each other
Multiple Model Support: Compare checkpoints, LoRAs, or both
Batch Processing: Generate entire grids efficiently with memory optimization
Kohya Format Support: Load prompts from Kohya-formatted text files
CLI Overrides: Modify any configuration value from the command line
Memory Optimization: Automatic VRAM management for GPUs with limited memory

Installation

Install required dependencies:

pip install -r requirements.txt

Ensure you have CUDA-capable GPU (or CPU mode will be used automatically)

Quick Start

Create a config.toml file (see Configuration section)
Run the generator:

python main.py --config config.toml

Configuration

The configuration uses TOML format with the following structure:

Basic Structure

[xy_plot]
x_axis = "prompt"      # What varies horizontally
y_axis = "checkpoint"  # What varies vertically

[checkpoint]
# Single file = constant, used as base for all generations
path = "/path/to/model.safetensors"
# OR directory = multiple files for axis variation
path = "/path/to/models/"
# OR explicit list
values = ["/path/to/model1.safetensors", "/path/to/model2.safetensors"]

[prompt]
# From file (supports Kohya format)
values_file = "prompts.txt"
# OR inline list
values = ["prompt 1", "prompt 2", "prompt 3"]

[generation]
negative_prompt = "low quality, blurry"
seed = 42
resolution = 1024
steps = 30
cfg_scale = 7.0

Multi-Parameter Rules

Key Concept: Any parameter with multiple values can be used as an axis. Parameters with single values are constants applied to all generations.

Example 1: Checkpoints vs Prompts

[xy_plot]
x_axis = "prompt"
y_axis = "checkpoint"

[checkpoint]
# Multiple values = can be used as axis
values = [
    "/models/anime.safetensors",
    "/models/realistic.safetensors",
    "/models/artistic.safetensors"
]

[prompt]
values = ["portrait", "landscape", "abstract art"]

# Result: 3x3 grid (3 checkpoints × 3 prompts = 9 images)

Example 2: LoRA Weights vs Prompts

[xy_plot]
x_axis = "prompt"
y_axis = "lora1_weights"

[checkpoint]
# Single value = base model (constant)
path = "/models/base_sdxl.safetensors"

[lora1_weights]
path = "/loras/style.safetensors"
weights = [0.5, 0.75, 1.0, 1.25]  # Multiple weights

[prompt]
values = ["anime style", "realistic", "oil painting"]

# Result: 3x4 grid (3 prompts × 4 weights = 12 images)

Example 3: Two LoRAs with Different Weights

[xy_plot]
x_axis = "lora1_weights"
y_axis = "lora2_weights"

[checkpoint]
path = "/models/base.safetensors"  # Base model

[lora1_weights]
path = "/loras/style.safetensors"
weights = [0.5, 1.0, 1.5]

[lora2_weights]
path = "/loras/character.safetensors"
weights = [0.5, 1.0, 1.5]

[generation]
prompt = "portrait, masterpiece"  # Fixed prompt

# Result: 3x3 grid testing LoRA combinations

Parameter Types

Checkpoints

[checkpoint]
# Single file (constant)
path = "/path/to/model.safetensors"

# Directory (finds all .safetensors files)
path = "/path/to/models/"

# Explicit list
values = ["/path/to/model1.safetensors", "/path/to/model2.safetensors"]

LoRAs

[lora1]  # or [lora2], [lora3], etc.
# Single LoRA
path = "/path/to/lora.safetensors"
weight = 1.0

# Directory of LoRAs
path = "/path/to/loras/"
weight = 0.8  # Applied to all

# Multiple weights for same LoRA
[lora1_weights]
path = "/path/to/lora.safetensors"
weights = [0.5, 0.75, 1.0, 1.25]

Prompts

[prompt]
# From file (supports Kohya format with --n, --steps, etc.)
values_file = "prompts.txt"

# Inline list
values = [
    "beautiful landscape",
    "portrait photography",
    "abstract art"
]

Generation Parameters

[resolution]
values = [512, 768, 1024]

[steps]
values = [20, 30, 40, 50]

[cfg_scale]
values = [5.0, 7.0, 9.0, 11.0]

[seed]
values = [42, 123, 456, 789]

Valid Axis Combinations

You can plot any two variable parameters:

checkpoint vs prompt
checkpoint vs lora1
lora1 vs lora2
prompt vs cfg_scale
steps vs resolution
lora1_weights vs prompt
etc.

Important: Maximum 2 parameters can have multiple values (one for each axis).

Prompt Files

Create a prompts.txt file with one prompt per line:

# Comments are ignored
a beautiful anime girl, masterpiece --n low quality --steps 30 --cfg 7.5
landscape photography, mountains --negative ugly --cfg-scale 8.0
cyberpunk city at night --w 1024 --h 1024

# Kohya-style flags are automatically removed:
# --n, --negative: negative prompt
# --s, --steps: sampling steps
# --cfg, --cfg-scale: CFG scale
# --seed: random seed
# --w, --width, --h, --height: dimensions

Command Line Usage

Basic Usage

python main.py --config config.toml

Override Configuration

# Change single values
python main.py --set checkpoint.path=/path/to/model.safetensors

# Change axis assignment
python main.py --set xy_plot.x_axis=checkpoint --set xy_plot.y_axis=prompt

# Override generation parameters
python main.py --set generation.steps=50 --set generation.cfg_scale=8.5

# Use different prompt file
python main.py --set prompt.values_file=other_prompts.txt

# Override with lists
python main.py --set "checkpoint.values=[/model1.safetensors,/model2.safetensors]"

Utility Commands

# Preview configuration without generating
python main.py --dry-run

# Save modified configuration
python main.py --set generation.steps=50 --save-config my_config.toml

# Multiple overrides
python main.py \
  --set checkpoint.path=/models/new_model.safetensors \
  --set generation.steps=40 \
  --set output.output_dir=./results

Output

The generator creates:

XY Grid Image: Full resolution grid with all combinations
Preview Image: Smaller preview (25% by default)
Individual Images: Each generated image (optional)
Metadata JSON: Complete configuration and parameters used

Output structure:

output/
├── xy_plot_20240101_120000.png        # Main grid
├── xy_plot_20240101_120000_preview.png # Preview
├── xy_plot_20240101_120000_metadata.json
└── individual_20240101_120000/        # Individual images (optional)
    ├── y00_x00_model1_prompt1.png
    ├── y00_x01_model1_prompt2.png
    └── ...

Memory Optimization

For GPUs with limited VRAM:

[memory_optimization]
enable_cpu_offload = true      # Move models between CPU/GPU
enable_attention_slicing = true # Reduce attention memory usage
enable_vae_tiling = true       # Decode images in tiles

[generation]
resolution = 768  # Reduce if still running out of memory

Tips

Start Small: Test with 2-3 values per axis before large grids
Use Preview: Check the preview image before opening large grids
Save Configurations: Use --save-config to save successful setups
Monitor VRAM: The tool shows GPU memory and auto-enables optimizations
Batch Similar Models: Group similar checkpoints/LoRAs for better comparison

Troubleshooting

Out of Memory:

Enable memory optimizations in config
Reduce resolution
Use fewer steps
Process fewer models at once

LoRA Not Loading:

Ensure base checkpoint is specified
Check file paths are correct
Verify LoRA is compatible with the base model

Slow Generation:

Disable CPU offload if you have enough VRAM
Reduce resolution or steps
Use fewer complex LoRAs

Examples

See the examples/ directory for complete configuration examples for common use cases.

Override format:

Simple: generation.steps=50
Lists: prompt.values=[val1,val2,val3]
Strings: checkpoint.path=/path/to/file
Booleans: memory_optimization.enable_cpu_offload=false

Processing Logic

Parameter Resolution

Load base configuration from TOML
Apply CLI overrides
Identify variable parameters (those with multiple values)
Validate exactly 2 parameters are variable
Assign parameters to X and Y axes

Generation Flow

Initialize Generator
- Load required libraries (diffusers, transformers)
- Detect GPU capabilities
- Apply memory optimizations
Parameter Cartesian Product
- Create all combinations of X and Y values
- Each combination inherits constant parameters
- Total images = len(X values) × len(Y values)
Model Management
- Checkpoints: Load when changed, keep in memory if unchanged
- LoRAs: Apply on top of current checkpoint
- Multiple LoRAs: Can stack (lora1 + lora2)
- Memory clearing between major model changes
Image Generation For each cell (x, y):
- Load/apply required models
- Generate image with combined parameters
- Handle failures with placeholder images
- Save individual images if requested
Grid Assembly
- Create canvas with margins for labels
- Draw wrapped text labels for both axes
- Paste generated images in grid positions
- Add grid lines and axis titles

Output Specifications

Files Created

Main Grid Image (xy_plot_TIMESTAMP.png)
- Full resolution grid
- Labeled axes with text wrapping
- Grid lines for clarity
- All images at specified resolution
Preview Image (xy_plot_TIMESTAMP_preview.png)
- Scaled down version (default 25%)
- Same layout as main grid
- Quick viewing for large grids

Metadata File (xy_plot_TIMESTAMP_metadata.json)

{
  "timestamp": "YYYYMMDD_HHMMSS",
  "x_axis": "parameter_name",
  "y_axis": "parameter_name",
  "x_values": ["value1", "value2"],
  "y_values": ["value1", "value2"],
  "config": {full configuration object}
}

Individual Images (optional, in subdirectory)
- Named: yYY_xXX_ylabel_xlabel.png
- Each with accompanying metadata JSON

Grid Layout

        [Title X]
[Title] [Label1] [Label2] [Label3]
[Y1]    [Image]  [Image]  [Image]
[Y2]    [Image]  [Image]  [Image]
[Y3]    [Image]  [Image]  [Image]

Model Handling

Checkpoint Behavior

Single checkpoint: Base model for all generations
Multiple checkpoints: Can be axis variable
Checkpoint switching: Full model reload required
Memory: Previous checkpoint unloaded before loading new

LoRA Behavior

Requires base checkpoint
Can apply multiple LoRAs simultaneously
Weight parameter controls strength
Can vary: LoRA selection, LoRA weights, or both
Efficient: Only LoRA weights change, base model stays loaded

Valid Combinations

Checkpoint only (no LoRA)
Checkpoint + Single LoRA
Checkpoint + Multiple LoRAs
Single Checkpoint + LoRA variations
Multiple Checkpoints + Same LoRA
Multiple Checkpoints + Multiple LoRAs

Memory Management

Automatic Optimizations

Detect GPU VRAM on startup
Enable CPU offload for <12GB VRAM
Enable attention slicing for <16GB VRAM
Enable VAE tiling for <12GB VRAM

Manual Controls

Configure optimizations in config file
Reduce resolution for lower VRAM usage
Adjust batch processing
Force garbage collection between models

Recovery Mechanisms

Catch CUDA OOM errors
Retry with reduced resolution
Create placeholder for failed generations
Continue with remaining images

Extension Points

Adding New Parameters

Any parameter that affects generation can be made variable:

Add parameter to configuration parsing
Define how it applies to generation
Handle single vs multiple values
Add to valid axis parameters

Adding New Model Types

Define loading mechanism
Specify combination rules with existing models
Handle memory management
Update parameter parsing

Custom Schedulers

Add to SCHEDULERS dictionary
Map configuration names to scheduler classes
No other changes needed

Output Formats

Image saving uses PIL
Can add different formats
Metadata is JSON (could add CSV, etc.)

Error Handling

Validation Errors

Missing configuration sections
Invalid axis parameters
Too many variable parameters
File not found errors

Generation Errors

Model loading failures
CUDA out of memory
Invalid parameter combinations
Corrupted model files

Recovery Strategy

Log error with context
Create placeholder if possible
Continue with remaining images
Report summary at end

Performance Considerations

Optimization Strategies

Minimize model reloading
Keep base model in memory when varying LoRAs
Process in optimal order (group by model)
Clear memory proactively

Bottlenecks

Model loading (disk I/O)
VRAM limitations
CPU-GPU transfer (when offloading)
Image encoding/decoding

Scaling Limits

Grid size limited by memory (both RGB and VRAM)
Practical limit ~10x10 grid at 1024px
Can generate larger grids at lower resolution
Individual image saving allows unlimited grid size

Future Enhancements

Potential Features

3D Plots: Third parameter via multiple grids
Animation: Create GIFs from parameter sweeps
Parallel Generation: Multiple GPUs support
Smart Ordering: Optimize generation order
Partial Grids: Resume interrupted generations
Parameter Extraction: Use prompts file parameters
Automatic Optimal Settings: Based on hardware
Web Interface: Browser-based configuration
Batch Jobs: Queue multiple grids
Comparison Metrics: Auto-calculate similarity scores

Architecture Improvements

Plugin System: Modular parameter handlers
Pipeline Abstraction: Support other models (SD1.5, etc.)
Configuration Inheritance: Base configs with overrides
Result Database: Track all generations
Distributed Processing: Network-based generation

This specification provides complete information for reimplementation or enhancement while remaining implementation-agnostic.


These documents provide comprehensive user instructions and technical specifications for the SDXL XY Plot Generator, making it easy for users to understand the system and for developers to extend or reimplement it.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
examples		examples
.gitignore		.gitignore
README.md		README.md
config.toml		config.toml
context.md		context.md
install.sh		install.sh
main.py		main.py
prompts.txt		prompts.txt
requirements.txt		requirements.txt
run.sh		run.sh
sdxl_generator.py		sdxl_generator.py
xy_plot_generator.py		xy_plot_generator.py

nomad-engineer/sd-xyplotter

Folders and files

Latest commit

History

Repository files navigation