CorePulse

A modular toolkit for advanced diffusion model manipulation, providing unprecedented control over how Stable Diffusion processes and interprets your prompts.

Core Concepts

Prompt Injection

Inject different prompts into specific architectural blocks of the UNet during generation. This allows you to control different aspects of your image:

Content blocks (middle layers) → What appears in your image
Style blocks (output layers) → How it looks and feels
Composition blocks (input layers) → Overall layout and structure

Token-Level Attention Masking

Control which parts of your prompt have influence by masking attention to specific tokens/words. This is different from spatial masking - it works at the linguistic level rather than image regions.

Selective attention → Choose which words in your prompt get processed
Token-level control → Fine-grained control over prompt interpretation
Linguistic precision → Target specific concepts without changing prompt text

Example: In "a cat playing at a park", mask attention to "cat" tokens while preserving attention to "playing at a park"

Regional/Spatial Injection

Apply prompt injections only to specific regions of the image using spatial masks. This enables surgical control over different areas of your image.

Targeted replacement → Change specific image regions while preserving context
Spatial precision → Control exactly where changes occur
Context preservation → Background and surroundings remain untouched

Example: Apply "golden retriever dog" only to a circular region in the center, keeping the park environment identical

Attention Manipulation

Control how much the model focuses on specific words in your prompt by directly modifying attention weights. Unlike changing the prompt text, this amplifies or reduces the model's internal focus on existing words.

Amplify attention (>1.0) → Make the model pay more attention to specific words
Reduce attention (<1.0) → Decrease focus on certain words
Spatial control → Apply attention changes only to specific image regions

Example: In "a photorealistic portrait of an astronaut", boost attention on "photorealistic" to enhance realism without changing the prompt

Multi-Scale Control

Apply different prompts to different resolution levels of the UNet architecture. This approach lets you control structure and details independently:

Structure Level (lowest resolution) → Overall composition, global layout, major objects
Mid-Level (medium resolution) → Regional features, object relationships, local composition
Detail Level (highest resolution) → Fine textures, surface details, small elements

Example: Generate a castle's overall silhouette with "gothic cathedral" at structure level, while adding "intricate stone carvings" at detail level

The key insight: different resolution levels control different aspects of the final image. By targeting them separately, you achieve unprecedented control over the generation process.

Technical Features

Multi-Architecture Support: SDXL and SD1.5 with automatic detection
Block-Level Control: Target specific UNet blocks (input:0, middle:0, output:1, etc.)
Flexible Interfaces: Simple one-liners to advanced multi-block configurations
Seamless Integration: Drop-in compatibility with HuggingFace Diffusers
Context Management: Automatic patch cleanup with Python context managers
Precise Timing Control: Sigma-based injection windows for optimal effect

Important Concepts

Sigma Ranges: Timing is Everything

Prompt injections are applied during specific phases of the diffusion process using sigma values (noise levels):

High Sigma (~15-3): Early denoising steps, global structure formation
Medium Sigma (~3-0.5): Mid-process, composition and major features
Low Sigma (~0.5-0): Final steps, detail refinement

⚠️ Critical: Default sigma ranges may be too narrow! If your injections seem to have no effect, try wider ranges:

#  Too narrow - might only inject on 1-2 steps
injector.add_injection("middle:0", "dragon", sigma_start=1.0, sigma_end=0.3)

#  Better - injects across most steps  
injector.add_injection("middle:0", "dragon", sigma_start=15.0, sigma_end=0.0)

#  Always inject - bypass timing completely
injector.add_injection("middle:0", "dragon", sigma_start=1000.0, sigma_end=-1000.0)

Quick Examples

Prompt Injection: Content/Style Separation

from core_pulse import SimplePromptInjector
from diffusers import StableDiffusionXLPipeline

pipeline = StableDiffusionXLPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")

# Inject "white cat" into content blocks while keeping base prompt
with SimplePromptInjector(pipeline) as injector:
    injector.configure_injections(
        block="middle:0",  # Content block
        prompt="white cat",
        weight=2.0,        # Strong enough to be visible
        sigma_start=15.0,  # Start early in process
        sigma_end=0.0      # Continue through final steps
    )
    
    # Base prompt provides context, injection overrides content
    result = injector("a blue dog in a garden", num_inference_steps=30)
    # Result: A white cat in a garden (content replaced, context preserved)

Regional/Spatial Injection: Surgical Precision

from core_pulse import RegionalPromptInjector
from core_pulse.prompt_injection.spatial import create_center_circle_mask

# Create a spatial mask for the region you want to modify
mask = create_center_circle_mask(image_size=(1024, 1024), radius=300)

with RegionalPromptInjector(pipeline) as injector:
    injector.add_regional_injection(
        block="middle:0",
        prompt="golden retriever dog",  # Replace with this
        mask=mask,                      # Only in this region
        weight=2.5,
        sigma_start=15.0,
        sigma_end=0.0
    )
    
    # The mask ensures only the center region changes
    result = injector("a cat playing at a park", num_inference_steps=30)
    # Result: Dog in center, park environment perfectly preserved

Available Mask Shapes:

from core_pulse.prompt_injection.spatial import (
    create_rectangle_mask,     # Custom rectangular regions
    create_circle_mask,        # Circular regions  
    create_left_half_mask,     # Left/right halves
    create_top_half_mask,      # Top/bottom halves
    create_center_square_mask, # Centered shapes
    MaskFactory.from_image     # Load custom masks from images
)

Attention Manipulation: Focus Control

from core_pulse import AttentionMapInjector

# Boost attention on specific words without changing the prompt
with AttentionMapInjector(pipeline) as injector:
    injector.add_attention_manipulation(
        prompt="a photorealistic portrait of an astronaut",
        block="all",  
        target_phrase="photorealistic",
        attention_scale=5.0,  # 5x more attention on "photorealistic"
        sigma_start=15.0,     # Apply throughout generation
        sigma_end=0.0
    )
    
    # Same prompt, but model focuses much more on making it photorealistic
    result = injector(
        prompt="a photorealistic portrait of an astronaut",
        num_inference_steps=30
    )

Multi-Scale Control: Structure + Details

from core_pulse import MultiScaleInjector

# Control structure and details independently
with MultiScaleInjector(pipeline) as injector:
    # Structure: What the overall composition should be
    injector.add_structure_injection(
        "gothic cathedral silhouette, imposing architecture",
        weight=2.0
    )
    
    # Details: What the surface textures should look like  
    injector.add_detail_injection(
        "weathered stone, intricate carvings, moss-covered surfaces",
        weight=1.8
    )
    
    # Base prompt provides the scene context
    result = injector(
        prompt="a building in a misty landscape",
        num_inference_steps=30
    )
    # Result: Gothic cathedral structure with detailed stone textures

When to Use Which Technique

Technique	Use When	Example
Prompt Injection	You want to replace/add content while keeping context	Generate a cat in a dog scene
Token-Level Attention Masking	You want to selectively ignore/emphasize parts of your prompt	Mask out "cat" tokens, keep "playing at park"
Regional/Spatial Injection	You want surgical precision - change specific image regions only	Replace center region with dog, keep park untouched
Attention Manipulation	You want to emphasize existing words more strongly	Make "photorealistic" really count
Multi-Scale Control	You want different structure and details	Castle structure + stone texture details
Combined Techniques	Complex control over multiple aspects	Regional masking + multi-scale control

Installation

uv sync  # Install all dependencies
uv sync --extra examples  # Include example dependencies
uv sync --extra dev  # Include development tools

Advanced Usage

CorePulse offers multiple levels of control:

Interfaces by Complexity

SimplePromptInjector → One-liner injection for quick experiments
AdvancedPromptInjector → Multi-block, multi-prompt configurations
MultiScaleInjector → Resolution-aware structure/detail control
AttentionMapInjector → Precise attention weight control
RegionalPromptInjector → Spatial masks for region-specific control

Architecture Components

UNetPatcher → Low-level UNet modification engine
UNetBlockMapper → Automatic block detection for any model
PromptInjectionProcessor → Custom attention processors with sigma timing
Utilities → Auto-detection, validation, convenience functions

Real-World Examples

Content/Style Split (examples.py):

Generate a cat with oil painting style in a photorealistic scene

Token-Level Attention Masking (attention_masking_examples.py):

Selective prompt token control: mask "cat" tokens while preserving "park" context
Linguistic precision: control what parts of prompt get processed
Demonstrates token-level vs full prompt processing

Regional/Spatial Injection (spatial_injection_examples.py):

Surgical region replacement: change center region while preserving surroundings
Spatial masking: modify specific image areas while keeping context intact
Demonstrates spatial precision control vs full scene modification

Multi-Scale Architecture (advanced_control_examples.py):

Structure level: "medieval fortress" → Controls overall building shape
Detail level: "weathered stone textures" → Controls surface appearance
Independent control of composition vs fine details

Attention Boost (attention_examples.py):

Amplify "photorealistic" attention for enhanced realism
Reduce background element attention for focus control

Regional Control (sdxl_examples.py):

Left half: crystal castle, Right half: fire dragon
Spatial masks with soft blending

Troubleshooting

"My injections don't seem to work / images look identical"

This is usually a sigma range issue. The injection system works perfectly, but narrow sigma ranges mean injections only apply to 1-2 denoising steps out of 20-50.

Quick Fix:

# Instead of default ranges, use wide ranges for guaranteed effect
injector.add_injection(
    block="middle:0", 
    prompt="your injection", 
    weight=3.0,
    sigma_start=15.0,    # Start early
    sigma_end=0.0        # End late  
)

Debug Steps:

Use extreme weights (5.0-10.0) to test if injection works at all
Use wide sigma ranges (15.0 → 0.0) to maximize injection window
Use more inference steps (30-50) to give more chances for injection
Use dramatically different prompts ("dragon" vs "building") to see clear differences

"Injections are too weak"

Increase weight: Try 2.0-5.0 instead of 1.0
Use multiple blocks: Inject into several blocks for cumulative effect
Check semantic compatibility: Conflicting prompts can cancel each other out

"Multi-scale injections create chaotic results"

Multi-scale control is powerful but requires thoughtful prompt design:

Use semantically compatible prompts:
- Good: "gothic cathedral" (structure) + "weathered stone" (details)
- Bad: "modern building" (structure) + "organic textures" (details)
Avoid overlapping sigma ranges: Structure and detail injections should target different phases:
- Structure: Early phases (sigma 15.0→0.5) for composition
- Details: Later phases (sigma 3.0→0.0) for surface textures
Start simple: Test structure-only, then detail-only, then combine
Think hierarchically: Structure defines the "what", details define the "how it looks"

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
core_pulse		core_pulse
media		media
tests		tests
.gitignore		.gitignore
README.md		README.md
advanced_control_examples.py		advanced_control_examples.py
attention_examples.py		attention_examples.py
attention_masking_examples.py		attention_masking_examples.py
examples.py		examples.py
pyproject.toml		pyproject.toml
sdxl_examples.py		sdxl_examples.py
spatial_injection_examples.py		spatial_injection_examples.py
test_multi_scale_examples.py		test_multi_scale_examples.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CorePulse

Core Concepts

Prompt Injection

Token-Level Attention Masking

Regional/Spatial Injection

Attention Manipulation

Multi-Scale Control

Technical Features

Important Concepts

Sigma Ranges: Timing is Everything

Quick Examples

Prompt Injection: Content/Style Separation

Regional/Spatial Injection: Surgical Precision

Attention Manipulation: Focus Control

Multi-Scale Control: Structure + Details

When to Use Which Technique

Installation

Advanced Usage

Interfaces by Complexity

Architecture Components

Real-World Examples

Troubleshooting

"My injections don't seem to work / images look identical"

"Injections are too weak"

"Multi-scale injections create chaotic results"

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CorePulse

Core Concepts

Prompt Injection

Token-Level Attention Masking

Regional/Spatial Injection

Attention Manipulation

Multi-Scale Control

Technical Features

Important Concepts

Sigma Ranges: Timing is Everything

Quick Examples

Prompt Injection: Content/Style Separation

Regional/Spatial Injection: Surgical Precision

Attention Manipulation: Focus Control

Multi-Scale Control: Structure + Details

When to Use Which Technique

Installation

Advanced Usage

Interfaces by Complexity

Architecture Components

Real-World Examples

Troubleshooting

"My injections don't seem to work / images look identical"

"Injections are too weak"

"Multi-scale injections create chaotic results"

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages