
Commit 22a90da

feat(image): character consistency, style transfer, provider modernization
- Add referenceImageUrl, faceEmbedding, consistencyMode to ImageGenerationRequest
- Replicate: dual-endpoint support (/models/.../predictions + /predictions)
- Replicate: expand model catalog from 3 to 13 (Flux 1.1 Pro, Ultra, Redux, Canny, Depth, Pulid, Fill Pro, SDXL Lightning, Real-ESRGAN)
- Replicate: auto-select Pulid for strict consistency, ControlNet routing by controlType
- Fal: add editImage() (img2img + inpaint), expand catalog to 7 models
- Fal: IP-Adapter character consistency mapping
- SD-Local: IP-Adapter ControlNet injection for character consistency
- PolicyAwareImageRouter: capability filtering (character-consistency, controlnet, style-transfer)
- AvatarPipeline: per-stage consistency mode (strict for expressions, balanced for body)
- New transferStyle() high-level API via Flux Redux with multi-provider fallback
- OpenAI/Stability/OpenRouter/BFL: graceful debug warning for unsupported referenceImageUrl
- 59 new tests, 170 total passing across image subsystem
- New docs: CHARACTER_CONSISTENCY.md, STYLE_TRANSFER.md
- Updated: CHANGELOG, README, HIGH_LEVEL_API, image-gen SKILL
1 parent f973a97 commit 22a90da

30 files changed

Lines changed: 4937 additions & 24 deletions

CHANGELOG.md

Lines changed: 23 additions & 0 deletions
@@ -1,3 +1,26 @@

## [Unreleased]

### Added

- `transferStyle()` high-level API for image-guided style transfer via Flux Redux
- Character consistency fields on `ImageGenerationRequest`: `referenceImageUrl`, `faceEmbedding`, `consistencyMode`
- Replicate: dual-endpoint support (modern `/models/.../predictions` + legacy `/predictions`)
- Replicate: 10 new models in catalog (Flux 1.1 Pro, Ultra, Redux, Canny, Depth, Fill Pro, Pulid, SDXL Lightning, SDXL, Real-ESRGAN)
- Replicate: character consistency via Pulid auto-selection when `consistencyMode: 'strict'`
- Replicate: ControlNet image input (`controlImage`, `controlType`) for Flux Canny/Depth
- Fal: `editImage()` support (img2img + inpainting)
- Fal: 4 new models in catalog (Pro 1.1, Ultra, LoRA, Realism)
- Fal: IP-Adapter character consistency mapping
- SD-Local: IP-Adapter character consistency via ControlNet injection
- `PolicyAwareImageRouter`: `'character-consistency'` capability filtering
- `AvatarPipeline`: per-stage consistency mode (`strict` for expressions, `balanced` for body)
- `docs/features/CHARACTER_CONSISTENCY.md`
- `docs/features/STYLE_TRANSFER.md`
- 59 new tests across providers, APIs, and integration scenarios
- OpenAI, Stability, OpenRouter, BFL: graceful debug warning when `referenceImageUrl` is set but unsupported

### Changed

- Replicate: default inpaint model upgraded from `flux-fill` to `flux-fill-pro`

## <small>0.1.177 (2026-04-04)</small>

* fix(api): include systemBlocks on exported AgentOptions interface ([d79ddab](https://github.com/framersai/agentos/commit/d79ddab))

README.md

Lines changed: 3 additions & 1 deletion
@@ -795,10 +795,11 @@ const resilient = agent({

  | `generateObject(opts)` | Zod-validated structured output extraction |
  | `streamObject(opts)` | Streaming structured output |
  | `embedText(opts)` | Text embedding generation (single or batch) |
- | `generateImage(opts)` | Image generation (OpenAI, Stability, Replicate, BFL, Fal) |
+ | `generateImage(opts)` | Image generation with character consistency (7 providers) |
  | `editImage(opts)` | Image editing/inpainting |
  | `upscaleImage(opts)` | Image upscaling |
  | `variateImage(opts)` | Image variations |
+ | `transferStyle(opts)` | Style transfer via Flux Redux / img2img |
  | `generateVideo(opts)` | Video generation |
  | `analyzeVideo(opts)` | Video analysis and understanding |
  | `detectScenes(opts)` | Scene detection in video |

@@ -829,6 +830,7 @@ import type {

    AgencyOptions,         // agency() configuration
    GenerateTextOptions,   // generateText() / streamText() options
    GenerateImageOptions,  // generateImage() options
+   TransferStyleOptions,  // transferStyle() options
    GenerateObjectOptions, // generateObject() options
    EmbedTextOptions,      // embedText() options
    ExtensionDescriptor,   // Extension pack descriptor
docs/features/CHARACTER_CONSISTENCY.md

Lines changed: 128 additions & 0 deletions
@@ -0,0 +1,128 @@

# Character Consistency — Face-Preserving Image Generation

> Generate images that maintain a consistent character identity across multiple outputs using reference images and face embeddings.

---

## Overview

Character consistency lets you anchor generated images to a reference face or character, ensuring the same person appears across portraits, expressions, full-body shots, and scene illustrations. AgentOS supports three levels of consistency via the `consistencyMode` parameter:

| Mode | Strength | Use Case |
|------|----------|----------|
| `'strict'` | 0.85–0.9 | Avatar expression sheets, emotion variants. Face must match exactly. |
| `'balanced'` | 0.6 | Full-body shots, different angles. Recognizable but allows natural variation. |
| `'loose'` | 0.3 | "Inspired by" generations. Style/mood carries over, face may drift. |
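The mode-to-strength mapping in the table above can be sketched as a small helper. This is illustrative only: the function name `strengthForMode` is not part of the AgentOS API, and the `'strict'` value here is the midpoint of the documented 0.85–0.9 range.

```typescript
type ConsistencyMode = 'strict' | 'balanced' | 'loose';

// Map a consistency mode to an identity-preservation strength.
// Values come from the table above; 'strict' uses the midpoint
// of its documented 0.85–0.9 range.
function strengthForMode(mode: ConsistencyMode): number {
  switch (mode) {
    case 'strict':
      return 0.875;
    case 'balanced':
      return 0.6;
    case 'loose':
      return 0.3;
  }
}
```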
## Provider Support

| Provider | Mechanism | Models |
|----------|-----------|--------|
| **Replicate** | Pulid (strict), Flux image input (balanced/loose) | `zsxkib/pulid`, `black-forest-labs/flux-dev` |
| **Fal** | IP-Adapter | `fal-ai/flux/dev` |
| **SD-Local** | ControlNet + IP-Adapter extension | Any SD 1.5 / SDXL checkpoint |
| OpenAI | Not supported (graceful ignore) | — |
| Stability | Not supported (graceful ignore) | — |

## Basic Usage

```typescript
import { generateImage } from '@framers/agentos';

// Generate a consistent expression variant
const result = await generateImage({
  provider: 'replicate',
  prompt: 'Portrait of the character smiling warmly, soft lighting',
  referenceImageUrl: 'https://storage.example.com/character-neutral.png',
  consistencyMode: 'strict',
});
```

When `consistencyMode` is `'strict'` and no model is explicitly set, Replicate auto-selects `zsxkib/pulid` for maximum face consistency.
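That auto-selection rule can be sketched as follows. The helper name `selectReplicateModel` is hypothetical; the model ids and the decision rule come from this document (explicit `model` wins, strict consistency with a reference falls back to Pulid, otherwise the Flux default).

```typescript
interface ModelSelectionInput {
  model?: string;
  referenceImageUrl?: string;
  consistencyMode?: 'strict' | 'balanced' | 'loose';
}

// Sketch of Replicate model auto-selection: an explicitly requested
// model is always honored; strict consistency with a reference image
// routes to Pulid; everything else uses the Flux default.
function selectReplicateModel(req: ModelSelectionInput): string {
  if (req.model) return req.model;
  if (req.referenceImageUrl && req.consistencyMode === 'strict') {
    return 'zsxkib/pulid';
  }
  return 'black-forest-labs/flux-dev';
}
```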
## Fields Reference

### `referenceImageUrl`

URL or base64 data URI of the reference character image. Each provider maps this to its native mechanism:

- **Replicate (Pulid):** `main_face_image` input
- **Replicate (standard Flux):** `image` input with `image_strength`
- **Fal:** `ip_adapter_image` body field
- **SD-Local:** ControlNet `input_image` with IP-Adapter preprocessor
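The per-provider mapping above amounts to a dispatch on the target backend. A minimal sketch, assuming the field names listed above (the `mapReferenceImage` helper and the `Target` union are illustrative, not the internal adapter API):

```typescript
type Target = 'replicate-pulid' | 'replicate-flux' | 'fal' | 'sd-local';

// Sketch: translate referenceImageUrl into each backend's native
// request field, per the mapping documented above.
function mapReferenceImage(target: Target, url: string): Record<string, unknown> {
  switch (target) {
    case 'replicate-pulid':
      return { main_face_image: url };
    case 'replicate-flux':
      return { image: url, image_strength: 0.6 }; // balanced default
    case 'fal':
      return { ip_adapter_image: url };
    case 'sd-local':
      return { input_image: url }; // fed to the ControlNet IP-Adapter unit
  }
}
```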
### `faceEmbedding`

Optional 512-dimensional vector from InsightFace or equivalent. Used by the `AvatarPipeline` for drift detection — after generating each image, the pipeline extracts the face embedding from the output and compares it to this anchor via cosine similarity. Images that drift below the threshold (default 0.6) are regenerated.
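The drift check described above reduces to a cosine-similarity comparison between the anchor embedding and the embedding extracted from each generated image. A self-contained sketch (helper names are illustrative; the 0.6 threshold is the documented default):

```typescript
// Cosine similarity between two equal-length embedding vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// An image has "drifted" when its face embedding falls below the
// similarity threshold relative to the anchor, triggering regeneration.
function hasDrifted(anchor: number[], generated: number[], threshold = 0.6): boolean {
  return cosineSimilarity(anchor, generated) < threshold;
}
```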
### `consistencyMode`

Controls how aggressively the provider preserves the reference identity:

```typescript
// Strict — for expression sheets where faces must match
await generateImage({
  prompt: 'Character looking angry, dramatic lighting',
  referenceImageUrl: neutralPortrait,
  consistencyMode: 'strict', // Pulid auto-selected on Replicate
});

// Balanced — for full-body shots
await generateImage({
  prompt: 'Full body shot of the character walking through a market',
  referenceImageUrl: neutralPortrait,
  consistencyMode: 'balanced',
});

// Loose — for "inspired by" mood pieces
await generateImage({
  prompt: 'Abstract portrait in the style of the character',
  referenceImageUrl: neutralPortrait,
  consistencyMode: 'loose',
});
```

## AvatarPipeline Integration

The `AvatarPipeline` uses consistency modes per stage:

| Stage | Mode | Rationale |
|-------|------|-----------|
| `neutral_portrait` | none | This IS the anchor — no reference exists yet |
| `face_embedding` | none | Extraction, not generation |
| `expression_sheet` | `'strict'` | Facial identity must match across all emotions |
| `animated_emotes` | `'strict'` | Same character in motion |
| `full_body` | `'balanced'` | Body proportions can vary; face should be recognizable |
| `additional_angles` | `'balanced'` | 3/4 and profile views naturally differ from frontal |

```typescript
import { AvatarPipeline } from '@framers/agentos/media/avatar';

const pipeline = new AvatarPipeline(faceService, imageGenerator);
const result = await pipeline.generate({
  characterId: 'hero_001',
  identity: {
    displayName: 'Kael Stormwind',
    ageBand: 'young_adult',
    faceDescriptor: 'sharp jawline, green eyes, short dark hair, small scar above left eyebrow',
  },
  generationConfig: {
    baseModel: 'black-forest-labs/flux-dev',
    provider: 'replicate',
  },
  stages: ['neutral_portrait', 'face_embedding', 'expression_sheet', 'full_body'],
});
```

## Choosing the Right Mode

- **Avatars and expression sheets:** Always `'strict'`. The face is the product.
- **Scene illustrations with known characters:** `'balanced'`. Character should be recognizable but the scene composition matters more.
- **Style exploration and mood boards:** `'loose'`. The reference influences the vibe, not the pixels.
- **No reference at all:** Omit `referenceImageUrl` entirely. The fields are fully optional.

## Related

- [Image Generation](./IMAGE_GENERATION.md) — Provider-agnostic generation API
- [Style Transfer](./STYLE_TRANSFER.md) — Transfer visual aesthetics between images
- [Image Editing](./IMAGE_EDITING.md) — Img2img, inpainting, upscaling

docs/features/STYLE_TRANSFER.md

Lines changed: 101 additions & 0 deletions
@@ -0,0 +1,101 @@

# Style Transfer — Image-Guided Aesthetic Translation

> Apply the visual style of one image to another using `transferStyle()`, backed by Flux Redux and cross-provider img2img.

---

## Overview

`transferStyle()` takes a source image and a style reference image, then produces an output that combines the content of the source with the visual aesthetic of the reference. This is useful for:

- Converting photographs to specific art styles (oil painting, anime, pixel art)
- Applying a brand's visual identity to generated content
- Creating consistent visual themes across a set of images

## `transferStyle()` API

```typescript
import { transferStyle } from '@framers/agentos';

const result = await transferStyle({
  image: './photo.jpg',
  styleReference: './monet-waterlilies.jpg',
  prompt: 'Impressionist oil painting, visible brushstrokes, warm golden light',
  strength: 0.7,
});

console.log(result.images[0].url);
console.log(result.provider); // 'replicate'
console.log(result.model);    // 'black-forest-labs/flux-redux-dev'
```

## Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `image` | `string \| Buffer` | **required** | Source image (file path, URL, data URI, or Buffer) |
| `styleReference` | `string \| Buffer` | **required** | Reference image whose style to apply |
| `prompt` | `string` | **required** | Text guiding the transfer direction |
| `strength` | `number` | `0.7` | How much reference style to apply (0 = unchanged, 1 = full transfer) |
| `provider` | `string` | auto-detect | Override provider selection |
| `model` | `string` | provider default | Override model selection |
| `size` | `string` | — | Output dimensions (e.g. `'1024x1024'`) |
| `negativePrompt` | `string` | — | Content to avoid |
| `seed` | `number` | — | Reproducibility seed |
| `policyTier` | `string` | — | Content policy tier for provider routing |

## Provider Routing

When no provider is specified, `transferStyle()` auto-detects the best available provider from environment variables:

| Priority | Provider | Model | How It Works |
|----------|----------|-------|--------------|
| 1 | Replicate | Flux Redux Dev | Purpose-built for image-guided generation. Style reference as primary input. |
| 2 | Fal | Flux Dev | img2img with style description in prompt |
| 3 | Stability | stable-image-core | img2img with strength parameter |
| 4 | OpenAI | gpt-image-1 | editImage with descriptive prompt |

Replicate with Flux Redux produces the best results for style transfer because the model was trained specifically for image-conditioned generation.
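The priority-ordered fallback can be sketched as a first-match scan over credentials. This is a sketch only: the environment variable names (`REPLICATE_API_TOKEN`, `FAL_KEY`, `STABILITY_API_KEY`, `OPENAI_API_KEY`) are assumptions based on each vendor's conventions and are not confirmed by this commit, and `detectProvider` is not the library's actual function name.

```typescript
// Sketch of priority-ordered provider auto-detection. The env var
// names below are assumed conventions, not confirmed API surface.
const PROVIDER_PRIORITY: ReadonlyArray<{ provider: string; envVar: string }> = [
  { provider: 'replicate', envVar: 'REPLICATE_API_TOKEN' },
  { provider: 'fal', envVar: 'FAL_KEY' },
  { provider: 'stability', envVar: 'STABILITY_API_KEY' },
  { provider: 'openai', envVar: 'OPENAI_API_KEY' },
];

// Return the highest-priority provider whose credential is present.
function detectProvider(env: Record<string, string | undefined>): string | undefined {
  return PROVIDER_PRIORITY.find(({ envVar }) => Boolean(env[envVar]))?.provider;
}
```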
59+
60+
## Strength Guide
61+
62+
| Range | Effect | Use Case |
63+
|-------|--------|----------|
64+
| 0.1–0.3 | Subtle color grading, minor texture shifts | Brand color overlays |
65+
| 0.4–0.6 | Moderate style influence, composition preserved | "In the style of" variations |
66+
| 0.7–0.8 | Strong style transfer, content recognizable | Art style conversion |
67+
| 0.9–1.0 | Near-complete adoption of reference aesthetic | Full aesthetic transformation |
68+
69+
## Examples
70+
71+
```typescript
72+
// Photograph → anime style
73+
const anime = await transferStyle({
74+
image: './portrait-photo.jpg',
75+
styleReference: './ghibli-frame.png',
76+
prompt: 'Studio Ghibli anime style, cel shading, vibrant colors',
77+
strength: 0.75,
78+
});
79+
80+
// Photograph → pixel art
81+
const pixel = await transferStyle({
82+
image: './landscape.jpg',
83+
styleReference: './pixel-art-reference.png',
84+
prompt: '16-bit pixel art, limited palette, retro game aesthetic',
85+
strength: 0.8,
86+
});
87+
88+
// Apply brand visual identity
89+
const branded = await transferStyle({
90+
image: './product-photo.jpg',
91+
styleReference: './brand-style-guide.png',
92+
prompt: 'Clean, modern, brand-consistent visual treatment',
93+
strength: 0.5,
94+
});
95+
```
96+
97+
## Related
98+
99+
- [Image Generation](./IMAGE_GENERATION.md) — Text-to-image generation
100+
- [Image Editing](./IMAGE_EDITING.md) — Img2img, inpainting, upscaling
101+
- [Character Consistency](./CHARACTER_CONSISTENCY.md) — Face-preserving generation

docs/getting-started/HIGH_LEVEL_API.md

Lines changed: 3 additions & 2 deletions
@@ -6,7 +6,7 @@ Everything is one import. Pick the function that fits your task:

  import {
    generateText, streamText,     // Text generation
    generateObject, streamObject, // Structured output (Zod validated)
-   generateImage,                // Image generation
+   generateImage, transferStyle, // Image generation & style transfer
    generateVideo, analyzeVideo,  // Video generation & analysis
    generateMusic, generateSFX,   // Audio generation
    performOCR,                   // Vision / OCR

@@ -23,7 +23,8 @@ import {

  | `generateText()` | One-shot text generation | `await generateText({ provider: 'openai', prompt: '...' })` |
  | `streamText()` | Stream text in real-time | `for await (const d of streamText({...}).textStream) {}` |
  | `generateObject()` | Extract structured JSON (Zod) | `await generateObject({ schema: z.object({...}), prompt: '...' })` |
- | `generateImage()` | Generate images | `await generateImage({ provider: 'openai', prompt: '...' })` |
+ | `generateImage()` | Generate images (with character consistency) | `await generateImage({ provider: 'openai', prompt: '...' })` |
+ | `transferStyle()` | Style transfer between images | `await transferStyle({ image: src, styleReference: ref, prompt: '...' })` |
  | `generateVideo()` | Generate video from text/image | `await generateVideo({ prompt: '...' })` |
  | `generateMusic()` | Generate music | `await generateMusic({ prompt: '...' })` |
  | `performOCR()` | Extract text from images | `await performOCR({ imagePath: './doc.png' })` |
