A production-ready RunPod serverless endpoint for Alibaba's Qwen-Image, a powerful text-to-image foundation model with superior text rendering in both English and Chinese.
- Official Qwen-Image Model - 20B MMDiT image foundation model
- GPU Optimized - Runs on A100 80GB, H100 PCIe, H100 HBM3, H100 NVL, and high-end workstation GPUs
- Auto-scaling - Scales to 0 when idle to save costs
- Network Volume Storage - Model cached persistently across all workers
- Fast Cold Starts - Optimized Docker image with pre-installed dependencies
- Model: Qwen/Qwen-Image (20B parameters)
- Recommended VRAM: 80GB (A100/H100 recommended)
- Precision: bfloat16 (CUDA) / float32 (CPU)
- Default Resolution: 1024x1024
- Text Rendering: Exceptional quality for both English and Chinese text
- License: Apache 2.0
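The endpoint's parameters map directly onto the Qwen-Image pipeline in Hugging Face diffusers. Here is a minimal local sketch using the same defaults (illustrative only; the actual worker handler may differ):

```python
import torch
from diffusers import DiffusionPipeline

# bfloat16 on CUDA, float32 on CPU, matching the precision listed above
device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.bfloat16 if device == "cuda" else torch.float32

pipe = DiffusionPipeline.from_pretrained("Qwen/Qwen-Image", torch_dtype=dtype).to(device)

image = pipe(
    prompt="A coffee shop sign that reads 'Qwen Coffee'",
    negative_prompt=" ",
    width=1024,
    height=1024,
    num_inference_steps=50,
    true_cfg_scale=4.0,
    generator=torch.Generator(device=device).manual_seed(42),
).images[0]
image.save("local_test.png")
```

When calling the deployed endpoint instead, send a request shaped like this: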
```json
{
  "input": {
    "prompt": "Your image description here",
    "negative_prompt": " ",
    "width": 1024,
    "height": 1024,
    "num_inference_steps": 50,
    "true_cfg_scale": 4.0,
    "seed": null
  }
}
```

| Parameter | Type | Default | Description |
|---|---|---|---|
| `prompt` | string | required | Description of the image to generate |
| `negative_prompt` | string | `" "` | What to avoid in the image |
| `width` | integer | 1024 | Image width (pixels) |
| `height` | integer | 1024 | Image height (pixels) |
| `num_inference_steps` | integer | 50 | Number of denoising steps (higher = better quality, slower) |
| `true_cfg_scale` | float | 4.0 | Classifier-free guidance scale |
| `seed` | integer | null | Random seed for reproducibility (optional) |
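For illustration, here is a hypothetical sketch of how a handler could merge these defaults with an incoming request (`parse_input` and `DEFAULTS` are invented names for this example, not part of the template):

```python
DEFAULTS = {
    "negative_prompt": " ",
    "width": 1024,
    "height": 1024,
    "num_inference_steps": 50,
    "true_cfg_scale": 4.0,
    "seed": None,
}

def parse_input(job_input: dict) -> dict:
    """Apply the documented defaults; only 'prompt' is required."""
    if not job_input.get("prompt"):
        raise ValueError("'prompt' is required")
    return {**DEFAULTS, **job_input}
```

A successful response has this shape: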
```json
{
  "image": "base64_encoded_png_data",
  "seed": 12345
}
```

```python
import runpod
import base64
from PIL import Image
import io
runpod.api_key = "your_api_key_here"
endpoint = runpod.Endpoint("YOUR_ENDPOINT_ID")
request = {
    "input": {
        "prompt": "A serene mountain landscape with Chinese calligraphy 'Harmony'",
        "width": 1024,
        "height": 1024,
        "num_inference_steps": 50,
        "seed": 42
    }
}
run = endpoint.run_sync(request)
# Decode and save image
img_data = base64.b64decode(run['image'])
image = Image.open(io.BytesIO(img_data))
image.save('output.png')
print(f"Generated with seed: {run['seed']}")
```

```bash
curl -X POST https://api.runpod.ai/v2/YOUR_ENDPOINT_ID/runsync \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
  -d '{
    "input": {
      "prompt": "A futuristic cityscape at sunset",
      "width": 1024,
      "height": 1024,
      "num_inference_steps": 50
    }
  }'
```
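If you call the REST API directly (as in the cURL example above), note that `/runsync` wraps the handler result under an `output` key. A sketch of the same call from Python with `requests`:

```python
import base64
import requests

resp = requests.post(
    "https://api.runpod.ai/v2/YOUR_ENDPOINT_ID/runsync",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={"input": {"prompt": "A futuristic cityscape at sunset"}},
    timeout=600,
)
resp.raise_for_status()
body = resp.json()

# The handler result lives under "output"; very long jobs may instead return
# a status payload whose job ID has to be polled via the /status endpoint.
with open("output.png", "wb") as f:
    f.write(base64.b64decode(body["output"]["image"]))
```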
The template is configured with optimal settings in `runpod.toml`:
- GPU Types: A100 80GB PCIe, H100 PCIe, H100 HBM3, H100 NVL, RTX 6000 Blackwell, RTX 6000 Blackwell Workstation, RTX Pro 6000 Max-Q Workstation
- Recommended VRAM: 80GB
- Container Disk: 5GB (code + dependencies)
- Network Volume: ~100GB (persistent model storage) - ⚠️ REQUIRED
- Workers: 0-3 (auto-scaling)
- Timeout: 600 seconds per job
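Because a render can take tens of seconds and the job timeout is 600 seconds, long-running requests are often better submitted asynchronously. A sketch using the RunPod Python SDK's `run()` instead of `run_sync()` (method names follow the SDK's Job interface; check your SDK version):

```python
import runpod

runpod.api_key = "your_api_key_here"
endpoint = runpod.Endpoint("YOUR_ENDPOINT_ID")

# Submit without blocking, then poll until the job finishes
job = endpoint.run({"input": {"prompt": "An ornate temple gate", "num_inference_steps": 100}})
print(job.status())               # e.g. IN_QUEUE or IN_PROGRESS
result = job.output(timeout=600)  # blocks until the job completes or times out
print(f"Generated with seed: {result['seed']}")
```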
You MUST attach a network volume (~100GB) when deploying this endpoint.
The Qwen-Image model is ~57GB and requires significant disk space. Without a network volume:
- ❌ Deployment will fail due to insufficient disk space
- ❌ Model cannot be downloaded or cached
- ❌ Workers will crash during initialization
The network volume:
- ✅ Stores the model persistently across all workers
- ✅ Prevents re-downloading the model on every cold start
- ✅ Enables faster scaling and startup times
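On RunPod serverless, network volumes mount at `/runpod-volume`. One common pattern (an assumption about this template's internals, shown for illustration) is to point the Hugging Face cache there so the download survives worker restarts:

```python
import os

# Cache model weights on the network volume instead of ephemeral container disk.
# Must be set before importing diffusers/transformers so the cache path is picked up.
os.environ["HF_HOME"] = "/runpod-volume/huggingface"

# Any later from_pretrained("Qwen/Qwen-Image") call now reads and writes the
# persistent cache, so the ~57GB download happens only once per volume.
```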
- Cold Start: ~60-120 seconds (model download on first run)
- Warm Inference: ~20-40 seconds (depends on steps and resolution)
- Memory Usage: ~50-60GB VRAM for 1024x1024 images (20B parameter model)
- Prompt Quality: Be specific and descriptive
- Steps: 30-50 steps for good quality, 50-100 for best quality
- CFG Scale: 3.5-5.0 works well for most prompts
- Text Rendering: Qwen-Image excels at rendering text - great for logos, signs, and calligraphy
- Seed: Use the same seed to reproduce images
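For example, resubmitting the exact same payload with a fixed seed should reproduce the image (bit-for-bit equality can still vary across GPU types):

```python
import runpod

runpod.api_key = "your_api_key_here"
endpoint = runpod.Endpoint("YOUR_ENDPOINT_ID")

payload = {"input": {"prompt": "A red lantern in the rain", "seed": 1234}}

# Same seed and parameters follow the same sampling path
first = endpoint.run_sync(payload)
second = endpoint.run_sync(payload)
print(first["image"] == second["image"])  # True when generation is deterministic
```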
This endpoint uses the Qwen-Image model licensed under Apache 2.0. For more information, visit the official Qwen-Image repository.
For issues or questions about this RunPod template, please open an issue on the GitHub repository.
