-
Notifications
You must be signed in to change notification settings - Fork 0
Provider Bytedance
Mode: video · Models: 2
Vendor: BytePlus · Official API docs: OmniHuman 1.5 overview
ByteDance models on the BytePlus Vision AI platform. OmniHuman 1.5 is an audio-driven avatar model — give it a single portrait image plus an audio clip and it generates a talking/performing video (expression and motion are driven by the audio, not a text prompt). A separate ByteDance Upscaler restores and upscales an existing clip to 1080p.
| id | Name | Input type |
|---|---|---|
bytedance-omnihuman-v1.5 |
ByteDance OmniHuman | i2v |
bytedance-video-upscaler |
ByteDance Upscaler | v2v |
# audio-driven talking avatar: portrait image + audio clip
gen-ai generate -m bytedance-omnihuman-v1.5 \
-i ./portrait.jpg -a ./speech.mp3 \
-p "subtle head movement, slow camera push-in"
# upscale an existing clip to 1080p
gen-ai generate -m bytedance-video-upscaler --video ./clip.mp4{ "name": "picsart_generate",
"arguments": {
"model": "bytedance-omnihuman-v1.5",
"imageUrls": ["https://example.com/portrait.jpg"],
"audioUrl": "https://example.com/speech.mp3",
"prompt": "subtle head movement, slow camera push-in"
} }{ "name": "picsart_generate",
"arguments": {
"model": "bytedance-video-upscaler",
"videoUrl": "https://example.com/clip.mp4"
} }Full parameter surface for every model, sourced from gen-ai models info <id> --json. CLI flags show the primary short form; the canonical --kebab-case long form always works too.
Try bytedance-video-upscaler in Playground ↗
Input type: v2v
| Param | CLI flag | Type | Values |
|---|---|---|---|
videoUrl |
--video |
file | required video |
Try bytedance-omnihuman-v1.5 in Playground ↗
Input type: i2v
| Param | CLI flag | Type | Values |
|---|---|---|---|
prompt |
-p |
text | free text |
imageUrls |
-i |
file | required image (up to 1) |
audioUrl |
-a |
file | required audio |
Notes: OmniHuman 1.5 derives emotion and lip-sync from the audio, so
promptis optional and only steers camera/motion. The video upscaler takes only a source video.
gen-ai pricing bytedance-omnihuman-v1.5Cost scales with the duration of the generated video (driven by the length of the input audio clip).
Picsart CLI & MCP · Repo · AI Playground app
Getting Started
Interfaces
Concepts
Model Reference
Providers
- All providers
- Async
- ByteDance
- Creatify
- ElevenLabs
- Flux (Black Forest Labs)
- Grok (xAI)
- Happy Horse
- HeyGen
- Hunyuan
- Ideogram
- Kling
- LTX (Lightricks)
- Luma
- MiniMax
- OpenAI
- OVI
- Picsart
- Pika
- PixVerse
- Qwen (Alibaba)
- Recraft
- Reve
- Runway
- Seedance
- Seedream
- Topaz
- VEED
- Videography
- Wan (Alibaba)
More