-
Notifications
You must be signed in to change notification settings - Fork 0
Provider Google
h-avoyan edited this page Jun 18, 2026
·
3 revisions
Modes: image · video · audio · Models: 13
Vendor: Google AI for Developers · Vertex AI · Official API docs: Image · Video (Veo) · Music (Lyria)
Google contributes across all three modes: the Veo video family, the Nano Banana (Gemini Image) family, Gemini TTS, and Lyria music.
| id | Name | Mode | Input |
|---|---|---|---|
veo-3.1 |
Veo 3.1 | video | t2v |
veo-3.1-fast |
Veo 3.1 Fast | video | t2v |
veo-3.1-lite |
Veo 3.1 Lite | video | t2v |
gemini-3-pro-image |
Nano Banana Pro | image | t2i |
gemini-3.1-flash-image |
Nano Banana 2 | image | t2i |
gemini-2.5-flash-image |
Nano Banana | image | t2i |
gemini-2.5-flash-tts |
Gemini 2.5 Flash TTS | audio | tts |
gemini-2.5-pro-tts |
Gemini 2.5 Pro TTS | audio | tts |
lyria-3-clip |
Lyria 3 Clip | audio | music |
lyria-3-pro |
Lyria 3 Pro | audio | music |
gen-ai models --provider googlelists the current set (13 models).
gen-ai generate -m veo-3.1 -p "a drone shot over a snowy ridge at golden hour" \
--ar 16:9 -r 1080p -d 8 --audio-gen{ "name": "picsart_generate",
"arguments": { "model": "veo-3.1", "prompt": "a drone shot over a snowy ridge", "duration": 8, "resolution": "1080p", "generateAudio": true } }Parameters — veo-3.1
| Param | CLI flag | Type | Values |
|---|---|---|---|
prompt |
-p |
text | required |
aspectRatio |
--ar |
enum |
16:9 · 9:16
|
duration |
-d |
enum |
4 · 6 · 8
|
resolution |
-r |
enum |
720p · 1080p · 4k
|
generateAudio |
--audio-gen |
boolean | native audio |
negativePrompt |
--neg |
text | what to avoid |
startFrame / endFrame
|
--start-frame / --end-frame
|
file | keyframes |
imageUrls |
-i |
file | reference images |
Veo clips are chainable with gen-ai extend (+7s per segment).
gen-ai generate -m gemini-3-pro-image -p "a cinematic product render of a smart speaker" --ar 16:9 -r 4KParameters — gemini-3-pro-image
| Param | CLI flag | Type | Values |
|---|---|---|---|
prompt |
-p |
text | required |
aspectRatio |
--ar |
enum |
1:1 · 16:9 · 9:16 · 3:4 · 4:3 · 2:3 · 21:9
|
resolution |
-r |
enum |
1K · 2K · 4K
|
count |
-n |
enum |
1 · 2 · 4 · 6 · 8 · 10
|
thinkingBudget |
--thinking |
range | reasoning depth |
imageUrls |
-i |
file | multi-image input for editing |
gen-ai generate -m gemini-2.5-pro-tts -p "Here is your daily briefing." # speech
gen-ai generate -m lyria-3-pro -p "uplifting cinematic orchestral score" # musicSources:
gen-ai models info veo-3.1 --jsonandgen-ai models info gemini-3-pro-image --json.
Picsart CLI & MCP · Repo · AI Playground app
Getting Started
Interfaces
Concepts
Model Reference
Providers
- All providers
- Async
- ByteDance
- Creatify
- ElevenLabs
- Flux (Black Forest Labs)
- Grok (xAI)
- Happy Horse
- HeyGen
- Hunyuan
- Ideogram
- Kling
- LTX (Lightricks)
- Luma
- MiniMax
- OpenAI
- OVI
- Picsart
- Pika
- PixVerse
- Qwen (Alibaba)
- Recraft
- Reve
- Runway
- Seedance
- Seedream
- Topaz
- VEED
- Videography
- Wan (Alibaba)
More