Step-Video-T2V

New txt2vid project:

> A Step-Video-T2V, a state-of-the-art (SoTA) text-to-video pre-trained model with 30 billion parameters and the capability to generate videos up to 204 frames. To enhance both training and inference efficiency, we propose a deep compression VAE for videos, achieving 16x16 spatial and 8x temporal compression ratios. Direct Preference Optimization (DPO) is applied in the final stage to further enhance the visual quality of the generated videos. Step-Video-T2V's performance is evaluated on a novel video generation benchmark, Step-Video-T2V-Eval, demonstrating its SoTA text-to-video quality compared to both open-source and commercial engines.

https://github.com/stepfun-ai/Step-Video-T2V

https://github.com/user-attachments/assets/66f29973-b544-4867-9ce9-b530ffa9a80f

Model | height/width/frame | Peak GPU Memory | 50 steps w flash-attn | 50 steps w/o flash-attn
-- | -- | -- | -- | --
Step-Video-T2V | 544px992px204f | 77.64 GB | 743 s | 1232 s
Step-Video-T2V | 544px992px136f | 72.48 GB | 408 s | 605 s

Models | 🤗Huggingface | 🤖Modelscope
-- | -- | --
Step-Video-T2V | [download](https://huggingface.co/stepfun-ai/stepvideo-t2v) | [download](https://www.modelscope.cn/models/stepfun-ai/stepvideo-t2v)
Step-Video-T2V-Turbo (Inference Step Distillation) | [download](https://huggingface.co/stepfun-ai/stepvideo-t2v-turbo) | [download](https://www.modelscope.cn/models/stepfun-ai/stepvideo-t2v-turbo)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Step-Video-T2V #10812

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Model	height/width/frame	Peak GPU Memory	50 steps w flash-attn	50 steps w/o flash-attn
Step-Video-T2V	544px992px204f	77.64 GB	743 s	1232 s
Step-Video-T2V	544px992px136f	72.48 GB	408 s	605 s

Models	🤗Huggingface	🤖Modelscope
Step-Video-T2V	download	download
Step-Video-T2V-Turbo (Inference Step Distillation)	download	download

Step-Video-T2V #10812

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions