Training and Inference Hardware.
Specifically, a GPU with 8 GB of VRAM can generate up to 3 minutes of video in a single inference.
Hi there,
First of all, splendid work! I really appreciate the generalizability of this model and am looking forward to the code release.
I have a question regarding the paper's mention that a single 8GB GPU can produce a 3-minute video. Is it possible to make this process real-time or to stream the generated video so it appears as a real-time conversation?
Currently, motion generation is a long-running process (taking several seconds to several minutes), so it cannot achieve real-time performance like VASA-1. Streaming in short segments is also problematic: if the segments are too short, the transitions between them lack smoothness.
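To illustrate the trade-off described above, here is a minimal sketch of segment-based streaming with an overlap crossfade between consecutive segments. This is not the authors' pipeline: `generate_segment` is a hypothetical stand-in for the slow model inference, and the linear crossfade is just one simple way to smooth the seam between segments; shorter segments reduce latency but leave less context to blend.

```python
import numpy as np

def generate_segment(start_frame: int, num_frames: int, dim: int = 8) -> np.ndarray:
    """Stand-in for the (slow) motion generator: returns one latent per frame.

    In a real pipeline this would be the model's long-running inference call.
    """
    t = np.arange(start_frame, start_frame + num_frames)[:, None]
    return np.sin(t / 10.0 + np.arange(dim)[None, :])

def stream_with_overlap(total_frames: int, seg_len: int = 32, overlap: int = 8):
    """Yield frames segment by segment, crossfading the overlapping region
    so the seams between consecutive segments stay smooth."""
    assert 0 < overlap < seg_len
    step = seg_len - overlap
    prev_tail = None  # last `overlap` frames of the previous segment
    start = 0
    while start < total_frames:
        seg = generate_segment(start, seg_len)
        if prev_tail is not None:
            # Linear crossfade: blend the previous tail into the new head.
            w = np.linspace(0.0, 1.0, overlap)[:, None]
            seg[:overlap] = (1.0 - w) * prev_tail + w * seg[:overlap]
        emit = min(step, total_frames - start)
        for frame in seg[:emit]:
            yield frame
        prev_tail = seg[step:seg_len]  # keep the tail for the next blend
        start += step

frames = list(stream_with_overlap(100))
print(len(frames))  # 100 frames streamed out in 24-frame steps
```

The latency per emitted chunk is one segment's inference time, so with multi-second inference this still falls short of real-time conversation; it only shows why very short segments (small `seg_len`, small `overlap`) leave little material for smoothing.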