Thank you for your work and the well-organized repo. Reading through the paper, I was unable to locate any ablations on the effect of batch size (or effective batch size) on generation performance. Could you share any insight into how batch size affects the quality of video generation? In particular, if the effective batch size is increased through gradient accumulation steps (sketched below), would you also increase the total number of training iterations to compensate?
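For reference, by "effective batch size through gradient accumulation" I mean the standard pattern below. This is a minimal PyTorch sketch, not the repo's actual training loop; `model`, `optimizer`, `loader`, and `accum_steps` are placeholder names.

```python
import torch

# Illustrative placeholders, not from this repo:
# model, optimizer, loader are assumed to be already constructed.
accum_steps = 4  # effective batch size = per-step batch size * accum_steps

optimizer.zero_grad()
for step, batch in enumerate(loader):
    loss = model(batch)              # forward pass returning the training loss
    (loss / accum_steps).backward()  # scale so accumulated grads average over the effective batch

    if (step + 1) % accum_steps == 0:
        optimizer.step()             # one optimizer update per effective batch
        optimizer.zero_grad()
```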
Intuitively, a larger batch size should correlate with better performance (as suggested by the efficacy of image-video joint training), but I was curious whether the benefits taper off at this model size, since the full pipeline is already expensive to train, especially if training has to be scaled further to accommodate gradient accumulation steps.
Thanks.
Thanks for your interest. I also think a larger batch size leads to better performance, but in my experience so far, gradient accumulation does not provide significant gains for text-to-video tasks.