-
Notifications
You must be signed in to change notification settings - Fork 293
Closed
Labels
enhancementNew feature or requestNew feature or request
Description
Hi,
I am training my model (VQ + other models) with gradient accumulation.
Since the VQ module uses EMA updates at every forward pass,
it is difficult to align them with the gradient updates.
What I want is to accumulate the necessary data during forward passes,
but defer the actual EMA update until a specific step (e.g., when gradients are applied).
I also noticed a parameter, manual_ema_update.
Can this be used to manually trigger EMA updates at the right time?
lucidrains
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request