torch.Tensor for optimizer parameters #127699

Open
ad8e opened this issue Jun 2, 2024 · 0 comments
Labels
module: optimizer (Related to torch.optim) · needs design · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

ad8e (Contributor) commented Jun 2, 2024

🚀 The feature, motivation and pitch

The LR already accepts a torch.Tensor; see #120934 (comment).

But the other hyperparameters don't, such as AdamW's beta2.

This causes torch.compile to fail (fall back to eager) when I compile an optimizer whose hyperparameters change during training, since torch.compile does not allow non-constant floats.
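For illustration, a minimal sketch of the failure mode, assuming a recent PyTorch where lr accepts a Tensor (the specific values here are arbitrary):

```python
import torch

model = torch.nn.Linear(8, 8)
# lr as a 0-dim Tensor is supported (see #120934), so later LR changes
# don't bake a new constant into the compiled graph:
opt = torch.optim.AdamW(model.parameters(), lr=torch.tensor(1e-3))

# produce gradients so opt.step() has work to do
model(torch.randn(4, 8)).sum().backward()

@torch.compile
def step():
    opt.step()

step()

# betas must be plain floats; mutating them swaps a traced constant, so the
# next call recompiles, and after enough changes torch.compile gives up and
# falls back to eager:
for group in opt.param_groups:
    group["betas"] = (0.9, 0.995)
step()
```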

Allowing the other (non-LR) hyperparameters to change would be helpful because:

  1. beta1 (momentum) needs a scheduler as the eigenvalue distribution changes during training.
  2. beta2 needs a scheduler when finetuning a model if the optimizer states aren't loaded or the data distribution changes; otherwise a long warmup is required (~200–1000 steps, depending on beta2). I'm currently having problems with DCP optimizer saving, so this matters to me. A sketch of such a warmup follows this list.
  3. weight decay needs a scheduler if the math is done correctly.
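For concreteness, here is a hypothetical beta2 warmup of the kind described in point 2. The helper name, schedule shape, and step counts are illustrative, not a PyTorch API; the point is that today it must be done with plain-float mutation of param_groups:

```python
import torch

def beta2_warmup(step, start=0.9, target=0.999, warmup_steps=500):
    """Hypothetical linear ramp of beta2 from `start` to `target`."""
    if step >= warmup_steps:
        return target
    return start + (step / warmup_steps) * (target - start)

model = torch.nn.Linear(8, 8)
opt = torch.optim.AdamW(model.parameters())
for step in range(1000):
    for group in opt.param_groups:
        beta1, _ = group["betas"]
        # a plain-float update like this is exactly what breaks torch.compile
        group["betas"] = (beta1, beta2_warmup(step))
    # ... forward / backward / opt.step() / opt.zero_grad() ...
```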

Alternatives

Either torch.compile supporting changing floats or AdamW accepting torch.Tensor hyperparameters would work for me.
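A sketch of what the second alternative might look like (not current PyTorch semantics; a Tensor-valued beta2 is precisely what is being requested here):

```python
import torch

model = torch.nn.Linear(8, 8)
beta2 = torch.tensor(0.999)  # 0-dim tensor a scheduler could mutate in place

# Hypothetical: AdamW accepting a Tensor beta2, mirroring what lr already allows.
opt = torch.optim.AdamW(model.parameters(),
                        lr=torch.tensor(1e-3),
                        betas=(0.9, beta2))

# A scheduler would then update the value without changing any traced constant:
beta2.fill_(0.995)
```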

Additional context

No response

cc @vincentqb @jbschlosser @albanD @janeyx99 @crcrpar

cpuhrsch added the module: optimizer and triaged labels Jun 3, 2024