Using torch_qaic gradScaler and making lora_dropout=0.05 #320

quic-swatia · 2025-03-18T11:20:48Z

In case of finetuning on qaic, torch_qaic gradScaler will be used
Moving back to lora_dropout = 0.05 on ML Framework team's ask.

QEfficient/finetune/utils/train_utils.py

vbaddi · 2025-03-19T07:57:13Z

QEfficient/finetune/utils/train_utils.py

    if train_config.grad_scaler:
-        scaler = GradScaler()
+        if device.startswith("qaic"):
+            scaler = qaic_GradScaler()


… 2. Moving back to lora_dropout = 0.05 on ML Framework team's ask. Signed-off-by: Swati Allabadi <quic-swatia@quicinc.com>

1. In case of finetuning on qaic, torch_qaic gradScaler will be used 2. Moving back to lora_dropout = 0.05 on ML Framework team's ask. Signed-off-by: Swati Allabadi <quic-swatia@quicinc.com> Co-authored-by: Swati Allabadi <quic-swatia@quicinc.com>

1. In case of finetuning on qaic, torch_qaic gradScaler will be used 2. Moving back to lora_dropout = 0.05 on ML Framework team's ask. Signed-off-by: Swati Allabadi <quic-swatia@quicinc.com> Co-authored-by: Swati Allabadi <quic-swatia@quicinc.com> Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com>

1. In case of finetuning on qaic, torch_qaic gradScaler will be used 2. Moving back to lora_dropout = 0.05 on ML Framework team's ask. Signed-off-by: Swati Allabadi <quic-swatia@quicinc.com> Co-authored-by: Swati Allabadi <quic-swatia@quicinc.com>

1. In case of finetuning on qaic, torch_qaic gradScaler will be used 2. Moving back to lora_dropout = 0.05 on ML Framework team's ask. Signed-off-by: Swati Allabadi <quic-swatia@quicinc.com> Co-authored-by: Swati Allabadi <quic-swatia@quicinc.com> Signed-off-by: eplatero <quic_eplatero@quicinc.com>

quic-swatia requested a review from vbaddi March 18, 2025 11:20

quic-swatia self-assigned this Mar 18, 2025

quic-swatia requested review from ochougul and quic-rishinr as code owners March 18, 2025 11:20

quic-swatia force-pushed the gradScaler branch 2 times, most recently from 49ed776 to ea3acc7 Compare March 18, 2025 20:07

vbaddi requested changes Mar 19, 2025

View reviewed changes

1. In case of finetuning on qaic, torch_qaic gradScaler will be used.…

376c6a6

… 2. Moving back to lora_dropout = 0.05 on ML Framework team's ask. Signed-off-by: Swati Allabadi <quic-swatia@quicinc.com>

quic-swatia force-pushed the gradScaler branch from ea3acc7 to 376c6a6 Compare March 19, 2025 08:22

vbaddi approved these changes Mar 19, 2025

View reviewed changes

quic-swatia merged commit 2f50ce3 into quic:main Mar 19, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Using torch_qaic gradScaler and making lora_dropout=0.05 #320

Using torch_qaic gradScaler and making lora_dropout=0.05 #320

Uh oh!

quic-swatia commented Mar 18, 2025

Uh oh!

Uh oh!

vbaddi Mar 19, 2025

Uh oh!

quic-swatia Mar 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Using torch_qaic gradScaler and making lora_dropout=0.05 #320

Using torch_qaic gradScaler and making lora_dropout=0.05 #320

Uh oh!

Conversation

quic-swatia commented Mar 18, 2025

Uh oh!

Uh oh!

vbaddi Mar 19, 2025

Choose a reason for hiding this comment

Uh oh!

quic-swatia Mar 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants