Can we just use the Unsloth gradient checkpointing by uncommenting this line? #30
Comments
Yes, you can. It produces the same loss, but it did not enable a larger batch size in my experiments.
I am getting this error, though, when I do this. Any idea why?
I need it to reduce the memory footprint, not to increase the batch size.
A question: there is an `assert gradient_checkpointing_kwargs == None` that throws an error. Do I need to set `gradient_checkpointing_kwargs` to something, or do I need to comment out that line?
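If that assert sits in the patched checkpointing path, a minimal sketch of a possible workaround (an assumption on my part, not confirmed by the maintainers) is to enable checkpointing without passing any kwargs, so the default `None` satisfies the assert:

```python
# Sketch, assuming the patched path asserts gradient_checkpointing_kwargs is None.
# Enable checkpointing with no extra kwargs instead of e.g. {"use_reentrant": False}:
model.gradient_checkpointing_enable()  # omit the gradient_checkpointing_kwargs argument
```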
I was not clear about how to use the code.
https://github.com/jzhang38/EasyContext/blob/main/train.py#L28 — by uncommenting this line, can we enable the Unsloth code?
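For reference, a minimal sketch of what uncommenting that line amounts to; the import path and function name are taken from the linked repo and should be verified against the version you are on:

```python
# train.py (sketch): enable Unsloth's offloaded gradient checkpointing
# by applying the monkey patch before the model is constructed.
from easy_context import apply_unsloth_offloaded_gradient_checkpoint_monkey_patch

# This swaps in Unsloth's checkpointing, which offloads activations to CPU
# to cut GPU memory use (it does not by itself raise the usable batch size).
apply_unsloth_offloaded_gradient_checkpoint_monkey_patch()
```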