Add PagedAdamW32bit #900

xzuyn · 2023-10-27T11:09:23Z

No description provided.

kohya-ss · 2023-10-29T06:01:51Z

Thank you for this! I didn't know PagedAdamW32bit.

oliverban · 2023-11-17T19:11:05Z

This is great, but isn't the 32bit part still expensive in terms of memory? Or is this different for images? I really like this way of optimization but if we can get a PagedAdamW8Bit, wouldn't that be even better? I read about that, but they were talking about LLMs.

xzuyn · 2023-11-18T12:26:32Z

> This is great, but isn't the 32bit part still expensive in terms of memory?

In terms of memory usage it's more expensive than using 8bit, but since it's paged, that memory isn't eating into GPU VRAM, only system RAM. So if you have the RAM for it, using 32bit gives you higher precision. No idea how much that translates into real-world quality though.
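To put rough numbers on the 32bit vs 8bit difference, here is a back-of-the-envelope sketch. It assumes AdamW keeps two state values per trainable parameter (the first and second moment estimates) and ignores the small extra overhead that 8bit quantization constants add. The function name and the parameter count are made up for illustration, and the sketch only sizes the state; whether it ends up in VRAM or system RAM is decided by paging at runtime.

```python
def adamw_state_bytes(num_params: int, bits_per_state: int) -> int:
    """Approximate size of AdamW optimizer state in bytes.

    Assumes two state tensors per parameter (first and second moments)
    and ignores quantization metadata for low-bit variants.
    """
    states_per_param = 2
    return num_params * states_per_param * bits_per_state // 8


def to_gib(num_bytes: int) -> float:
    """Convert bytes to GiB for readability."""
    return num_bytes / 2**30


# Hypothetical trainable parameter count, chosen only for the example.
num_params = 1_000_000_000

state_32bit = adamw_state_bytes(num_params, 32)
state_8bit = adamw_state_bytes(num_params, 8)

print(f"32bit state: {to_gib(state_32bit):.2f} GiB")
print(f"8bit state:  {to_gib(state_8bit):.2f} GiB")
print(f"ratio:       {state_32bit / state_8bit:.0f}x")
```

Under these assumptions the 32bit state is four times the size of the 8bit state, so the question is mostly whether that extra memory has somewhere to go.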

> I really like this way of optimization but if we can get a PagedAdamW8Bit, wouldn't that be even better?

It's already implemented. I added PagedAdamW32bit here since only PagedAdamW8bit was added.

There may also be a 16bit paged version just named PagedAdamW, but that may just be an alias for PagedAdamW32bit or PagedAdamW8bit. I'll have to look into this later.

edit: From what I can tell, PagedAdamW is like AdamW, so I will do another PR shortly after I confirm it runs.
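For anyone wondering how the three names relate, here is a minimal sketch of selecting a paged AdamW variant by name. It assumes the `PagedAdamW`, `PagedAdamW8bit` and `PagedAdamW32bit` classes exposed under `bitsandbytes.optim`; the helper names, the lookup table and the default learning rate are hypothetical and are not the code from this PR.

```python
# Hypothetical lookup from a user-supplied optimizer name (case-insensitive)
# to the bitsandbytes class name it is assumed to correspond to.
_PAGED_ADAMW_CLASSES = {
    "pagedadamw": "PagedAdamW",
    "pagedadamw8bit": "PagedAdamW8bit",
    "pagedadamw32bit": "PagedAdamW32bit",
}


def resolve_paged_adamw(optimizer_type: str) -> str:
    """Return the bitsandbytes class name for a paged AdamW variant."""
    key = optimizer_type.strip().lower()
    if key not in _PAGED_ADAMW_CLASSES:
        raise ValueError(f"Unknown paged AdamW variant: {optimizer_type!r}")
    return _PAGED_ADAMW_CLASSES[key]


def build_paged_adamw(optimizer_type: str, params, lr: float = 1e-4, **kwargs):
    """Instantiate the selected optimizer.

    bitsandbytes is imported lazily so the name lookup above still works
    on machines where it is not installed.
    """
    import bitsandbytes as bnb

    optimizer_class = getattr(bnb.optim, resolve_paged_adamw(optimizer_type))
    return optimizer_class(params, lr=lr, **kwargs)
```

Usage would look something like `build_paged_adamw("PagedAdamW32bit", model.parameters(), lr=1e-4)`, where `model` is whatever network is being trained.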

oliverban · 2023-11-21T12:33:30Z

> > This is great, but isn't the 32bit part still expensive in terms of memory?
>
> In terms of memory usage it's more expensive than using 8bit, but since it's paged, that memory isn't eating into GPU VRAM, only system RAM. So if you have the RAM for it, using 32bit gives you higher precision. No idea how much that translates into real-world quality though.
>
> > I really like this way of optimization but if we can get a PagedAdamW8Bit, wouldn't that be even better?
>
> It's already implemented. I added PagedAdamW32bit here since only PagedAdamW8bit was added.
>
> There may also be a 16bit paged version just named PagedAdamW, but that may just be an alias for PagedAdamW32bit or PagedAdamW8bit. I'll have to look into this later.
>
> edit: From what I can tell, PagedAdamW is like AdamW, so I will do another PR shortly after I confirm it runs.

Great news! I have 64 GB of system RAM, so it should easily fit! Thanks for looking into this! <3

v21.6.4

Update train_util.py

d8f6867

kohya-ss changed the base branch from main to dev October 29, 2023 06:01

kohya-ss merged commit a9ed4ed into kohya-ss:dev Oct 29, 2023
1 check passed

wkpark pushed a commit to wkpark/sd-scripts that referenced this pull request Feb 27, 2024

Merge pull request kohya-ss#900 from bmaltais/dev2

47a1a4f

v21.6.4
