Estimate adapter memory overhead in choose_num_blocks() #346

justheuristic · 2023-07-12T23:45:48Z

This PR changes how the server determines how many blocks it can serve when it hosts at least one adapter: the adapter parameters are now included in the per-block size that the free GPU memory is divided by.
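To make the description concrete, here is a minimal sketch of the arithmetic in plain Python. It is not the code from this PR: the function name `estimate_num_blocks` and all of its parameters are hypothetical, and it assumes every adapter adds the same number of bytes to each block.

```python
def estimate_num_blocks(free_memory_bytes, block_bytes, adapter_bytes_per_block, num_adapters):
    """Hypothetical helper: how many blocks fit once adapter weights are counted.

    Assumption (not from the PR): each adapter adds the same overhead to every block.
    """
    if block_bytes <= 0:
        raise ValueError("block_bytes must be positive")
    # The per-block footprint is the base block plus all adapters attached to it.
    total_per_block = block_bytes + adapter_bytes_per_block * num_adapters
    # Free memory is divided by the combined size, rounding down.
    return max(0, free_memory_bytes // total_per_block)


# Ignoring the adapters would overestimate how many blocks fit.
without_adapters = estimate_num_blocks(10_000, 900, 50, num_adapters=0)
with_adapters = estimate_num_blocks(10_000, 900, 50, num_adapters=2)
```

With these illustrative numbers, the estimate that ignores adapters allows one more block than actually fits, which is the kind of overcommitment the change is meant to avoid.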

ToDos:

make sure adapters are actually loaded in the same dtype that was used for the estimate
update artek0chumak/bloom-560m-safe-peft to disable dropout
create GitHub issues from the comments on Support peft LoRA adapters #335

borzunov · 2023-07-13T16:36:00Z

src/petals/utils/peft.py

+                block, block_index=0, adapter_name=adapter, peft_config=peft_config, peft_state_dict=peft_state_dict
+            )
+        adapter_parameters = sum(p.numel() for p in block.parameters()) - base_block_parameters
+    bytes_per_parameter = torch.finfo(resolve_block_dtype(block_config, torch_dtype)).bits / 8


Don't adapters have a different dtype from the base block?

They should have the same dtype; otherwise the forward pass would raise an error.
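The snippet under review counts adapter parameters as the difference in parameter count before and after loading the adapter, then converts that count to bytes using the bit width of the block dtype. A rough sketch of that conversion in plain Python follows. The names `DTYPE_BITS`, `lora_parameters`, and `adapter_bytes` are hypothetical, the table stands in for `torch.finfo(dtype).bits`, and the parameter formula assumes a standard LoRA pair of rank-`r` matrices on a single linear layer.

```python
# Bit widths of common floating-point dtypes (stand-in for torch.finfo(dtype).bits).
DTYPE_BITS = {"float32": 32, "float16": 16, "bfloat16": 16}


def lora_parameters(in_features, out_features, rank):
    """Parameters added by one LoRA pair: A is (rank x in), B is (out x rank)."""
    return rank * (in_features + out_features)


def adapter_bytes(num_parameters, dtype_name):
    """Memory taken by the adapter if it is stored in the given dtype."""
    return num_parameters * DTYPE_BITS[dtype_name] // 8


extra_params = lora_parameters(in_features=1024, out_features=1024, rank=8)
fp16_bytes = adapter_bytes(extra_params, "float16")
fp32_bytes = adapter_bytes(extra_params, "float32")
```

The same adapter takes twice as much memory in float32 as in float16, so the estimate only holds if the adapter is really loaded in the dtype assumed here.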

src/petals/server/server.py

src/petals/utils/peft.py

Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>

estimate adapter memory overhead

86d1afb

justheuristic requested a review from borzunov July 13, 2023 15:29

borzunov reviewed Jul 13, 2023


src/petals/server/server.py (outdated, resolved)

borzunov reviewed Jul 13, 2023


src/petals/utils/peft.py (resolved)

borzunov changed the title ~~Estimate adapter memory overhead for the purpose of num_blocks~~ Estimate adapter memory overhead in choose_num_blocks() Jul 13, 2023

justheuristic and others added 5 commits July 14, 2023 00:18

Update src/petals/server/server.py

d18e2d1

Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>

Update src/petals/utils/peft.py

58a92cb

Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>

Merge branch 'main' into fix-num-blocks

65675d0

black

832cb51

review

49b6cc8

justheuristic merged commit 010857a into main Jul 13, 2023
7 checks passed

justheuristic deleted the fix-num-blocks branch July 13, 2023 22:03

borzunov mentioned this pull request Jul 14, 2023

Fix bugs in _choose_num_blocks() added in #346 #354

Merged

borzunov added a commit that referenced this pull request Jul 14, 2023

Fix bugs in _choose_num_blocks() added in #346 (#354)

9703358
