
Add support for auto packing ratio #683

Merged: 26 commits merged into mosaicml:main from the packing-collator branch on Nov 5, 2023
Conversation

@irenedea (Contributor) commented Oct 19, 2023:

Adds 'auto' packing ratio for finetuning.

If 'auto' is specified, packing is profiled for the given dataloader configuration across up to 20 candidate packing ratios, and the highest ratio with zero waste is selected.

If there are multiple ranks, we take the minimum 'auto' packing ratio across all ranks.
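
As a concrete illustration of the profiling loop described above, here is a minimal Python sketch. The helper `profile_waste` and the candidate range are hypothetical stand-ins for the profiling code this PR adds to llmfoundry/data/packing.py, not the actual implementation.

```python
# Sketch of the 'auto' packing-ratio selection described above.
# `profile_waste` is a hypothetical helper: it is assumed to pack a sample of the
# dataloader's batches at the given ratio and return the fraction of discarded tokens.
import torch
import torch.distributed as dist


def select_auto_packing_ratio(profile_waste,
                              min_ratio: float = 1.0,
                              max_ratio: float = 20.0,
                              num_candidates: int = 20) -> float:
    """Scan up to `num_candidates` ratios and keep the highest one with zero waste."""
    step = (max_ratio - min_ratio) / max(num_candidates - 1, 1)
    candidates = [min_ratio + i * step for i in range(num_candidates)]

    best = min_ratio  # a ratio of 1.0 means no packing, which always has zero waste
    for ratio in candidates:
        if profile_waste(ratio) > 0:
            break  # waste only grows with the ratio, so stop at the first nonzero value
        best = ratio

    # With multiple ranks, every rank must use the same ratio; take the minimum so
    # that no rank is forced to discard examples.
    if dist.is_available() and dist.is_initialized():
        ratio_tensor = torch.tensor(best)
        dist.all_reduce(ratio_tensor, op=dist.ReduceOp.MIN)
        best = ratio_tensor.item()
    return best
```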

Testing

  • Manual tests
    • 1 epoch on mosaicml/dolly_hhrlhf, auto packing had 1.3% waste overall
    • Runs without packing are identical before and after the PR
      - before: finetune-auto-pack-test-baseline2-zptNpN , after: finetune-ratio-no-pack-branch-test-8N5I1y
    • A packing ratio of 3.0 produces identical results before and after the PR
      - before: finetune-ratio-3-pack-test-kW8gp6 , after: finetune-ratio-3-pack-with-auto-test-YVP00E
  • Unit tests
    • Tests packing on small tensors
    • Tests packing on small tensors with leftovers and waste calculation
    • Tests auto packing ratio selection for single and multiple ranks
    • Tests auto packing with dataloader
    • Adds 'auto' packing_ratio parameterization to existing dataloader tests (a config sketch follows this list)
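
For reference, a hedged sketch of how a user might request the new behavior in a finetuning dataloader config, expressed as a Python dict. The key layout approximates the example YAMLs touched by this PR; the exact schema should be taken from those files rather than from this snippet.

```python
# Hypothetical finetuning dataloader config requesting 'auto' packing.
# Key names approximate the example YAMLs touched by this PR; verify the exact
# schema against mcli/mcli-llama2-finetune.yaml or the finetune_example configs.
train_loader_cfg = {
    'name': 'finetuning',
    'dataset': {
        'hf_name': 'mosaicml/dolly_hhrlhf',  # dataset used in the manual tests above
        'max_seq_len': 2048,
        'packing_ratio': 'auto',             # the new option added by this PR
        'shuffle': True,
    },
    'drop_last': True,
    'num_workers': 8,
}
```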

@irenedea force-pushed the packing-collator branch 2 times, most recently from 206000d to 36dc1a0, on October 20, 2023 22:42
@irenedea requested review from dakinggg and alextrott16 and removed the review request for dakinggg on October 21, 2023 19:14
@irenedea marked this pull request as ready for review on October 21, 2023 19:15
@alextrott16 (Contributor) left a comment:

LGTM. Thanks for adding this!!
Will hold off on approval so this can get another set of eyes!

Also left a couple comments (nothing major) along with a suggestion for an alternative way to search for the optimal packing ratio. I think re-using my brute force approach might not be the best way to go. If you agree, that will require more of an overhaul of this code, unfortunately. But potentially better to do that than simply inherit my hasty decision :)
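
To illustrate one possible alternative to the brute-force scan (not necessarily the reviewer's exact suggestion): if waste is monotonically non-decreasing in the packing ratio, a binary search over the same candidate ratios finds the highest zero-waste ratio with far fewer profiling passes. `profile_waste` is the same hypothetical helper as in the earlier sketch.

```python
# Illustrative alternative to a brute-force scan: binary search over the candidate
# ratios, assuming waste is monotonically non-decreasing in the packing ratio.
# `candidates` is assumed to be sorted in increasing order with a zero-waste first entry.
def binary_search_packing_ratio(profile_waste, candidates: list) -> float:
    lo, hi = 0, len(candidates) - 1
    best = candidates[0]
    while lo <= hi:
        mid = (lo + hi) // 2
        if profile_waste(candidates[mid]) == 0:
            best = candidates[mid]  # zero waste here: try a larger ratio
            lo = mid + 1
        else:
            hi = mid - 1            # waste appeared: try a smaller ratio
    return best
```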

Review threads (resolved): llmfoundry/data/packing.py, scripts/train/finetune_example/mpt-7b-arc-easy--gpu.yaml, tests/test_dataloader.py
@dakinggg (Collaborator) left a comment:

Looking good!

Two more manual tests I would like to see before merging:
(1) a short training run without packing, before and after this PR. These should be identical.
(2) a short training run with a set packing ratio, before and after this PR. These should be identical.

Review threads (resolved): llmfoundry/data/denoising.py, mcli/mcli-llama2-finetune.yaml, scripts/train/finetune_example/mpt-7b-arc-easy--gpu.yaml, tests/test_dataloader.py, llmfoundry/data/finetuning/dataloader.py, llmfoundry/data/packing.py
@dakinggg (Collaborator) left a comment:

LGTM except for one determinism-related comment

Review threads (resolved): llmfoundry/data/packing.py
Co-authored-by: Daniel King <43149077+dakinggg@users.noreply.github.com>
@dakinggg (Collaborator) left a comment:

LGTM, with one more question relating to the randomness

Review thread (resolved): llmfoundry/data/packing.py
@irenedea merged commit ca8e6b5 into mosaicml:main on Nov 5, 2023
12 checks passed