Adding different device initialization to eval #466

bcui19 · 2023-07-18T18:20:07Z

Allow mixed initialization for eval.py in LLM-Foundry. Run on r1z1 here:

mpt30b-chat-hf-eval-r1z1-hs6a7u

scripts/eval/eval.py

dakinggg

lgtm

panchalhp-db · 2023-07-20T03:09:14Z

llmfoundry/utils/config_utils.py

+                "Reverting to `cfg.model.init_device='cpu'`.")
+            model_cfg.init_device = 'cpu'
+        if model_cfg.init_device == 'meta':
+            init_context = init_empty_weights()


Hey there! I noticed that when running the quick start example (https://docs.mosaicml.com/projects/mcli/en/latest/guides/first_llm.html), the run fails with a NameError because init_empty_weights is not defined:

----------Begin global rank 7 STDERR----------
Traceback (most recent call last)

/llm-foundry/scripts/train/train.py:328 in <module>

  325         yaml_cfg = om.load(f)
  326     cli_cfg = om.from_cli(args_list)
  327     cfg = om.merge(yaml_cfg, cli_cfg)
❱ 328     main(cfg)
  329

/llm-foundry/scripts/train/train.py:203 in main

  200         cfg.pop('fsdp_config')
  201         fsdp_config = None
  202
❱ 203     init_context = process_init_device(cfg.model, fsdp_config)
  204
  205     # build tokenizer
  206     tokenizer = build_tokenizer(cfg.tokenizer)

/llm-foundry/llmfoundry/utils/config_utils.py:70 in process_init_device

   67                 "Reverting to `cfg.model.init_device='cpu'`.")
   68             model_cfg.init_device = 'cpu'
   69         if model_cfg.init_device == 'meta':
❱  70             init_context = init_empty_weights()
   71         if model_cfg.init_device == 'mixed':
   72             if fsdp_config is None:
   73                 raise NotImplementedError(

NameError: name 'init_empty_weights' is not defined

Just wanted to flag it :)
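
For context, a NameError like the one above usually means the name was never imported into the module where it is used. The thread does not confirm the root cause or the eventual fix, so the following is only a minimal sketch of one way to guard such a code path. It assumes `init_empty_weights` is meant to come from Hugging Face `accelerate`; `pick_init_context` is a hypothetical helper written for illustration and is not the llm-foundry `process_init_device` function.

```python
from contextlib import nullcontext

try:
    # Assumption: the missing name is accelerate's context manager for
    # creating models on the meta device.
    from accelerate import init_empty_weights
except ImportError:
    # Keep the module importable even when accelerate is unavailable.
    init_empty_weights = None


def pick_init_context(init_device, fsdp_config=None):
    """Hypothetical helper: choose a context manager for model construction."""
    if init_device == 'meta':
        if init_empty_weights is None:
            # Fail with an actionable message instead of a bare NameError.
            raise ImportError(
                "init_device='meta' requires the `accelerate` package.")
        return init_empty_weights()
    if init_device == 'mixed' and fsdp_config is None:
        # Mirrors the NotImplementedError branch visible in the traceback.
        raise NotImplementedError(
            "init_device='mixed' requires an FSDP config.")
    # Any other device: build the model normally.
    return nullcontext()


# Usage: the returned object wraps model construction.
with pick_init_context('cpu'):
    pass  # model construction would go here
```

With this shape, a missing dependency surfaces as an explicit ImportError only when `init_device='meta'` is actually requested, rather than as a NameError at call time.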

Adding different device initialization to eval

f0e2bf0

bcui19 requested review from dakinggg and bmosaicml July 18, 2023 18:20

dakinggg reviewed Jul 18, 2023

Unifying process init device

e0934b2

bcui19 requested a review from dakinggg July 19, 2023 16:59

dakinggg approved these changes Jul 19, 2023

bcui19 merged commit bb0ad42 into main Jul 19, 2023
10 checks passed

panchalhp-db reviewed Jul 20, 2023

dakinggg deleted the fix_mixed_init_eval branch October 11, 2023 21:30
