
Enable evaluation with LLMs <7B #3478

Merged · 47 commits from deepspeed-eval into master on Jul 29, 2023
Conversation

geoffreyangus (Collaborator):

No description provided.

arnavgarg1 and others added 29 commits July 14, 2023 14:35
…e model weights are re-registered after training. TODO: fix evaluation
github-actions bot commented Jul 25, 2023:

Unit Test Results

 6 files   +1        6 suites   +1       58m 23s ⏱️ −20m 15s
34 tests   −2,740   29 ✔️ −2,727    5 💤 −7    0 ❌ −6
88 runs    −2,721   72 ✔️ −2,713   16 💤 −2    0 ❌ −6

Results for commit 3f0bc9f, compared against base commit 539e8e0.

♻️ This comment has been updated with latest results.

ludwig/api.py (outdated)

```diff
@@ -1567,7 +1568,8 @@ def load(
         # Upgrades deprecated fields and adds new required fields in case the config loaded from disk is old.
         config_obj = ModelConfig.from_dict(config)

-        if backend_param is None and "backend" in config:
+        # Ensure that the original backend is used if it was specified in the config and user requests it
+        if use_backend_from_config or (backend_param is None and "backend" in config):
```
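For context, a minimal sketch of the backend-resolution logic this diff touches (illustrative only: `resolve_backend` is a hypothetical helper, the real logic lives inline in `LudwigModel.load()`; only `initialize_backend` and the condition itself come from the source):

```python
from ludwig.backend import initialize_backend

def resolve_backend(backend_param, config, use_backend_from_config=False):
    # Hypothetical helper distilled from the diff above.
    if use_backend_from_config or (backend_param is None and "backend" in config):
        # Re-create the backend recorded in the saved model config
        # (e.g. a ray backend used at training time).
        return initialize_backend(config["backend"])
    # Otherwise honor whatever the caller passed in; None falls back to
    # the default local backend.
    return initialize_backend(backend_param)
```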
Collaborator:
Hmm, is use_backend_from_config needed here? Seems like the same effect can be achieved by letting backend param be None. Am I missing something?

geoffreyangus (Collaborator, Author) commented Jul 27, 2023:
Yeah, so the issue comes from the evaluate CLI:

ludwig/evaluate.py, lines 267 to 274 in adc82cc:

```python
args.backend = initialize_backend(args.backend)
if args.backend.is_coordinator():
    print_ludwig("Evaluate", LUDWIG_VERSION)
    logger.info(f"Dataset path: {args.dataset}")
    logger.info(f"Model path: {args.model_path}")
    logger.info("")
evaluate_cli(**vars(args))
```

The backend is initialized fresh when running ludwig evaluate. In doing this, the backend config is entirely ignored. This made it difficult to iterate quickly on this PR (my strategy was to run ludwig train once, then run ludwig evaluate to debug batch evaluation and prediction).
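To make the failure mode concrete, here's a rough sketch of that code path (illustrative, not the exact Ludwig source; it assumes only the `backend` keyword on `LudwigModel.load()`, which the diff above operates on):

```python
from ludwig.api import LudwigModel
from ludwig.backend import initialize_backend

def cli_evaluate(args):
    # The CLI eagerly builds a backend from the CLI flag (usually None,
    # which yields a fresh local backend)...
    backend = initialize_backend(args.backend)
    # ...and hands that concrete object down to load(), so the
    # `backend_param is None` fallback never fires and the backend stored
    # in the model config (e.g. ray + deepspeed) is silently ignored.
    model = LudwigModel.load(args.model_path, backend=backend)
    return model.evaluate(dataset=args.dataset)
```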

Collaborator:
Okay, going to push a quick change to fix the issue here, by not plumbing through the backend used in the CLI evaluate code path.
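In code terms, the fix sketched above amounts to something like the following (again illustrative, not the actual commit; the model path and dataset are placeholders):

```python
from ludwig.api import LudwigModel

# With no backend plumbed in from the CLI, load() sees backend=None and
# falls back to the backend recorded in the saved model config.
model = LudwigModel.load("results/experiment_run/model", backend=None)
results = model.evaluate(dataset="eval.csv")
```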

Resolved review threads: ludwig/backend/ray.py (outdated, two threads) · ludwig/distributed/deepspeed.py
tgaddair (Collaborator) left a comment:
LGTM!

tgaddair merged commit 8f5cec6 into master on Jul 29, 2023 · 16 checks passed
tgaddair deleted the deepspeed-eval branch on July 29, 2023 at 19:44.
Development

Successfully merging this pull request may close these issues:

- Testing new llm feature with mosaicml/mpt-7b-8k, bigscience/bloomz-3b and openlm-research/open_llama_3b_v2
3 participants