[llm] Replace `model_name` with required `base_model`, add preset LLM registry, update internal adapter modules #3423

tgaddair · 2023-05-25T05:05:30Z

Summary

The new base_model parameter is also just a string, backed by a anyOf schema that either validates whether the provided string is a Ludwig-supported preset through JSON or an otherwise valid model in the Huggingface repo at runtime (i.e. via validation checks).

It is the first required parameter in Ludwig, outside of input and output features, and it uses a bespoke schema tweak to enforce that. It is also the first parameter to use an anyOf implementation; this is necessary because oneOf validation requires a value to match exactly one option schema (a preset LLM name would partially match both schemas and cause an error).

If the user does choose a preset LLM then after JSON validation but before config object initialization this value will be swapped for the full slash-delimited path defined in the MODEL_PRESETS dictionary.

Usage

base_model: llama-7b

is valid and can be verified by JSON or by the validation checks. By the end of config object initialization it has been replaced by huggyllama/llama-7b (which is defined in MODEL_PRESETS).

base_model: bigscience/bloom-3b

is also valid but can only be verified during config-object initialization by the extra validation checks.

and

base_model:  <anything other than a string>

will cause an immediate validation failure.

Necessary follow-up PRs, prompted by changes here

Looks like the required field is missing for model-level parameters (e.g. input_features). Probably an accidental regression. - Fix: int: Add back required for input and output features to the Ludwig JSON schema #3442
Add official support for required parameters using marshmallow's in-built options
Add support for arbitrary selector or "controller" fields (other than type).
Investigate whether the extra metadata dicts inserted into marshmallow fields are necessary anymore

for more information, see https://pre-commit.ci

github-actions · 2023-05-25T06:16:08Z

Unit Test Results

  6 files ±0   6 suites ±0 1h 10m 6s ⏱️ - 6m 17s
33 tests ±0 29 ✔️ ±0   4 💤 ±0 0 ❌ ±0
99 runs ±0 87 ✔️ ±0 12 💤 ±0 0 ❌ ±0

Results for commit 0eaa15f. ± Comparison against base commit f905b32.

♻️ This comment has been updated with latest results.

ludwig/schema/llms/base_model.py

…into llm-model-class

ludwig/schema/model_types/llm.py

arnavgarg1

LGTM

ludwig/schema/llms/base_model.py

…ate base model usage. still missing: llm.yaml updates

ludwig/config_validation/validation.py

ludwig/schema/llms/base_model.py

ludwig/schema/metadata/configs/llm.yaml

ludwig/schema/model_types/llm.py

ludwig/utils/backward_compatibility.py

tests/ludwig/schema/test_model_config.py

tests/integration_tests/test_preprocessing.py

…lidate inside the field class itself

ludwig/schema/llms/base_model.py

…m-model-class

ludwig/schema/llms/base_model.py

ludwig/schema/model_types/utils.py

tests/ludwig/schema/test_model_config.py

…l-class

tgaddair added 5 commits May 24, 2023 15:36

Refactor

f6dc3db

Added BaseModelConfig

c25daad

Added test

299e3c7

Fix

86bc26f

Fixed

4cf3f33

tgaddair requested a review from arnavgarg1 May 25, 2023 05:05

tgaddair and others added 2 commits May 24, 2023 22:06

Merge branch 'master' into llm-model-class

0f35735

[pre-commit.ci] auto fixes from pre-commit.com hooks

550c8fb

for more information, see https://pre-commit.ci

martindavis reviewed May 25, 2023

View reviewed changes

ludwig/schema/llms/base_model.py Outdated Show resolved Hide resolved

tgaddair added 4 commits May 25, 2023 09:09

More checks, fixed tests

7706dbf

Merge branch 'llm-model-class' of https://github.com/ludwig-ai/ludwig …

c602079

…into llm-model-class

Fixed constant

e592bb5

Adjusted expected impact

f04fd77

ohho reviewed May 25, 2023

View reviewed changes

ludwig/schema/model_types/llm.py Outdated Show resolved Hide resolved

arnavgarg1 approved these changes May 25, 2023

View reviewed changes

tgaddair and others added 4 commits May 25, 2023 14:25

Test LLM serialization

df316de

Try disallow null preset

5e1110e

Replacing use of schema utils to fix allOf schema output.

554fa8f

Use remove_fields

567e473

martindavis reviewed Jun 1, 2023

View reviewed changes

ludwig/schema/llms/base_model.py Outdated Show resolved Hide resolved

ksbrar and others added 9 commits June 2, 2023 00:07

Attempt merge

e374c65

other code

46bff8a

test fixes?

3aae7b5

merge in latest

79e9dae

comment out metadata

c034a11

comment out metadata

af9e4db

fix tuple error

fbe6172

fixes - required base_model, anyof, registry

2011b88

fixes - add post validation checks for huggingface custom models, upd…

b2dd0be

…ate base model usage. still missing: llm.yaml updates

ksbrar requested a review from arnavgarg1 June 23, 2023 17:18

ksbrar reviewed Jun 23, 2023

View reviewed changes

martindavis reviewed Jun 23, 2023

View reviewed changes

tests/ludwig/schema/test_model_config.py Outdated Show resolved Hide resolved

ksbrar added 5 commits June 23, 2023 14:25

update backcompat func

dd0052b

update other llm tests

f0168e8

martin comment - update test

6a2985b

other fixes, self-review

2aef164

Merge remote-tracking branch 'upstream/master' into llm-model-class

644b553

arnavgarg1 reviewed Jun 23, 2023

View reviewed changes

tests/integration_tests/test_preprocessing.py Outdated Show resolved Hide resolved

ksbrar and others added 3 commits June 23, 2023 19:52

update base_model and other constants usage

cc15b8f

more cleanup - deduplicate the huggingface validation logic and conso…

d4d4916

…lidate inside the field class itself

merge in latest

9027793

ksbrar requested a review from justinxzhao June 27, 2023 18:30

justinxzhao approved these changes Jun 27, 2023

View reviewed changes

ludwig/schema/llms/base_model.py Show resolved Hide resolved

ksbrar added 2 commits June 27, 2023 14:49

add docstring for presets dict

3833e3f

Merge branch 'llm-model-class' of github.com:ludwig-ai/ludwig into ll…

ae4efca

…m-model-class

arnavgarg1 reviewed Jun 27, 2023

View reviewed changes

ludwig/schema/llms/base_model.py Outdated Show resolved Hide resolved

arnavgarg1 reviewed Jun 27, 2023

View reviewed changes

ludwig/schema/model_types/utils.py Outdated Show resolved Hide resolved

arnavgarg1 reviewed Jun 27, 2023

View reviewed changes

tests/ludwig/schema/test_model_config.py Outdated Show resolved Hide resolved

ksbrar and others added 9 commits June 27, 2023 15:32

pr comments

c2245de

rm config upgrade fucn

0005801

rm string option from adapter, other adjustments

0cc4b87

Merge remote-tracking branch 'upstream/llm-model-class' into llm-mode…

09ee221

…l-class

fix test

6f32424

update parameter metadata usage

dc7cb30

fix test

96fbbd1

fix test

b0fb4b6

add meta sections of parameter metadata

0eaa15f

ksbrar merged commit 183c3ef into master Jun 29, 2023
16 checks passed

ksbrar deleted the llm-model-class branch June 29, 2023 00:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[llm] Replace `model_name` with required `base_model`, add preset LLM registry, update internal adapter modules #3423

[llm] Replace `model_name` with required `base_model`, add preset LLM registry, update internal adapter modules #3423

tgaddair commented May 25, 2023 •

edited by ksbrar

github-actions bot commented May 25, 2023 •

edited

arnavgarg1 left a comment

[llm] Replace model_name with *required* base_model, add preset LLM registry, update internal adapter modules #3423

[llm] Replace model_name with *required* base_model, add preset LLM registry, update internal adapter modules #3423

Conversation

tgaddair commented May 25, 2023 • edited by ksbrar

Summary

Usage

Necessary follow-up PRs, prompted by changes here

github-actions bot commented May 25, 2023 • edited

Unit Test Results

arnavgarg1 left a comment

Choose a reason for hiding this comment

[llm] Replace `model_name` with required `base_model`, add preset LLM registry, update internal adapter modules #3423

[llm] Replace `model_name` with required `base_model`, add preset LLM registry, update internal adapter modules #3423

tgaddair commented May 25, 2023 •

edited by ksbrar

github-actions bot commented May 25, 2023 •

edited