refactor: store l1 and l2 as model state and not command line arguments #3654

jackgerrits · 2022-01-27T15:04:43Z

This change is important as these options being serialized as a float or double matters as enough precision is lost to affect save_resume model in the float case. Moving this to the model resolves the todo, makes it consistent with all other state and removes the need for options to understand double precision.

This change automatically supports "upgrading" models. If an old keep style model is loaded it will update the state accordingly and on subsequent save/load will use the save load data section

When loading value is

If command line value is non-default, use that value
Else if model is non-default, use that value
Else use default

Must merge after #3656

jackgerrits · 2022-01-27T15:05:03Z

vowpalwabbit/vw_versions.h

@@ -59,5 +59,9 @@ constexpr VW::version_struct VERSION_PASS_UINT64{8, 3, 3};

 /// Added serialized seen min and max labels in the --active reduction
 constexpr VW::version_struct VERSION_FILE_WITH_ACTIVE_SEEN_LABELS{9, 0, 0};
+
+/// Moved option values from command line to model data
+constexpr VW::version_struct VERSION_FILE_WITH_L1_AND_L2_STATE_IN_MODEL_DATA{8, 11, 0};


Decide if okay for 9 or later and update accordingly

We should make sure we decide this before merging the PR, since it could be a little awkward to have to change this once it is in the repo, and it could cause odd situations where a source-build does the wrong thing.

The source build should do the right thing according to version.txt. It is just whether or not this functionality is "turned on"

Will update this to 9.0.0 prior to merging

…rrits/vowpal_wabbit into jagerrit/l1l2_state_model

test/train-sets/ref/automl_readable.txt

rajan-chari · 2022-01-27T15:55:01Z

Wondering about a RunTest (or unit test) to verify things since this is a gd change. Are there some tests the cover save/resume with non trivial l1, l2 values?

rajan-chari · 2022-01-27T15:58:40Z

vowpalwabbit/gd.cc

@@ -1211,12 +1230,10 @@ base_learner* setup(VW::setup_base_i& stack_builder)
               .default_value(0.f)
               .help("Degree of l2 regularization applied to activated sparse parameters"))
      .add(make_option("l1_state", all.sd->gravity)
-               .keep(all.save_resume)
-               .default_value(0.)
+               .default_value(L1_STATE_DEFAULT)


If the model file does not have l1,l2 (i.e. older model), and we don't pick up l1, l2 values from command line will this be breaking change?

No, if neither the model file or command line contains l1/l2 information then the default value will be used as before. There is no change in behavior

jackgerrits · 2022-01-27T16:04:38Z

Test 1 in slow.vwtest.json handles saving and loading models with --l1 and --l2 which causes l1_state and l2_state to be modified. (

vowpal_wabbit/test/save_resume_test.py

Line 190 in 2f189ca

errors += do_test(filename, "--l1 1e-04")

) This is how I found the issue in the first place.

I can add more tests though. I'll test the override behavior.

jackgerrits and others added 2 commits January 27, 2022 10:04

refactor: store l1 and l2 as model state and not command line arguments

46b76a6

Merge branch 'master' into jagerrit/l1l2_state_model

a24e3f8

jackgerrits commented Jan 27, 2022

View reviewed changes

jackgerrits added 3 commits January 27, 2022 10:22

conditionally read the value

18d4e11

Merge branch 'jagerrit/l1l2_state_model' of https://github.com/jackge…

3d3f89f

…rrits/vowpal_wabbit into jagerrit/l1l2_state_model

Fix load semantics

880153a

jackgerrits force-pushed the jagerrit/l1l2_state_model branch from 36e7563 to 880153a Compare January 27, 2022 15:36

rajan-chari reviewed Jan 27, 2022

View reviewed changes

test/train-sets/ref/automl_readable.txt Outdated Show resolved Hide resolved

rajan-chari reviewed Jan 27, 2022

View reviewed changes

jackgerrits and others added 7 commits January 27, 2022 11:14

add tests exercising overriding

9bcf0c7

Add more tests

f5cb01d

Skip tests

d9b1bbb

update version

82aafb9

Merge branch 'master' into jagerrit/l1l2_state_model

90f32fe

update tests

5772934

Merge branch 'master' into jagerrit/l1l2_state_model

0145a13

bassmang approved these changes Jan 28, 2022

View reviewed changes

jackgerrits merged commit fe0f6fd into VowpalWabbit:master Jan 28, 2022

jackgerrits deleted the jagerrit/l1l2_state_model branch January 28, 2022 18:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: store l1 and l2 as model state and not command line arguments #3654

refactor: store l1 and l2 as model state and not command line arguments #3654

jackgerrits commented Jan 27, 2022 •

edited

Loading

jackgerrits Jan 27, 2022

lokitoth Jan 27, 2022

jackgerrits Jan 27, 2022

jackgerrits Jan 27, 2022

rajan-chari commented Jan 27, 2022

rajan-chari Jan 27, 2022

jackgerrits Jan 27, 2022

jackgerrits commented Jan 27, 2022

refactor: store l1 and l2 as model state and not command line arguments #3654

refactor: store l1 and l2 as model state and not command line arguments #3654

Conversation

jackgerrits commented Jan 27, 2022 • edited Loading

jackgerrits Jan 27, 2022

Choose a reason for hiding this comment

lokitoth Jan 27, 2022

Choose a reason for hiding this comment

jackgerrits Jan 27, 2022

Choose a reason for hiding this comment

jackgerrits Jan 27, 2022

Choose a reason for hiding this comment

rajan-chari commented Jan 27, 2022

rajan-chari Jan 27, 2022

Choose a reason for hiding this comment

jackgerrits Jan 27, 2022

Choose a reason for hiding this comment

jackgerrits commented Jan 27, 2022

jackgerrits commented Jan 27, 2022 •

edited

Loading