
Fix to save the state of FTRL models #1912

Merged
merged 16 commits into VowpalWabbit:master on Jun 5, 2019

Conversation

bremen79 (Contributor) commented Jun 5, 2019

I have made a fix to save the state of FTRL models. Currently the state is not saved at all, because saving goes through the gd save function, which only saves the state if it finds a gd structure, and that structure is absent in FTRL models.

I have added an integer to the vw structure that indicates the size of the FTRL model state to save, and changed the read/write code in gd accordingly.

Note that saving still does not work for Pistol and Coin Betting: the state is saved correctly, but the test fails. I am not sure why; I suspect it is the loss of precision caused by saving, combined with the exponential nature of these algorithms. On the other hand, saving the state of FTRL Proximal now works perfectly.
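For readers unfamiliar with the reductions involved, here is a minimal, self-contained sketch of the idea behind the fix. It is illustrative only: ftrl_entry, save_state, and load_state are hypothetical names, not VW's actual save_load interface, and the sketch hardcodes the state layout that the PR instead conveys via an integer size. The point is that resuming FTRL training requires persisting the per-feature accumulators in addition to the weights.

```cpp
// Hypothetical sketch (not VW code): a resumable FTRL save must persist the
// per-feature accumulators, not just the weights that a plain-gd save writes out.
#include <fstream>
#include <vector>

// Illustrative per-feature FTRL-Proximal state: the weight plus two accumulators.
struct ftrl_entry
{
  float w = 0.f;  // current weight
  float z = 0.f;  // accumulated adjusted gradient
  float n = 0.f;  // accumulated squared gradient
};

void save_state(const std::vector<ftrl_entry>& model, const char* path)
{
  std::ofstream out(path, std::ios::binary);
  for (const auto& e : model)
  {
    out.write(reinterpret_cast<const char*>(&e.w), sizeof e.w);
    out.write(reinterpret_cast<const char*>(&e.z), sizeof e.z);  // lost if only weights are saved
    out.write(reinterpret_cast<const char*>(&e.n), sizeof e.n);  // lost if only weights are saved
  }
}

void load_state(std::vector<ftrl_entry>& model, const char* path)
{
  std::ifstream in(path, std::ios::binary);
  for (auto& e : model)
  {
    in.read(reinterpret_cast<char*>(&e.w), sizeof e.w);
    in.read(reinterpret_cast<char*>(&e.z), sizeof e.z);
    in.read(reinterpret_cast<char*>(&e.n), sizeof e.n);
  }
}
```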

@@ -153,8 +153,9 @@ def do_test(filename, args, verbose=None, repeat_args=None, known_failure=False)
errors += do_test(filename, '--loss_function logistic --link logistic')
errors += do_test(filename, '--nn 2')
errors += do_test(filename, '--binary')
errors += do_test(filename, '--ftrl', known_failure=True)
errors += do_test(filename, '--ftrl')
errors += do_test(filename, '--pistol', known_failure=True)
Member:

Are pistol and coin still in known failure?

bremen79 (Contributor Author):

Yes. As I wrote above, the data structure is saved correctly, but the weights end up different depending on whether the model is trained continuously or trained, saved, and resumed. The difference does not appear immediately, only after some samples, which is why I suspect numerical issues.
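A toy C++ illustration of this hypothesis (not VW code; the value and the update factor are made up) shows how a tiny representation error introduced at save time stays invisible at first and then grows under repeated multiplicative updates:

```cpp
// Toy example of the precision hypothesis: round-tripping optimizer state through a
// lower-precision representation leaves a tiny error that multiplicative updates amplify.
#include <cstdio>

int main()
{
  double continuous = 0.1234567890123456;           // state kept in memory the whole time
  double resumed = static_cast<float>(continuous);  // simulate a save/resume through 32-bit floats

  // Apply the same multiplicative update many times to both copies.
  for (int i = 0; i < 1000; ++i)
  {
    continuous *= 1.01;
    resumed *= 1.01;
  }
  std::printf("continuous: %.12f\nresumed:    %.12f\ndelta:      %.3e\n",
              continuous, resumed, continuous - resumed);
  return 0;
}
```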

Member:

This violates my understanding of computers, so I suspect there is something we're missing. I probably won't be able to debug it before the next release, but it should get done...

bremen79 (Contributor Author):

I am trying to debug this issue a bit more.

@@ -565,6 +565,7 @@ struct vw
bool adaptive; // Should I use adaptive individual learning rates?
bool normalized_updates; // Should every feature be normalized
bool invariant_updates; // Should we use importance aware/safe updates
uint32_t ftrl_size;  // Size of the ftrl model state to save/load
Member:

Could we pass this as an argument to save_load instead? That seems more elegant than sticking it in the global data structure.

bremen79 (Contributor Author):

Done.
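For context, a hedged sketch of the two designs being discussed; all names below are illustrative placeholders, not VW's actual types or functions:

```cpp
// Option A (original patch): a reduction-specific size field lives in the global
// structure, so everything that touches the global sees FTRL-specific state.
#include <cstdint>
#include <vector>

struct global_data_a
{
  std::vector<float> weights;
  uint32_t ftrl_size = 0;
};

// Option B (what the review asked for): the global structure stays generic, and the
// reduction that knows its own state layout passes the size to save/load directly.
struct global_data_b
{
  std::vector<float> weights;
};

void save_load_state(global_data_b& all, bool read, uint32_t per_feature_size)
{
  // ... serialize `per_feature_size` values per feature from/to the model file ...
  (void)all; (void)read; (void)per_feature_size;
}
```

Passing the size as an argument keeps reduction-specific knowledge inside the reduction that owns it, which is the elegance point raised above.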

Member:

Oops, I merged. Should I revert, or do you want to do a separate pull request?

bremen79 (Contributor Author):

Found the bug. Not sure what the best way to fix it is; see mail.

JohnLangford (Member) commented Jun 5, 2019

The test 67 failure is fixed in the current patch that I'm working on.

@JohnLangford JohnLangford merged commit 538e9bb into VowpalWabbit:master Jun 5, 2019
JohnLangford added a commit that referenced this pull request Jun 5, 2019
JohnLangford added a commit that referenced this pull request Jun 6, 2019
yarikoptic added a commit to yarikoptic/vowpal_wabbit that referenced this pull request Jun 9, 2019
* tag '8.7.0': (354 commits)
  Update version to 8.7.0 (VowpalWabbit#1926)
  Fix misconfiguration (VowpalWabbit#1925)
  Ataymano ataymano/warnings fixes (VowpalWabbit#1924)
  Update new version script (VowpalWabbit#1922)
  Run clang-format over codebase (VowpalWabbit#1921)
  change semantics of lambda (VowpalWabbit#1920)
  Bremen79 fix save ftrl (VowpalWabbit#1919)
  fix for daemon race condition (VowpalWabbit#1918)
  Revert "Fix to save the state of FTRL models (VowpalWabbit#1912)" (VowpalWabbit#1916)
  fix static library build (VowpalWabbit#1913)
  more warnings (VowpalWabbit#1915)
  Fix to save the state of FTRL models (VowpalWabbit#1912)
  remove warnings (VowpalWabbit#1911)
  fix closing invalid file descriptor with memory_io_buf (VowpalWabbit#1910)
  Optional exception (VowpalWabbit#1906)
  Contextual Memory Tree (VowpalWabbit#1799)
  Coin betting (VowpalWabbit#1903)
  Ataymano/c wrapper fix2 (VowpalWabbit#1859)
  Use Appveyor MSBuildLogger (VowpalWabbit#1904)
  fix for no label confidence (VowpalWabbit#1901)
  ...
yarikoptic added a commit to yarikoptic/vowpal_wabbit that referenced this pull request Jun 9, 2019
yarikoptic added a commit to yarikoptic/vowpal_wabbit that referenced this pull request Jun 9, 2019