PUBDEV-8047: AutoML Save/Load implementation #6437

tomasfryda · 2022-11-24T15:07:48Z

https://h2oai.atlassian.net/browse/PUBDEV-8047

Known issues:

missing automl_training_frame
- ~~get_leaderboard("ALL") on loaded object produces NAs for predict_time_per_row_ms on loaded models~~
  - fixed by computing the extended leaderboard before saving, the predict_time_per_row_ms is successfully saved and loaded.
- ~~make_leaderboard("ALL") on loaded object without provided leaderboard_frame fails on predict_time_per_row_ms~~
  - this still happens but the behavior is the same as for normal automl object so Save/Load doesn't change anything in this case.

h2o-automl/src/main/java/ai/h2o/automl/AutoML.java

tomasfryda · 2022-11-25T11:46:27Z

h2o-automl/src/main/java/ai/h2o/automl/AutoML.java

+  protected static AutoML readAutoML(AutoBuffer ab, Futures fs) {
+    try (PersistenceContext pc = PersistenceContext.begin()) {
+      AutoML aml = new AutoML(ab.get(), null, ab.get(), false);
+      aml._leaderboard = (Leaderboard) ab.getKey(fs);
+      aml._eventLog = (EventLog) ab.getKey(fs);
+//      aml._trainingFrame = (Frame) ab.getKey(fs);
+      fs.blockForPending();
+      for (Key mk : aml.leaderboard().getModelKeys()) {
+        Model m = (Model) PersistenceContext.getKey(ab, fs, mk);
+        if (aml._buildSpec.build_control.keep_cross_validation_predictions)
+          for (Key k : m._output._cross_validation_predictions)
+            PersistenceContext.loadKey(ab, fs, k);
+        if (aml._buildSpec.build_control.keep_cross_validation_models)
+          for (Key k : m._output._cross_validation_models)
+            PersistenceContext.loadKey(ab, fs, k);
+        if (aml._buildSpec.build_control.keep_cross_validation_fold_assignment)
+          PersistenceContext.loadKey(ab,fs ,m._output._cross_validation_fold_assignment_frame_id);
+//        if (m instanceof StackedEnsembleModel)
+//          PersistenceContext.loadKey(ab, fs, ((StackedEnsembleModel)m)._output._metalearner._parms._train);
+      }
+      DKV.put(aml);
+      return aml;
+    } catch (Exception e) {
+      throw new RuntimeException(e);
+    }
+  }


The objects have to be deserialized exactly in the same order as they were serialized => we need to keep track of what was already loaded and in that case skip the loading of that object.

ledell

This looks great, thank you!

tomasfryda added python R labels Nov 24, 2022

tomasfryda requested review from ledell and sebhrusen November 24, 2022 15:07

tomasfryda self-assigned this Nov 24, 2022

tomasfryda commented Nov 25, 2022

View reviewed changes

h2o-automl/src/main/java/ai/h2o/automl/AutoML.java Outdated Show resolved Hide resolved

tomasfryda commented Nov 25, 2022

View reviewed changes

tomasfryda force-pushed the tomf_PUBDEV-8047_save_load_automl branch 2 times, most recently from d4fb774 to 5ba924b Compare November 28, 2022 11:41

Initial AutoML Save/Load implementation

74490f2

tomasfryda force-pushed the tomf_PUBDEV-8047_save_load_automl branch from 5ba924b to 74490f2 Compare November 28, 2022 13:09

tomasfryda added 2 commits November 29, 2022 08:35

Force predict_time_per_row_ms calculation before saving

7d9cb25

Subsample the training frame for predict_time_per_row_ms calculation

006d7b2

tomasfryda marked this pull request as ready for review November 30, 2022 15:46

Remove unused parts in AutoMLImportV3

a7fefad

tomasfryda added the please review label Dec 19, 2022

ledell approved these changes Jan 10, 2023

View reviewed changes

h2o-ops mentioned this pull request May 14, 2023

Create a function to save and load AutoML objects #7602

Open

sebhrusen mentioned this pull request Oct 23, 2023

how to save the trained autoML object or all models in it? #15853

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PUBDEV-8047: AutoML Save/Load implementation #6437

PUBDEV-8047: AutoML Save/Load implementation #6437

tomasfryda commented Nov 24, 2022 •

edited

tomasfryda Nov 25, 2022

ledell left a comment

PUBDEV-8047: AutoML Save/Load implementation #6437

Are you sure you want to change the base?

PUBDEV-8047: AutoML Save/Load implementation #6437

Conversation

tomasfryda commented Nov 24, 2022 • edited

tomasfryda Nov 25, 2022

Choose a reason for hiding this comment

ledell left a comment

Choose a reason for hiding this comment

tomasfryda commented Nov 24, 2022 •

edited