Enable running MOJO spark pipeline without H2O init #4935

exalate-issue-sync · 2023-05-22T18:01:27Z

Usecase: train a pipeline containing an h2o stage, store it to file. Then read it from a file in another script and use it for scoring without explicitly initialising H2O (like with similar calls: H2OConf(spark))

Without that an exception would be thrown:
java.lang.ClassNotFoundException: org.apache.spark.ml.h2o.models.H2OMOJOModel

exalate-issue-sync · 2023-05-22T18:01:29Z

Jakub Hava commented: After analyzing this, loading the classes can't be handled automatically. Therefore we can document this behavior and say that user needs to initialize the PySparkling in this case using the Initializer.load_sparkling_jar call

CC: [~accountid:5a73cf5fdca0242a1ca84b4b]

DinukaH2O · 2023-05-23T12:09:37Z

JIRA Issue Migration Info

Jira Issue: SW-817
Assignee: Jakub Hava
Reporter: Stefan Pacinda
State: Resolved
Fix Version: 2.1.28
Attachments: N/A
Development PRs: Available

Linked PRs from JIRA

#696
#697

hasithjp · 2023-05-29T15:07:41Z

JIRA Issue Migration Info Cont'd

Jira Issue Created Date: 2018-04-24T03:33:59.365-0700

DinukaH2O assigned jakubhava May 23, 2023

DinukaH2O closed this as completed May 23, 2023

DinukaH2O added the fixVersion/2.1.28 label May 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable running MOJO spark pipeline without H2O init #4935

Enable running MOJO spark pipeline without H2O init #4935

exalate-issue-sync bot commented May 22, 2023

exalate-issue-sync bot commented May 22, 2023

DinukaH2O commented May 23, 2023

hasithjp commented May 29, 2023

Enable running MOJO spark pipeline without H2O init #4935

Enable running MOJO spark pipeline without H2O init #4935

Comments

exalate-issue-sync bot commented May 22, 2023

exalate-issue-sync bot commented May 22, 2023

DinukaH2O commented May 23, 2023

hasithjp commented May 29, 2023