You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Jakub Hava commented: Just an update - we know the fix for it with [~accountid:557058:389d9607-5bd8-4611-8c6a-755fe9295223], it doesn't affect the h2o core, but just sparkling-water. We're just trying to see what are all the consequences, if it's all fine, we could put it into the fix release tomorrow I was mentioning on Slack. What do you think [~accountid:557058:7e008760-093e-4668-9387-9ca6f3fd2aa7]?
#90702
Code to repro-
from Kuba - looks like the issue. frame is not re-evaluated after the column is added.
{code:java}
import csv file
spark_df = sqlContext.read.format('com.databricks.spark.csv').options(header='true', inferschema='true').load('BostonHousing.csv')
create h2o context
from pysparkling import *
hc = H2OContext.getOrCreate(sc)
boston = hc.as_h2o_frame(spark_df)
import h2o
from h2o.estimators.glm import H2OGeneralizedLinearEstimator
predictors = boston.columns[:-1]
response = "medv"
boston_glm2 = H2OGeneralizedLinearEstimator(nfolds=2,Lambda=.01)
boston_glm2.train(x = predictors, y = response,training_frame = boston)
pred = boston_glm2.predict(boston)
boston["predict"] = pred['predict']
sp_boston = hc.as_spark_frame(boston)
sp_boston
{code}
The text was updated successfully, but these errors were encountered: