You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
H2O calculates unique() values on categorical vectors simply by copying the domain of the vector(+ adding NA, this is currently WIP). If the column is a result of AstRowSlice operation, the domain is WRONG, as it contains domain of the original vector.
{code:python}
# make sure domains are recalculated with each temp assign
df_example = h2o.H2OFrame({'time': ['M','M','M','D','D','M','M','D'],
'amount': [1,4,5,0,0,1,3,0]})
One solution is to reduce the domain (discussion: #4848 (comment)) when the rows are sliced - controversial step. Second solution is to adjust the domain in-place after the unique() operation is done.
The text was updated successfully, but these errors were encountered:
Pavel Pscheidl commented: The domain should remain as-is, as this is desired behavior. The domain has to be collected in-place when doing the unique() operation. Being resolved as a part of [https://0xdata.atlassian.net/browse/HEXDEV-762|https://0xdata.atlassian.net/browse/HEXDEV-762|smart-link] .
H2O calculates unique() values on categorical vectors simply by copying the domain of the vector(+ adding NA, this is currently WIP). If the column is a result of AstRowSlice operation, the domain is WRONG, as it contains domain of the original vector.
{code:python}
# make sure domains are recalculated with each temp assign
df_example = h2o.H2OFrame({'time': ['M','M','M','D','D','M','M','D'],
'amount': [1,4,5,0,0,1,3,0]})
{code}
One solution is to reduce the domain (discussion: #4848 (comment)) when the rows are sliced - controversial step. Second solution is to adjust the domain in-place after the unique() operation is done.
The text was updated successfully, but these errors were encountered: