You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
made a simple script to try AutoML on a modified Titanic dataset (predicting an enum variable).
If I leave the target ('Survived' = [0,1]) as an integer, everything runs fine - but as this is a classification problem I force 'Survived' to be type 'enum' by setting the values to 'yes' and 'no'.
H2OXGBoostEstimator trains properly
Here's the error:
{{OSError: Job with key $03017f00000132d4ffffffff$_823029d133bd6a203b841a5297ef42c6 failed with an exception: java.lang.IllegalArgumentException: Test/Validation dataset has categorical column 'Survived' which is real-valued in the training data}}
training data:
[^train.csv]
demo script
[^titanic-demo.py]
The text was updated successfully, but these errors were encountered:
Sebastien Poirier commented: [~accountid:5d34759b6e55370bc308bdbb] Sorry for tackling this late: this is actually caused by [https://0xdata.atlassian.net/browse/PUBDEV-5975|https://0xdata.atlassian.net/browse/PUBDEV-5975|smart-link] .
You ran the AutoML instance twice using the same {{project_name}}, which in this case is considered as a rerun and will append new models to the existing leaderboard, with the {{models}} of the first run trying to get re-scored against the new modified {{leaderboard_frame}}.
Issue [https://0xdata.atlassian.net/browse/PUBDEV-5975|https://0xdata.atlassian.net/browse/PUBDEV-5975|smart-link] is aiming at solving those rerun scenarios.
Now thanks to this ticket, I’m considering binding the {{leaderboard}} to the corresponding leaderboard_frame.
made a simple script to try AutoML on a modified Titanic dataset (predicting an enum variable).
If I leave the target ('Survived' = [0,1]) as an integer, everything runs fine - but as this is a classification problem I force 'Survived' to be type 'enum' by setting the values to 'yes' and 'no'.
H2OXGBoostEstimator trains properly
Here's the error:
{{OSError: Job with key $03017f00000132d4ffffffff$_823029d133bd6a203b841a5297ef42c6 failed with an exception: java.lang.IllegalArgumentException: Test/Validation dataset has categorical column 'Survived' which is real-valued in the training data}}
training data:
[^train.csv]
demo script
[^titanic-demo.py]
The text was updated successfully, but these errors were encountered: