You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ignored_columns: (Optional, Python only) Specify the column or columns (as a list/vector) to be excluded from the model. This is the converse of the x argument.
If I read it naively, I would expect H2OAutoML object in Python API allows 'ignored_columns' to be specified explicitly. But in reality, it only allows specifying 'x' (=included column names) to its train() method, but never exposes 'ignored_columns' directly. http://docs.h2o.ai/h2o/latest-stable/h2o-py/docs/modeling.html#h2oautoml
Angela Bartz commented: [~accountid:5b153fb1b0d76456f36daced] can you verify whether ignored_columns is used in AutoML or if this parameter is ignored?
Sebastien Poirier commented: [~accountid:557058:6e44bc1a-dd50-499b-a331-2e049f28773b] I think the {{ignored_columns}} section should be removed from AutoML documentation as it is not directly exposed to end user.
As described in this ticket, AutoML uses this params from the REST API only internally on both Python+R clients (like for all algos in the R client) using simple formula:
{{ignored_columns = all_columns - x - y - fold_column - weights_column}}
In my opinion, this is a good thing, and exposing this parameter for other algos on the Python API was an unfortunate mistake as it plays a role very similar to {{x}} parameter, which can only create confusion and misuse.
I am referring to the following description on this page.
http://docs.h2o.ai/h2o/latest-stable/h2o-docs/automl.html
If I read it naively, I would expect H2OAutoML object in Python API allows 'ignored_columns' to be specified explicitly. But in reality, it only allows specifying 'x' (=included column names) to its train() method, but never exposes 'ignored_columns' directly.
http://docs.h2o.ai/h2o/latest-stable/h2o-py/docs/modeling.html#h2oautoml
Internally, it derives 'ignored_columns' parameter for the REST request from the 'x' vector and other parameters such as fold_column or weight_column (cf. PUBDEV-4509 ) iff 'x' is specified.
https://github.com/h2oai/h2o-3/blob/jenkins-rel-xia-2/h2o-py/h2o/automl/autoh2o.py#L336
See also PUBDEV-5057 as it is the issue why the description has been added in the first place.
The text was updated successfully, but these errors were encountered: