Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Variable importance heatmap for AutoML inside h2o.explain shows too many features #7366

Closed
exalate-issue-sync bot opened this issue May 11, 2023 · 4 comments
Assignees

Comments

@exalate-issue-sync
Copy link

I think we forgot to limit the number of features shown in the heatmap to the top 20? Here's an image where it's showing way more (this dataset has 4k features – [guillermo|https://www.openml.org/d/41159]).

The {{h2o.varimp_heatmap()}} has default {{top_n = 20}} but it doesn’t look like it’s working.

!Screen Shot 2021-08-20 at 5.10.52 PM.png|width=200,height=183!

{noformat}library(OpenML)
library(h2o)

data_id <- 41159
#guillermo: https://www.openml.org/d/41159 target: "class"

h2o.init()

omld <- getOMLDataSet(data.id = data_id)
y <- omld$target.features
data <- as.h2o(omld$data)
data_name <- omld$desc$name

ss <- h2o.splitFrame(data, ratios = 0.8, seed = 1)
train <- ss[[1]]
test <- ss[[2]]
dim(train)

aml_fullt <- h2o.automl(y = y,
training_frame = train,
leaderboard_frame = test,
max_runtime_secs = 3600,
project_name = paste0(data_name, "_fullt"),
seed = 1)

lb_fullt <- aml_fullt@leaderboard
print(lb_fullt, n = nrow(lb_fullt))
aml_fullt@training_info$duration_secs

h2o.explain(aml_fullt, test) #plot is from here{noformat}

@exalate-issue-sync
Copy link
Author

Tomas Fryda commented: {{top_n}} refers to number of models not features.

@exalate-issue-sync
Copy link
Author

Erin LeDell commented: Duplicate of https://h2oai.atlassian.net/browse/PUBDEV-8014

@h2o-ops-ro
Copy link
Collaborator

JIRA Issue Details

Jira Issue: PUBDEV-8289
Assignee: Tomas Fryda
Reporter: Erin LeDell
State: Closed
Fix Version: 3.34.0.1
Attachments: Available (Count: 1)
Development PRs: N/A

@h2o-ops-ro
Copy link
Collaborator

Attachments From Jira

Attachment Name: Screen Shot 2021-08-20 at 5.10.52 PM.png
Attached By: Erin LeDell
File Link:https://h2o-3-jira-github-migration.s3.amazonaws.com/PUBDEV-8289/Screen Shot 2021-08-20 at 5.10.52 PM.png

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants