Permalink
Newer
Older
100644 738 lines (553 sloc) 33.1 KB
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1 # Migrating to H2O 3.0
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
2
3 We're excited about the upcoming release of the latest and greatest version of H2O, and we hope you are too! H2O 3.0 has lots of improvements, including:
4
May 15, 2015 @jessica0xdata Couple of doc updates
5 - Powerful Python APIs
6 - Flow, a brand-new intuitive web UI
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
7 - The ability to share, annotate, and modify workflows
May 15, 2015 @jessica0xdata Couple of doc updates
8 - Versioned REST APIs with full metadata
9 - Spark integration using Sparkling Water
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
10 - Improved algorithm accuracy and speed
11
May 15, 2015 @jessica0xdata Couple of doc updates
12 and much more! Overall, H2O has been retooled for better accuracy and performance and to provide additional functionality. If you're a current user of H2O, we strongly encourage you to upgrade to the latest version to take advantage of the latest features and capabilities.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
13
May 15, 2015 @jessica0xdata Couple of doc updates
14 Please be aware that H2O 3.0 will supersede all previous versions of H2O as the primary version as of May 15th, 2015. Support for previous versions will be offered for a limited time, but there will no longer be any significant updates to the previous version of H2O.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
15
16 The following information and links will inform you about what's new and different and help you prepare to upgrade to H2O 3.0.
17
May 15, 2015 @jessica0xdata Couple of doc updates
18 Overall, H2O 3.0 is more stable, elegant, and simplified, with additional capabilities not available in previous versions of H2O.
May 15, 2015 @jessica0xdata Batch docs update
19
May 15, 2015 @jessica0xdata Revise Migration Guide
20 ---
21
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
22 ## Algorithm Changes
May 15, 2015 @jessica0xdata Revise Migration Guide
23
Oct 21, 2015 @jessica0xdata Booklet & other doc updates
24 Most of the algorithms available in previous versions of H2O have been improved in terms of speed and accuracy. Currently available model types include:
25
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
26 ### Supervised
Oct 21, 2015 @jessica0xdata Booklet & other doc updates
27
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
28 - **Generalized Linear Model (GLM)**: Binomial classification, multinomial classification, regression (including logistic regression)
Oct 21, 2015 @jessica0xdata Booklet & other doc updates
29 - **Distributed Random Forest (DRF)**: Binomial classification, multinomial classification, regression
30 - **Gradient Boosting Machine (GBM)**: Binomial classification, multinomial classification, regression
31 - **Deep Learning (DL)**: Binomial classification, multinomial classification, regression
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
32 - Naive Bayes
33 - Stacked Ensembles
34 - XGBoost
Oct 21, 2015 @jessica0xdata Booklet & other doc updates
35
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
36 ### Unsupervised
Oct 21, 2015 @jessica0xdata Booklet & other doc updates
37
38 - K-means
39 - Principal Component Analysis
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
40 - Autoencoder
41 - Generalized Low Rank Models
May 15, 2015 @jessica0xdata Revise Migration Guide
42
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
43 ### Miscellaneous
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
44
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
45 - **Word2vec**
May 15, 2015 @jessica0xdata Revise Migration Guide
46
47 Check back for updates, as these algorithms will be re-introduced in an improved form in a future version of H2O.
48
49 **Note**: The SpeeDRF model has been removed, as it was originally intended as an optimization for small data only. This optimization will be added to the Distributed Random Forest model automatically for small data in a future version of H2O.
50
51 ---
52
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
53 ## Parsing Changes
Jun 16, 2015 @jessica0xdata Update FAQ with info from gitter; update team list
54
Oct 27, 2015 @jessica0xdata Updates
55 In H2O Classic, the parser reads all the data and tries to guess the column type. In H2O 3.0, the parser reads a subset and makes a type guess for each column. In Flow, you can view the preliminary parse results in the **Edit Column Names and Types** area. To change the column type, select an option from the drop-down menu to the right of the column. H2O 3.0 can also automatically identify mixed-type columns; in H2O Classic, if one column is mixed integers or real numbers using a string, the output is blank.
Jun 16, 2015 @jessica0xdata Update FAQ with info from gitter; update team list
56
57 ---
58
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
59 ## Web UI Changes
May 15, 2015 @jessica0xdata Batch docs update
60
May 15, 2015 @jessica0xdata Couple of doc updates
61 Our web UI has been completely overhauled with a much more intuitive interface that is similar to IPython Notebook. Each point-and-click action is translated immediately into an individual workflow script that can be saved for later interactive and offline use. As a result, you can now revise and rerun your workflows easily, and can even add comments and rich media.
May 15, 2015 @jessica0xdata Batch docs update
62
May 15, 2015 @jessica0xdata Couple of doc updates
63 For more information, refer to our [Getting Started with Flow](https://github.com/h2oai/h2o-dev/blob/master/h2o-docs/src/product/flow/README.md) guide, which comprehensively documents how to use Flow. You can also view this brief [video](https://www.youtube.com/watch?v=wzeuFfbW7WE), which provides an overview of Flow in action.
May 15, 2015 @jessica0xdata Batch docs update
64
May 15, 2015 @jessica0xdata Revise Migration Guide
65 ---
66
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
67 ## API Users
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
68
Oct 27, 2015 @jessica0xdata Updates
69 H2O's new Python API allows Pythonistas to use H2O in their favorite environment. Using the Python command line or an integrated development environment like IPython Notebook, H2O users can control clusters and manage massive datasets quickly.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
70
71 H2O's REST API is the basis for the web UI (Flow), as well as the R and Python APIs, and is versioned for stability. It is also easier to understand and use, with full metadata available dynamically from the server, allowing for easier integration by developers.
72
May 15, 2015 @jessica0xdata Revise Migration Guide
73 ---
74
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
75 ## Java Users
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
76
May 15, 2015 @jessica0xdata Couple of doc updates
77 Generated Java REST classes ease REST API use by external programs running in a Java Virtual Machine (JVM).
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
78
May 15, 2015 @jessica0xdata Couple of doc updates
79 As in previous versions of H2O, users can export trained models as Java objects for easy integration into JVM applications. H2O is currently the only ML tool that provides this capability, making it the data science tool of choice for enterprise developers.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
80
May 15, 2015 @jessica0xdata Revise Migration Guide
81 ---
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
82
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
83 ## R Users
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
84
May 15, 2015 @jessica0xdata Couple of doc updates
85 If you use H2O primarily in R, be aware that as a result of the improvements to the R package for H2O scripts created using previous versions (Nunes 2.8.6.2 or prior) will require minor revisions to work with H2O 3.0.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
86
Oct 27, 2015 @jessica0xdata Updates
87 To assist our R users in upgrading to H2O 3.0, a "shim" tool has been developed. The [shim](https://github.com/h2oai/h2o-dev/blob/9795c401b7be339be56b1b366ffe816133cccb9d/h2o-r/h2o-package/R/shim.R) reviews your script, identifies deprecated or revised parameters and arguments, and suggests replacements.
88
89 >**Note**: As of Slater v.3.2.0.10, this shim will no longer be available.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
90
May 15, 2015 @jessica0xdata Revise Migration Guide
91 There is also an [R Porting Guide](#PortingGuide) that provides a side-by-side comparison of the algorithms in the previous version of H2O with H2O 3.0. It outlines the new, revised, and deprecated parameters for each algorithm, as well as the changes to the output.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
92
May 15, 2015 @jessica0xdata Revise Migration Guide
93 ---
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
94
May 15, 2015 @jessica0xdata Revise Migration Guide
95 <a name="PortingGuide"></a>
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
96
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
97 # Porting R Scripts
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
98
May 15, 2015 @jessica0xdata Revise Migration Guide
99 This document outlines how to port R scripts written in previous versions of H2O (Nunes 2.8.6.2 or prior, also known as "H2O Classic") for compatibility with the new H2O 3.0 API. When upgrading from H2O to H2O 3.0, most functions are the same. However, there are some differences that will need to be resolved when porting any scripts that were originally created using H2O to H2O 3.0.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
100
May 15, 2015 @jessica0xdata Revise Migration Guide
101 The original R script for H2O is listed first, followed by the updated script for H2O 3.0.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
102
May 15, 2015 @jessica0xdata Revise Migration Guide
103 Some of the parameters have been renamed for consistency. For each algorithm, a table that describes the differences is provided.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
104
May 15, 2015 @jessica0xdata Revise Migration Guide
105 For additional assistance within R, enter a question mark before the command (for example, `?h2o.glm`).
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
106
May 15, 2015 @jessica0xdata Revise Migration Guide
107 There is also a "shim" available that will review R scripts created with previous versions of H2O, identify deprecated or renamed parameters, and suggest replacements. For more information, refer to the repo [here](https://github.com/h2oai/h2o-dev/blob/d9693a97da939a2b77c24507c8b40a5992192489/h2o-r/h2o-package/R/shim.R).
108
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
109 ## Changes from H2O 2.8 to H2O 3.0
May 15, 2015 @jessica0xdata Revise Migration Guide
110
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
111 ### `h2o.exec`
May 15, 2015 @jessica0xdata Revise Migration Guide
112 The `h2o.exec` command is no longer supported. Any workflows using `h2o.exec` must be revised to remove this command. If the H2O 3.0 workflow contains any parameters or commands from H2O Classic, errors will result and the workflow will fail.
113
114 The purpose of `h2o.exec` was to wrap expressions so that they could be evaluated in a single `\Exec2` call. For example,
115 `h2o.exec(fr[,1] + 2/fr[,3])`
116 and
117 `fr[,1] + 2/fr[,3]`
118 produced the same results in H2O. However, the first example makes a single REST call and uses a single temp object, while the second makes several REST calls and uses several temp objects.
119
120 Due to the improved architecture in H2O 3.0, the need to use `h2o.exec` has been eliminated, as the expression can be processed by R as an "unwrapped" typical R expression.
121
122 Currently, the only known exception is when `factor` is used in conjunction with `h2o.exec`. For example, `h2o.exec(fr$myIntCol <- factor(fr$myIntCol))` would become `fr$myIntCol <- as.factor(fr$myIntCol)`
123
124 Note also that an array is not inside a string:
125
126 An int array is [1, 2, 3], *not* "[1, 2, 3]".
127
128 A String array is ["f00", "b4r"], *not* "[\"f00\", \"b4r\"]"
129
130 Only string values are enclosed in double quotation marks (`"`).
131
132 <a name="h2operf"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
133 ### `h2o.performance`
May 15, 2015 @jessica0xdata Revise Migration Guide
134
135 To access any exclusively binomial output, use `h2o.performance`, optionally with the corresponding accessor. The accessor can only use the model metrics object created by `h2o.performance`. Each accessor is named for its corresponding field (for example, `h2o.AUC`, `h2o.gini`, `h2o.F1`). `h2o.performance` supports all current algorithms except for K-Means.
136
137 If you specify a data frame as a second parameter, H2O will use the specified data frame for scoring. If you do not specify a second parameter, the training metrics for the model metrics object are used.
138
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
139 ### `xval` and `validation` slots
May 15, 2015 @jessica0xdata Revise Migration Guide
140
141 The `xval` slot has been removed, as `nfolds` is not currently supported.
142
143 The `validation` slot has been merged with the `model` slot.
144
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
145 ### Principal Components Regression (PCR)
May 15, 2015 @jessica0xdata Revise Migration Guide
146
147 Principal Components Regression (PCR) has also been deprecated. To obtain PCR values, create a Principal Components Analysis (PCA) model, then create a GLM model from the scored data from the PCA model.
148
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
149 ### Saving and Loading Models
Jun 13, 2015 @jessica0xdata One more update for intercept (R docs)
150
151 Saving and loading a model from R is supported in version 3.0.0.18 and later. H2O 3.0 uses the same binary serialization method as previous versions of H2O, but saves the model and its dependencies into a directory, with each object as a separate file. The `save_CV` option for available in previous versions of H2O has been deprecated, as `h2o.saveAll` and `h2o.loadAll` are not currently supported. The following commands are now supported:
152
153 - `h2o.saveModel`
154 - `h2o.loadModel`
155
156
May 15, 2015 @jessica0xdata Revise Migration Guide
157
158 **Table of Contents**
159
160 - [GBM](#GBM)
161 - [GLM](#GLM)
162 - [K-Means](#Kmeans)
163 - [Deep Learning](#DL)
164 - [Distributed Random Forest](#DRF)
165
166
167
168 <a name="GBM"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
169 ## GBM
May 15, 2015 @jessica0xdata Revise Migration Guide
170
Oct 27, 2015 @jessica0xdata Updates
171 N-fold cross-validation and grid search are currently supported in H2O 3.0.
May 15, 2015 @jessica0xdata Revise Migration Guide
172
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
173 ### Renamed GBM Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
174
175 The following parameters have been renamed, but retain the same functions:
176
177 H2O Classic Parameter Name | H2O 3.0 Parameter Name
178 -------------------|-----------------------
179 `data` | `training_frame`
180 `key` | `model_id`
181 `n.trees` | `ntrees`
182 `interaction.depth` | `max_depth`
183 `n.minobsinnode` | `min_rows`
184 `shrinkage` | `learn_rate`
185 `n.bins` | `nbins`
186 `validation` | `validation_frame`
187 `balance.classes` | `balance_classes`
188 `max.after.balance.size` | `max_after_balance_size`
189
190
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
191 ### Deprecated GBM Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
192
193 The following parameters have been removed:
194
195 - `group_split`: Bit-set group splitting of categorical variables is now the default.
196 - `importance`: Variable importances are now computed automatically and displayed in the model output.
197 - `holdout.fraction`: The fraction of the training data to hold out for validation is no longer supported.
Oct 27, 2015 @jessica0xdata Updates
198 - `grid.parallelism`: Specifying the number of parallel threads to run during a grid search is no longer supported.
May 15, 2015 @jessica0xdata Revise Migration Guide
199
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
200 ### New GBM Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
201
202 The following parameters have been added:
203
204 - `seed`: A random number to control sampling and initialization when `balance_classes` is enabled.
205 - `score_each_iteration`: Display error rate information after each tree in the requested set is built.
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
206 - `build_tree_one_node`: Run on a single node to use fewer CPUs.
May 15, 2015 @jessica0xdata Revise Migration Guide
207
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
208 ### GBM Algorithm Comparison
May 15, 2015 @jessica0xdata Revise Migration Guide
209
210 H2O Classic | H2O 3.0
211 ------------- | -------------
212 `h2o.gbm <- function(` | `h2o.gbm <- function(`
213 `x,` |`x,`
214 `y,` |`y,`
215 `data,` | `training_frame,`
216 `key = "",` | `model_id,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
217 &nbsp; | `checkpoint`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
218 `distribution = 'multinomial',` | `distribution = c("AUTO", "gaussian", "bernoulli", "multinomial", "poisson", "gamma", "tweedie"),`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
219 &nbsp; | `tweedie_power = 1.5,`
May 15, 2015 @jessica0xdata Revise Migration Guide
220 `n.trees = 10,` | `ntrees = 50`
221 `interaction.depth = 5,` | `max_depth = 5,`
222 `n.minobsinnode = 10,` | `min_rows = 10,`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
223 `shrinkage = 0.1,` | `learn_rate = 0.1,`
224 &nbsp; | `sample_rate = 1`
225 &nbsp; | `col_sample_rate = 1`
May 15, 2015 @jessica0xdata Revise Migration Guide
226 `n.bins = 20,`| `nbins = 20,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
227 &nbsp; | `nbins_top_level,`
228 &nbsp; | `nbins_cats = 1024,`
May 15, 2015 @jessica0xdata Revise Migration Guide
229 `validation,` | `validation_frame = NULL,`
230 `balance.classes = FALSE` | `balance_classes = FALSE,`
231 `max.after.balance.size = 5,` | `max_after_balance_size = 1,`
232 &nbsp; | `seed,`
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
233 &nbsp; | `build_tree_one_node = FALSE,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
234 &nbsp; | `nfolds = 0,`
235 &nbsp; | `fold_column = NULL,`
236 &nbsp; | `fold_assignment = c("AUTO", "Random", "Modulo"),`
237 &nbsp; | `keep_cross_validation_predictions = FALSE,`
238 &nbsp; | `score_each_iteration = FALSE,`
239 &nbsp; | `stopping_rounds = 0,`
Jun 3, 2016 Documentation: Added new stopping_metric option
240 &nbsp; | `stopping_metric = c("AUTO", "deviance", "logloss", "MSE", "AUC", "r2", "misclassification," "mean_per_class_error"),`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
241 &nbsp; | `stopping_tolerance = 0.001,`
242 &nbsp; | `offset_column = NULL,`
243 &nbsp; | `weights_column = NULL,`
May 15, 2015 @jessica0xdata Revise Migration Guide
244 `group_split = TRUE,` |
245 `importance = FALSE,` |
246 `holdout.fraction = 0,` |
247 `class.sampling.factors = NULL,` |
248 `grid.parallelism = 1)` |
249
250
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
251 ### Output
May 15, 2015 @jessica0xdata Revise Migration Guide
252
253 The following table provides the component name in H2O, the corresponding component name in H2O 3.0 (if supported), and the model type (binomial, multinomial, or all). Many components are now included in `h2o.performance`; for more information, refer to [(`h2o.performance`)](#h2operf).
254
255 H2O Classic | H2O 3.0 | Model Type
256 ------------- | ------------- | -------------
257 `@model$priorDistribution`| &nbsp; | `all`
258 `@model$params` | `@allparameters` | `all`
259 `@model$err` | `@model$scoring_history` | `all`
260 `@model$classification` | &nbsp; | `all`
261 `@model$varimp` | `@model$variable_importances` | `all`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
262 `@model$confusion` | `@model$training_metrics@metrics$cm$table` | `binomial` and `multinomial`
263 `@model$auc` | `@model$training_metrics@metrics$AUC` | `binomial`
264 `@model$gini` | `@model$training_metrics@metrics$Gini` | `binomial`
May 15, 2015 @jessica0xdata Revise Migration Guide
265 `@model$best_cutoff` | &nbsp; | `binomial`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
266 `@model$F1` | `@model$training_metrics@metrics$thresholds_and_metric_scores$f1` | `binomial`
267 `@model$F2` | `@model$training_metrics@metrics$thresholds_and_metric_scores$f2` | `binomial`
268 `@model$accuracy` | `@model$training_metrics@metrics$thresholds_and_metric_scores$accuracy` | `binomial`
May 15, 2015 @jessica0xdata Revise Migration Guide
269 `@model$error` | &nbsp; | `binomial`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
270 `@model$precision` | `@model$training_metrics@metrics$thresholds_and_metric_scores$precision` | `binomial`
271 `@model$recall` | `@model$training_metrics@metrics$thresholds_and_metric_scores$recall` | `binomial`
272 `@model$mcc` | `@model$training_metrics@metrics$thresholds_and_metric_scores$absolute_MCC` | `binomial`
273 `@model$max_per_class_err` | currently replaced by `@model$training_metrics@metrics$thresholds_and_metric_scores$min_per_class_correct` | `binomial`
May 15, 2015 @jessica0xdata Revise Migration Guide
274
Jun 13, 2015 @jessica0xdata One more update for intercept (R docs)
275
276
277
278
May 15, 2015 @jessica0xdata Revise Migration Guide
279 ---
280
281 <a name="GLM"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
282 ## GLM
May 15, 2015 @jessica0xdata Revise Migration Guide
283
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
284 ### Renamed GLM Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
285
286 The following parameters have been renamed, but retain the same functions:
287
288 H2O Classic Parameter Name | H2O 3.0 Parameter Name
289 -------------------|-----------------------
290 `data` | `training_frame`
291 `key` | `model_id`
292 `nlambda` | `nlambdas`
293 `lambda.min.ratio` | `lambda_min_ratio`
294 `iter.max` | `max_iterations`
295 `epsilon` | `beta_epsilon`
296
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
297 ### Deprecated GLM Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
298
299 The following parameters have been removed:
300
301 - `return_all_lambda`: A logical value indicating whether to return every model built during the lambda search. (may be re-added)
302 - `higher_accuracy`: For improved accuracy, adjust the `beta_epsilon` value.
303 - `strong_rules`: Discards predictors likely to have 0 coefficients prior to model building. (may be re-added as enabled by default)
304 - `non_negative`: Specify a non-negative response. (may be re-added)
305 - `variable_importances`: Variable importances are now computed automatically and displayed in the model output. They have been renamed to *Normalized Coefficient Magnitudes*.
306 - `disable_line_search`: This parameter has been deprecated, as it was mainly used for testing purposes.
307 - `max_predictors`: Stops training the algorithm if the number of predictors exceeds the specified value. (may be re-added)
308
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
309 ### New GLM Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
310
311 The following parameters have been added:
312
313 - `validation_frame`: Specify the validation dataset.
314 - `solver`: Select IRLSM or LBFGS.
315
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
316 ### GLM Algorithm Comparison
May 15, 2015 @jessica0xdata Revise Migration Guide
317
318
319 H2O Classic | H2O 3.0
320 ------------- | -------------
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
321 `h2o.glm <- function(` | `h2o.glm(`
May 15, 2015 @jessica0xdata Revise Migration Guide
322 `x,` | `x,`
323 `y,` | `y,`
324 `data,` |`training_frame,`
325 `key = "",` | `model_id,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
326 &nbsp; | `validation_frame = NULL`
May 15, 2015 @jessica0xdata Revise Migration Guide
327 `iter.max = 100,` | `max_iterations = 50,`
328 `epsilon = 1e-4` | `beta_epsilon = 0`
329 `strong_rules = TRUE,` |
330 `return_all_lambda = FALSE,` |
Jun 13, 2015 @jessica0xdata One more update for intercept (R docs)
331 `intercept = TRUE,` | `intercept = TRUE`
May 15, 2015 @jessica0xdata Revise Migration Guide
332 `non_negative = FALSE,` |
333 &nbsp; | `solver = c("IRLSM", "L_BFGS"),`
334 `standardize = TRUE,` | `standardize = TRUE,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
335 `family,` | `family = c("gaussian", "binomial", "multinomial", "poisson", "gamma", "tweedie"),`
May 15, 2015 @jessica0xdata Revise Migration Guide
336 `link,` | `link = c("family_default", "identity", "logit", "log", "inverse", "tweedie"),`
337 `tweedie.p = ifelse(family == "tweedie",1.5, NA_real_)` | `tweedie_variance_power = NaN,`
338 &nbsp; | `tweedie_link_power = NaN,`
339 `alpha = 0.5,` | `alpha = 0.5,`
340 `prior = NULL` | `prior = 0.0,`
341 `lambda = 1e-5,` | `lambda = 1e-05,`
342 `lambda_search = FALSE,` | `lambda_search = FALSE,`
343 `nlambda = -1,` | `nlambdas = -1,`
344 `lambda.min.ratio = -1,` | `lambda_min_ratio = 1.0,`
345 `use_all_factor_levels = FALSE` | `use_all_factor_levels = FALSE,`
346 `nfolds = 0,` | `nfolds = 0,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
347 &nbsp; | `fold_column = NULL,`
348 &nbsp; | `fold_assignment = c("AUTO", "Random", "Modulo"),`
349 &nbsp; | `keep_cross_validation_predictions = FALSE,`
350 `beta_constraints = NULL,` | `beta_constraints = NULL)`
May 15, 2015 @jessica0xdata Revise Migration Guide
351 `higher_accuracy = FALSE,` |
352 `variable_importances = FALSE,` |
353 `disable_line_search = FALSE,` |
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
354 `offset = NULL,` | `offset_column = NULL,`
355 &nbsp; | `weights_column = NULL,`
356 &nbsp; | `intercept = TRUE,`
357 `max_predictors = -1)` | `max_active_predictors = -1)`
May 15, 2015 @jessica0xdata Revise Migration Guide
358
359
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
360 ### Output
May 15, 2015 @jessica0xdata Revise Migration Guide
361
362
363 The following table provides the component name in H2O, the corresponding component name in H2O 3.0 (if supported), and the model type (binomial, multinomial, or all). Many components are now included in `h2o.performance`; for more information, refer to [(`h2o.performance`)](#h2operf).
364
365 H2O Classic | H2O 3.0 | Model Type
366 ------------- | ------------- | -------------
367 `@model$params` | `@allparameters` | `all`
368 `@model$coefficients` | `@model$coefficients` | `all`
369 `@model$nomalized_coefficients` | `@model$coefficients_table$norm_coefficients` | `all`
370 `@model$rank` | `@model$rank` | `all`
371 `@model$iter` |`@model$iter` | `all`
372 `@model$lambda` | &nbsp; | `all`
373 `@model$deviance` | `@model$residual_deviance` | `all`
374 `@model$null.deviance` | `@model$null_deviance` | `all`
375 `@model$df.residual` | `@model$residual_degrees_of_freedom` | `all`
376 `@model$df.null` | `@model$null_degrees_of_freedom` | `all`
377 `@model$aic` | `@model$AIC`| `all`
378 `@model$train.err` | &nbsp; | `binomial`
379 `@model$prior` | &nbsp; | `binomial`
380 `@model$thresholds` | `@model$threshold` | `binomial`
381 `@model$best_threshold` | &nbsp; | `binomial`
382 `@model$auc` | `@model$AUC` | `binomial`
383 `@model$confusion` | &nbsp; | `binomial`
384
385 <a name="Kmeans"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
386 ## K-Means
May 15, 2015 @jessica0xdata Revise Migration Guide
387
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
388 ### Renamed K-Means Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
389
390 The following parameters have been renamed, but retain the same functions:
391
392 H2O Classic Parameter Name | H2O 3.0 Parameter Name
393 -------------------|-----------------------
394 `data` | `training_frame`
395 `key` | `model_id`
396 `centers` | `k`
397 `cols` | `x`
398 `iter.max` | `max_iterations`
399 `normalize` | `standardize`
400
401 **Note** In H2O, the `normalize` parameter was disabled by default. The `standardize` parameter is enabled by default in H2O 3.0 to provide more accurate results for datasets containing columns with large values.
402
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
403 ### New K-Means Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
404
405 The following parameters have been added:
406
407 - `user` has been added as an additional option for the `init` parameter. Using this parameter forces the K-Means algorithm to start at the user-specified points.
408 - `user_points`: Specify starting points for the K-Means algorithm.
409
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
410 ### K-Means Algorithm Comparison
May 15, 2015 @jessica0xdata Revise Migration Guide
411
412 H2O Classic | H2O 3.0
413 ------------- | -------------
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
414 `h2o.kmeans <- function(` | `h2o.kmeans(`
May 15, 2015 @jessica0xdata Revise Migration Guide
415 `data,` | `training_frame,`
416 `cols = '',` | `x,`
417 `centers,` | `k,`
418 `key = "",` | `model_id,`
419 `iter.max = 10,` | `max_iterations = 1000,`
420 `normalize = FALSE,` | `standardize = TRUE,`
421 `init = "none",` | `init = c("Furthest","Random", "PlusPlus"),`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
422 `seed = 0,` | `seed,`
423 &nbsp; | `nfolds = 0,`
424 &nbsp; | `fold_column = NULL,`
425 &nbsp; | `fold_assignment = c("AUTO", "Random", "Modulo"),`
426 &nbsp; | `keep_cross_validation_predictions = FALSE)`
May 15, 2015 @jessica0xdata Revise Migration Guide
427
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
428 ### Output
May 15, 2015 @jessica0xdata Revise Migration Guide
429
430
431 The following table provides the component name in H2O and the corresponding component name in H2O 3.0 (if supported).
432
433 H2O Classic | H2O 3.0
434 ------------- | -------------
435 `@model$params` | `@allparameters`
436 `@model$centers` | `@model$centers`
437 `@model$tot.withinss` | `@model$tot_withinss`
438 `@model$size` | `@model$size`
439 `@model$iter` | `@model$iterations`
440 &nbsp; | `@model$_scoring_history`
441 &nbsp; | `@model$_model_summary`
442
443 ---
444
445 <a name="DL"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
446 ## Deep Learning
May 15, 2015 @jessica0xdata Revise Migration Guide
447
448 **Note**: If the results in the confusion matrix are incorrect, verify that `score_training_samples` is equal to 0. By default, only the first 10,000 rows are included.
449
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
450 ### Renamed Deep Learning Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
451
452 The following parameters have been renamed, but retain the same functions:
453
454 H2O Classic Parameter Name | H2O 3.0 Parameter Name
455 -------------------|-----------------------
456 `data` | `training_frame`
457 `key` | `model_id`
458 `validation` | `validation_frame`
459 `class.sampling.factors` | `class_sampling_factors`
460 `override_with_best_model` | `overwrite_with_best_model`
Jun 25, 2015 @jessica0xdata Update to Porting/Migration doc to fix MSE typo for DL; added info re…
461 `dlmodel@model$valid_class_error` | `@model$validation_metrics@$MSE`
462
May 15, 2015 @jessica0xdata Revise Migration Guide
463
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
464 ### Deprecated DL Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
465
466 The following parameters have been removed:
467
468 - `classification`: Classification is now inferred from the data type.
469 - `holdout_fraction`: Fraction of the training data to hold out for validation.
Jun 25, 2015 @jessica0xdata Update to Porting/Migration doc to fix MSE typo for DL; added info re…
470 - `dlmodel@model$best_cutoff`: This output parameter has been removed.
May 15, 2015 @jessica0xdata Revise Migration Guide
471
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
472 ### New DL Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
473
474 The following parameters have been added:
475
476 - `export_weights_and_biases`: An additional option allowing users to export the raw weights and biases as H2O frames.
477
478 The following options for the `loss` parameter have been added:
479
480 - `absolute`: Provides strong penalties for mispredictions
481 - `huber`: Can improve results for regression
482
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
483 ### DL Algorithm Comparison
May 15, 2015 @jessica0xdata Revise Migration Guide
484
485 H2O Classic | H2O 3.0
486 ------------- | -------------
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
487 `h2o.deeplearning <- function(x,` | `h2o.deeplearning (x, `
May 15, 2015 @jessica0xdata Revise Migration Guide
488 `y,` | `y,`
489 `data,` | `training_frame,`
490 `key = "",` | `model_id = "",`
491 `override_with_best_model,` | `overwrite_with_best_model = true,`
492 `classification = TRUE,` |
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
493 `nfolds = 0,` | `nfolds = 0`
May 15, 2015 @jessica0xdata Revise Migration Guide
494 `validation,` | `validation_frame,`
495 `holdout_fraction = 0,` |
496 `checkpoint = " "` | `checkpoint,`
497 `autoencoder,` | `autoencoder = false,`
498 `use_all_factor_levels,` | `use_all_factor_levels = true`
499 `activation,` | `_activation = c("Rectifier", "Tanh", "TanhWithDropout", "RectifierWithDropout", "Maxout", "MaxoutWithDropout"),`
500 `hidden,` | `hidden= c(200, 200),`
501 `epochs,` | `epochs = 10.0,`
502 `train_samples_per_iteration,` |`train_samples_per_iteration = -2,`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
503 &nbsp; | `target_ratio_comm_to_comp = 0.05`
May 15, 2015 @jessica0xdata Revise Migration Guide
504 `seed,` | `_seed,`
505 `adaptive_rate,` | `adaptive_rate = true,`
506 `rho,` | `rho = 0.99,`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
507 `epsilon,` | `epsilon = 1e-08,`
May 15, 2015 @jessica0xdata Revise Migration Guide
508 `rate,` | `rate = .005,`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
509 `rate_annealing,` | `rate_annealing = 1e-06,`
May 15, 2015 @jessica0xdata Revise Migration Guide
510 `rate_decay,` | `rate_decay = 1.0,`
511 `momentum_start,` | `momentum_start = 0,`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
512 `momentum_ramp,` | `momentum_ramp = 1e+06,`
May 15, 2015 @jessica0xdata Revise Migration Guide
513 `momentum_stable,` | `momentum_stable = 0,`
514 `nesterov_accelerated_gradient,` | `nesterov_accelerated_gradient = true,`
515 `input_dropout_ratio,` | `input_dropout_ratio = 0.0,`
516 `hidden_dropout_ratios,` | `hidden_dropout_ratios,`
517 `l1,` | `l1 = 0.0,`
518 `l2,` | `l2 = 0.0,`
519 `max_w2,` | `max_w2 = Inf,`
520 `initial_weight_distribution,` | `initial_weight_distribution = c("UniformAdaptive","Uniform", "Normal"),`
521 `initial_weight_scale,` | `initial_weight_scale = 1.0,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
522 `loss,` | `loss = "Automatic", "CrossEntropy", "Quadratic", "Absolute", "Huber"),`
Dec 28, 2015 @jessica0xdata Updated with recent changes & fixed formatting
523 &nbsp; | `distribution = c("AUTO", "gaussian", "bernoulli", "multinomial", "poisson", "gamma", "tweedie", "laplace", "huber"),`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
524 &nbsp; | `tweedie_power = 1.5,`
May 15, 2015 @jessica0xdata Revise Migration Guide
525 `score_interval,` | `score_interval = 5,`
526 `score_training_samples,` | `score_training_samples = 10000l,`
527 `score_validation_samples,` | `score_validation_samples = 0l,`
528 `score_duty_cycle,` | `score_duty_cycle = 0.1,`
529 `classification_stop,` | `classification_stop = 0`
530 `regression_stop,` | `regression_stop = 1e-6,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
531 &nbsp; | `stopping_rounds = 5,`
Jun 3, 2016 Documentation: Added new stopping_metric option
532 &nbsp; | `stopping_metric = c("AUTO", "deviance", "logloss", "MSE", "AUC", "r2", "misclassification", "mean_per_class_error"),`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
533 &nbsp; | `stopping_tolerance = 0,`
May 15, 2015 @jessica0xdata Revise Migration Guide
534 `quiet_mode,` | `quiet_mode = false,`
535 `max_confusion_matrix_size,` | `max_confusion_matrix_size,`
536 `max_hit_ratio_k,` | `max_hit_ratio_k,`
537 `balance_classes,` | `balance_classes = false,`
538 `class_sampling_factors,` | `class_sampling_factors,`
539 `max_after_balance_size,` | `max_after_balance_size,`
540 `score_validation_sampling,` | `score_validation_sampling,`
541 `diagnostics,` | `diagnostics = true,`
542 `variable_importances,` | `variable_importances = false,`
543 `fast_mode,` | `fast_mode = true,`
544 `ignore_const_cols,` | `ignore_const_cols = true,`
545 `force_load_balance,` | `force_load_balance = true,`
546 `replicate_training_data,` | `replicate_training_data = true,`
547 `single_node_mode,` | `single_node_mode = false,`
548 `shuffle_training_data,` | `shuffle_training_data = false,`
549 `sparse,` | `sparse = false,`
550 `col_major,` | `col_major = false,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
551 `max_categorical_features,` | `max_categorical_features,`
May 15, 2015 @jessica0xdata Revise Migration Guide
552 `reproducible)` | `reproducible=FALSE,`
553 `average_activation` | `average_activation = 0,`
554 &nbsp; | `sparsity_beta = 0`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
555 &nbsp; | `export_weights_and_biases=FALSE,`
556 &nbsp; | `offset_column = NULL,`
557 &nbsp; | `weights_column = NULL,`
558 &nbsp; | `nfolds = 0,`
559 &nbsp; | `fold_column = NULL,`
560 &nbsp; | `fold_assignment = c("AUTO", "Random", "Modulo"),`
561 &nbsp; | `keep_cross_validation_predictions = FALSE)`
May 15, 2015 @jessica0xdata Revise Migration Guide
562
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
563 ### Output
May 15, 2015 @jessica0xdata Revise Migration Guide
564
565
566 The following table provides the component name in H2O, the corresponding component name in H2O 3.0 (if supported), and the model type (binomial, multinomial, or all). Many components are now included in `h2o.performance`; for more information, refer to [(`h2o.performance`)](#h2operf).
567
568 H2O Classic | H2O 3.0 | Model Type
569 ------------- | ------------- | -------------
570 `@model$priorDistribution`| &nbsp; | `all`
571 `@model$params` | `@allparameters` | `all`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
572 `@model$train_class_error` | `@model$training_metrics@metrics@$MSE` | `all`
Jun 25, 2015 @jessica0xdata Update to Porting/Migration doc to fix MSE typo for DL; added info re…
573 `@model$valid_class_error` | `@model$validation_metrics@$MSE` | `all`|
May 15, 2015 @jessica0xdata Revise Migration Guide
574 `@model$varimp` | `@model$_variable_importances` | `all`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
575 `@model$confusion` | `@model$training_metrics@metrics$cm$table` | `binomial` and `multinomial`
May 15, 2015 @jessica0xdata Revise Migration Guide
576 `@model$train_auc` | `@model$train_AUC` | `binomial`
577 &nbsp; | `@model$_validation_metrics` | `all`
578 &nbsp; | `@model$_model_summary` | `all`
579 &nbsp; | `@model$_scoring_history` | `all`
580
581
582 ---
583
584 <a name="DRF"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
585 ## Distributed Random Forest
May 15, 2015 @jessica0xdata Revise Migration Guide
586
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
587 ### Changes to DRF in H2O 3.0
May 15, 2015 @jessica0xdata Revise Migration Guide
588
Jun 13, 2015 @jessica0xdata One more update for intercept (R docs)
589 Distributed Random Forest (DRF) was represented as `h2o.randomForest(type="BigData", ...)` in H2O Classic. In H2O Classic, SpeeDRF (`type="fast"`) was not as accurate, especially for complex data with categoricals, and did not address regression problems. DRF (`type="BigData"`) was at least as accurate as SpeeDRF (`type="fast"`) and was the only algorithm that scaled to big data (data too large to fit on a single node).
May 15, 2015 @jessica0xdata Revise Migration Guide
590 In H2O 3.0, our plan is to improve the performance of DRF so that the data fits on a single node (optimally, for all cases), which will make SpeeDRF obsolete. Ultimately, the goal is provide a single algorithm that provides the "best of both worlds" for all datasets and use cases.
Jun 15, 2015 @jessica0xdata Update team list, add note re: multi-file zip, h2o.predict for DRF re…
591 Please note that H2O does not currently support the ability to specify the number of trees when using `h2o.predict` for a DRF model.
592
May 15, 2015 @jessica0xdata Revise Migration Guide
593
594 **Note**: H2O 3.0 only supports DRF. SpeeDRF is no longer supported. The functionality of DRF in H2O 3.0 is similar to DRF functionality in H2O.
595
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
596 ### Renamed DRF Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
597
598 The following parameters have been renamed, but retain the same functions:
599
600 H2O Classic Parameter Name | H2O 3.0 Parameter Name
601 -------------------|-----------------------
602 `data` | `training_frame`
603 `key` | `model_id`
604 `validation` | `validation_frame`
605 `sample.rate` | `sample_rate`
606 `ntree` | `ntrees`
607 `depth` | `max_depth`
608 `balance.classes` | `balance_classes`
609 `score.each.iteration` | `score_each_iteration`
610 `class.sampling.factors` | `class_sampling_factors`
611 `nodesize` | `min_rows`
612
613
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
614 ### Deprecated DRF Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
615
616 The following parameters have been removed:
617
618 - `classification`: This is now automatically inferred from the response type. To achieve classification with a 0/1 response column, explicitly convert the response to a factor (`as.factor()`).
619 - `importance`: Variable importances are now computed automatically and displayed in the model output.
620 - `holdout.fraction`: Specifying the fraction of the training data to hold out for validation is no longer supported.
621 - `doGrpSplit`: The bit-set group splitting of categorical variables is now the default.
622 - `verbose`: Infonrmation about tree splits and extra statistics is now included automatically in the stdout.
623 - `oobee`: The out-of-bag error estimate is now computed automatically (if no validation set is specified).
624 - `stat.type`: This parameter was used for SpeeDRF, which is no longer supported.
625 - `type`: This parameter was used for SpeeDRF, which is no longer supported.
626
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
627 ### New DRF Parameters
May 15, 2015 @jessica0xdata Revise Migration Guide
628
629 The following parameter has been added:
630
631 - `build_tree_one_node`: Run on a single node to use fewer CPUs.
632
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
633 ### DRF Algorithm Comparison
May 15, 2015 @jessica0xdata Revise Migration Guide
634
635 H2O Classic | H2O 3.0
636 ------------- | -------------
637 `h2o.randomForest <- function(x,` | `h2o.randomForest <- function(`
638 `x,` | `x,`
639 `y,` | `y,`
640 `data,` | `training_frame,`
641 `key="",` | `model_id,`
642 `validation,` | `validation_frame,`
643 `mtries = -1,` | `mtries = -1,`
May 21, 2015 @jessica0xdata Update default value for `sample_rate` (per Arno)
644 `sample.rate=2/3,` | `sample_rate = 0.632,`
May 15, 2015 @jessica0xdata Revise Migration Guide
645 &nbsp; | `build_tree_one_node = FALSE,`
646 `ntree=50` | `ntrees=50,`
647 `depth=20,` | `max_depth = 20,`
648 &nbsp; | `min_rows = 1,`
649 `nbins=20,` | `nbins = 20,`
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
650 &nbsp; | `nbins_top_level,`
651 &nbsp; | `nbins_cats =1024,`
652 &nbsp; | `binomial_double_trees = FALSE,`
May 15, 2015 @jessica0xdata Revise Migration Guide
653 `balance.classes = FALSE,` | `balance_classes = FALSE,`
654 `seed = -1,` | `seed`
655 `nodesize = 1,` |
656 `classification=TRUE,` |
657 `importance=FALSE,` |
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
658 &nbsp; | `weights_column = NULL,`
659 `nfolds=0,` | `nfolds = 0,`
660 &nbsp; | `fold_column = NULL,`
661 &nbsp; | `fold_assignment = c("AUTO", "Random", "Modulo"),`
662 &nbsp; | `keep_cross_validation_predictions = FALSE,`
663 &nbsp; | `score_each_iteration = FALSE,`
664 &nbsp; | `stopping_rounds = 0,`
Jun 3, 2016 Documentation: Added new stopping_metric option
665 &nbsp; | `stopping_metric = c("AUTO", "deviance", "logloss", "MSE", "AUC", "r2", "misclassification", "mean_per_class_error"), `
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
666 &nbsp; | `stopping_tolerance = 0.001)`
May 15, 2015 @jessica0xdata Revise Migration Guide
667 `holdout.fraction = 0,` |
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
668 `max.after.balance.size = 5,` | `max_after_balance_size,`
May 15, 2015 @jessica0xdata Revise Migration Guide
669 `class.sampling.factors = NULL,` | &nbsp;
670 `doGrpSplit = TRUE,` |
671 `verbose = FALSE,` |
672 `oobee = TRUE,` |
673 `stat.type = "ENTROPY",` |
674 `type = "fast")` |
675
676
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
677 ### Output
May 15, 2015 @jessica0xdata Revise Migration Guide
678
679
680 The following table provides the component name in H2O, the corresponding component name in H2O 3.0 (if supported), and the model type (binomial, multinomial, or all). Many components are now included in `h2o.performance`; for more information, refer to [(`h2o.performance`)](#h2operf).
681
682 H2O Classic | H2O 3.0 | Model Type
683 ------------- | ------------- | -------------
684 `@model$priorDistribution`| &nbsp; | `all`
685 `@model$params` | `@allparameters` | `all`
686 `@model$mse` | `@model$scoring_history` | `all`
687 `@model$forest` | `@model$model_summary` | `all`
688 `@model$classification` | &nbsp; | `all`
689 `@model$varimp` | `@model$variable_importances` | `all`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
690 `@model$confusion` | `@model$training_metrics@metrics$cm$table` | `binomial` and `multinomial`
691 `@model$auc` | `@model$training_metrics@metrics$AUC` | `binomial`
692 `@model$gini` | `@model$training_metrics@metrics$Gini` | `binomial`
May 15, 2015 @jessica0xdata Revise Migration Guide
693 `@model$best_cutoff` | &nbsp; | `binomial`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
694 `@model$F1` | `@model$training_metrics@metrics$thresholds_and_metric_scores$f1` | `binomial`
695 `@model$F2` | `@model$training_metrics@metrics$thresholds_and_metric_scores$f2` | `binomial`
696 `@model$accuracy` | `@model$training_metrics@metrics$thresholds_and_metric_scores$accuracy` | `binomial`
May 15, 2015 @jessica0xdata Revise Migration Guide
697 `@model$Error` | `@model$Error` | `binomial`
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
698 `@model$precision` | `@model$training_metrics@metrics$thresholds_and_metric_scores$precision` | `binomial`
699 `@model$recall` | `@model$training_metrics@metrics$thresholds_and_metric_scores$recall` | `binomial`
700 `@model$mcc` | `@model$training_metrics@metrics$thresholds_and_metric_scores$absolute_MCC` | `binomial`
701 `@model$max_per_class_err` | currently replaced by `@model$training_metrics@metrics$thresholds_and_metric_scores$min_per_class_correct` | `binomial`
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
702
703
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
704 ## Github Users
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
705
May 15, 2015 @jessica0xdata Revise Migration Guide
706 All users who pull directly from the H2O classic repo on Github should be aware that this repo will be renamed. To retain access to the original H2O (2.8.6.2 and prior) repository:
May 15, 2015 @jessica0xdata Docs update
707
708 **The simple way**
709
May 15, 2015 @jessica0xdata Revise Migration Guide
710 This is the easiest way to change your local repo and is recommended for most users.
May 15, 2015 @jessica0xdata Docs update
711
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
712 1. Enter `git remote -v` to view a list of your repositories.
713 2. Copy the address your H2O classic repo (refer to the text in brackets below - your address will vary depending on your connection method):
May 15, 2015 @jessica0xdata Docs update
714
715 ```
716 H2O_User-MBP:h2o H2O_User$ git remote -v
717 origin https://{H2O_User@github.com}/h2oai/h2o.git (fetch)
718 origin https://{H2O_User@github.com}/h2oai/h2o.git (push)
719 ```
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
720 3. Enter `git remote set-url origin {H2O_User@github.com}:h2oai/h2o-2.git`, where `{H2O_User@github.com}` represents the address copied in the previous step.
May 15, 2015 @jessica0xdata Docs update
721
722 **The more complicated way**
723
724 This method involves editing the Github config file and should only be attempted by users who are confident enough with their knowledge of Github to do so.
725
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
726 1. Enter `vim .git/config`.
727 2. Look for the `[remote "origin"]` section:
May 15, 2015 @jessica0xdata Docs update
728
729 ```
730 [remote "origin"]
731 url = https://H2O_User@github.com/h2oai/h2o.git
732 fetch = +refs/heads/*:refs/remotes/origin/*
733 ```
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
734 3. In the `url =` line, change `h2o.git` to `h2o-2.git`.
735 4. Save the changes.
May 14, 2015 @jessica0xdata Add Migration doc (per request by Ray & Parag)
736
May 22, 2015 @jessica0xdata Update Migration.md
737 The latest version of H2O is stored in the `h2o-3` repository. All previous links to this repo will still work, but if you would like to manually update your Github configuration, follow the instructions above, replacing `h2o-2` with `h2o-3`.