Permalink
Newer
Older
100644 1211 lines (683 sloc) 67.2 KB
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1 # Flow Web UI ...
2
3 >**Note**: This topic is no longer being maintained. Refer to the [Using Flow - H2O's Web UI](https://github.com/h2oai/h2o-3/blob/master/h2o-docs/src/product/flow.rst) topic for the most up-to-date documentation.
Mar 11, 2015 @jessica0xdata Add GLM Example flow, remove ToC from Getting Started with Flow, and …
4
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
5 H2O Flow is an open-source user interface for H2O. It is a web-based interactive environment that allows you to combine code execution, text, mathematics, plots, and rich media in a single document.
Mar 17, 2015 @jessica0xdata Add intro based on Prithvi's blog post to Getting Started with Flow
6
7 With H2O Flow, you can capture, rerun, annotate, present, and share your workflow. H2O Flow allows you to use H2O interactively to import files, build models, and iteratively improve them. Based on your models, you can make predictions and add rich text to create vignettes of your work - all within Flow's browser-based environment.
8
9 Flow's hybrid user interface seamlessly blends command-line computing with a modern graphical user interface. However, rather than displaying output as plain text, Flow provides a point-and-click user interface for every H2O operation. It allows you to access any H2O object in the form of well-organized tabular data.
10
11 H2O Flow sends commands to H2O as a sequence of executable cells. The cells can be modified, rearranged, or saved to a library. Each cell contains an input field that allows you to enter commands, define functions, call other functions, and access other cells or objects on the page. When you execute the cell, the output is a graphical object, which can be inspected to view additional details.
12
May 15, 2015 @jessica0xdata Batch docs update
13 While H2O Flow supports REST API, R scripts, and CoffeeScript, no programming experience is required to run H2O Flow. You can click your way through any H2O operation without ever writing a single line of code. You can even disable the input cells to run H2O Flow using only the GUI. H2O Flow is designed to guide you every step of the way, by providing input prompts, interactive help, and example flows.
Mar 11, 2015 @jessica0xdata Add GLM Example flow, remove ToC from Getting Started with Flow, and …
14
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
15 ## Introduction
May 14, 2015 @jessica0xdata Fix faulty headings in readme
16
17 This guide will walk you through how to use H2O's web UI, H2O Flow. To view a demo video of H2O Flow, click <a href="https://www.youtube.com/watch?feature=player_embedded&v=wzeuFfbW7WE" target="_blank">here</a>.
18
19
Jan 15, 2015 @jessica0xdata 1st flow doc update
20 ---
21
22 <a name="GetHelp"></a>
May 14, 2015 @jessica0xdata Update README.md
23 ## Getting Help
Jan 15, 2015 @jessica0xdata 1st flow doc update
24 ---
25
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
26 First, let's go over the basics. Type `h` to view a list of helpful shortcuts.
Jan 15, 2015 @jessica0xdata 1st flow doc update
27
28 The following help window displays:
29
30 ![help menu](https://raw.githubusercontent.com/h2oai/h2o/master/docs/Flow-images/Shortcuts.png)
31
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
32 To close this window, click the **X** in the upper-right corner, or click the **Close** button in the lower-right corner. You can also click behind the window to close it. You can also access this list of shortcuts by clicking the **Help** menu and selecting **Keyboard Shortcuts**.
Jan 15, 2015 @jessica0xdata 1st flow doc update
33
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
34 For additional help, click **Help** > **Assist Me** or click the **Assist Me!** button in the row of buttons below the menus.
Jan 15, 2015 @jessica0xdata 1st flow doc update
35
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
36 ![Assist Me](images/Flow_AssistMeButton.png)
37
38 You can also type `assist` in a blank cell and press **Ctrl+Enter**. A list of common tasks displays to help you find the correct command.
39
40 ![Assist Me links](images/Flow_assist.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
41
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
42 There are multiple resources to help you get started with Flow in the **Help** sidebar.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
43
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
44 >**Note**: To hide the sidebar, click the **>>** button above it. ![Flow - Hide Sidebar](images/Flow_SidebarHide.png)
45 >
46 >To display the sidebar if it is hidden, click the **<<** button. ![Flow - Hide Sidebar](images/Flow_SidebarDisplay.png)
47
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
48 To access this documentation, select the **Flow Web UI...** link below the **General** heading in the Help sidebar.
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
49
50 You can also explore the pre-configured flows available in H2O Flow for a demonstration of how to create a flow. To view the example flows:
51
52 - Click the **view example Flows** link below the **Quickstart Videos** button in the **Help** sidebar
53 ![Flow - View Example Flows link](images/Flow_ViewExampleFlows.png)
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
54
55 or
56
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
57 - Click the **Browse installed packs...** link in the **Packs** subsection of the **Help** sidebar. Click the **examples** folder and select the example flow from the list.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
58
59 ![Flow Packs](images/Flow_ExampleFlows.png)
60
61 If you have a flow currently open, a confirmation window appears asking if the current notebook should be replaced. To load the example flow, click the **Load Notebook** button.
62
63 To view the REST API documentation, click the **Help** tab in the sidebar and then select the type of REST API documentation (**Routes** or **Schemas**).
64
65 ![REST API documentation](images/Flow_REST_docs.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
66
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
67 Before getting started with H2O Flow, make sure you understand the different cell modes. Certain actions can only be performed when the cell is in a specific mode.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
68
Jan 15, 2015 @jessica0xdata 1st flow doc update
69 ---
70
71 <a name="Cell"></a>
May 14, 2015 @jessica0xdata Update README.md
72 ## Understanding Cell Modes
Jan 15, 2015 @jessica0xdata 1st flow doc update
73
Apr 2, 2015 @jessica0xdata Remove word2vec info and deprecated parameters, fix parentheses, and …
74 There are two modes for cells: edit and command.
75
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
76 <a name="EditMode"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
77 ### Using Edit Mode
Apr 2, 2015 @jessica0xdata Remove word2vec info and deprecated parameters, fix parentheses, and …
78 In edit mode, the cell is yellow with a blinking bar to indicate where text can be entered and there is an orange flag to the left of the cell.
Jan 15, 2015 @jessica0xdata 1st flow doc update
79
May 5, 2015 @jessica0xdata Update README.md
80
Jan 16, 2015 @jessica0xdata Fixed images
81 ![Edit Mode](images/Flow_EditMode.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
82
Apr 28, 2015 @jessica0xdata Update README.md
83 <a name="CmdMode"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
84 ### Using Command Mode
Jan 15, 2015 @jessica0xdata 1st flow doc update
85 In command mode, the flag is yellow. The flag also indicates the cell's format:
86
Mar 5, 2015 @jessica0xdata Added updated images for Getting Started with Flow doc and reformatte…
87 - **MD**: Markdown
88
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
89 >**Note**: Markdown formatting is not applied until you run the cell by:
90 >
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
91 >- clicking the **Run** button ![Flow - Run Button](images/Flow_RunButton.png)
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
92 > or
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
93
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
94 >- pressing **Ctrl+Enter**
Jan 15, 2015 @jessica0xdata 1st flow doc update
95
Jan 16, 2015 @jessica0xdata Image fix
96 ![Flow - Markdown](images/Flow_markdown.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
97
May 5, 2015 @jessica0xdata Update README.md
98 - **CS**: Code (default)
Jan 15, 2015 @jessica0xdata 1st flow doc update
99
Jan 16, 2015 @jessica0xdata Fixed images
100 ![Flow - Code](images/Flow_parse_code_ex.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
101
May 5, 2015 @jessica0xdata Update README.md
102 - **RAW**: Raw format (for code comments)
Jan 15, 2015 @jessica0xdata 1st flow doc update
103
Jan 16, 2015 @jessica0xdata Fixed images
104 ![Flow - Raw](images/Flow_raw.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
105
May 5, 2015 @jessica0xdata Update README.md
106 - **H[1-6]**: Heading level (where 1 is a first-level heading)
Jan 15, 2015 @jessica0xdata 1st flow doc update
107
Jan 16, 2015 @jessica0xdata Fixed images
108 ![Flow - Heading Levels](images/Flow_headinglevels.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
109
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
110 >**NOTE**: If there is an error in the cell, the flag is red.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
111
112 ![Cell error](images/Flow_redflag.png)
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
113
114 If the cell is executing commands, the flag is teal. The flag returns to yellow when the task is complete.
115
116 ![Cell executing](images/Flow_cellmode_runningflag.png)
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
117
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
118 ### Changing Cell Formats
May 5, 2015 @jessica0xdata Update README.md
119
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
120 To change the cell's format (for example, from code to Markdown), make sure you are in command (not edit) mode and that the cell you want to change is selected. The easiest way to do this is to click on the flag to the left of the cell. Enter the keyboard shortcut for the format you want to use. The flag's text changes to display the current format.
May 5, 2015 @jessica0xdata Update README.md
121
122 Cell Mode | Keyboard Shortcut
123 ------------- | -----------------
124 Code | `y`
125 Markdown | `m`
126 Raw text | `r`
127 Heading 1 | `1`
128 Heading 2 | `2`
129 Heading 3 | `3`
130 Heading 4 | `4`
131 Heading 5 | `5`
132 Heading 6 | `6`
Apr 2, 2015 @jessica0xdata Remove word2vec info and deprecated parameters, fix parentheses, and …
133
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
134
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
135 ### Running Cells
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
136
137
138 The series of buttons at the top of the page below the menus run cells in a flow.
139
140 ![Flow - Run Buttons](images/Flow_RunButtons.png)
141
142 - To run all cells in the flow, click the **Flow** menu, then click **Run All Cells**.
143 - To run the current cell and all subsequent cells, click the **Flow** menu, then click **Run All Cells Below**.
144 - To run an individual cell in a flow, confirm the cell is in [Edit Mode](#EditMode), then:
145
146 - press **Ctrl+Enter**
147
148 or
149
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
150 - click the **Run** button ![Flow - Run Button](images/Flow_RunButton.png)
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
151
152
153
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
154 ### Running Flows
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
155 When you run the flow, a progress bar indicates the current status of the flow. You can cancel the currently running flow by clicking the **Stop** button in the progress bar.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
156
157 ![Flow Progress Bar](images/Flow_progressbar.png)
158
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
159 When the flow is complete, a message displays in the upper right.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
160
161 ![Flow - Completed Successfully](images/Flow_run_pass.png)
162 ![Flow - Did Not Complete](images/Flow_run_fail.png)
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
163
164 >**Note**: If there is an error in the flow, H2O Flow stops at the cell that contains the error.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
165
Apr 2, 2015 @jessica0xdata Remove word2vec info and deprecated parameters, fix parentheses, and …
166
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
167 ### Using Keyboard Shortcuts
Apr 2, 2015 @jessica0xdata Remove word2vec info and deprecated parameters, fix parentheses, and …
168
Jan 15, 2015 @jessica0xdata 1st flow doc update
169 Here are some important keyboard shortcuts to remember:
170
171 - Click a cell and press **Enter** to enter edit mode, which allows you to change the contents of a cell.
172 - To exit edit mode, press **Esc**.
173 - To execute the contents of a cell, press the **Ctrl** and **Enter** buttons at the same time.
174
Apr 28, 2015 @jessica0xdata Update README.md
175 The following commands must be entered in [command mode](#CmdMode).
Jan 15, 2015 @jessica0xdata 1st flow doc update
176
177 - To add a new cell *above* the current cell, press **a**.
178 - To add a new cell *below* the current cell, press **b**.
179 - To delete the current cell, press the **d** key *twice*. (**dd**).
180
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
181 You can view these shortcuts by clicking **Help** > **Keyboard Shortcuts** or by clicking the **Help** tab in the sidebar.
182
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
183 ### Using Variables in Cells
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
184
185 Variables can be used to store information such as download locations. To use a variable in Flow:
186
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
187 1. Define the variable in a code cell (for example, `locA = "https://h2o-public-test-data.s3.amazonaws.com/bigdata/laptop/kdd2009/small-churn/kdd_train.csv"`).
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
188 ![Flow variable definition](images/Flow_VariableDefinition.png)
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
189 2. Run the cell. H2O validates the variable.
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
190 ![Flow variable validation](images/Flow_VariableValidation.png)
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
191 3. Use the variable in another code cell (for example, `importFiles [locA]`).
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
192 ![Flow variable example](images/Flow_VariableExample.png)
193 To further simplify your workflow, you can save the cells containing the variables and definitions as [clips](#Clips).
194
Apr 2, 2015 @jessica0xdata Remove word2vec info and deprecated parameters, fix parentheses, and …
195
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
196 ### Using Flow Buttons
Jan 15, 2015 @jessica0xdata 1st flow doc update
197 There are also a series of buttons at the top of the page below the flow name that allow you to save the current flow, add a new cell, move cells up or down, run the current cell, and cut, copy, or paste the current cell. If you hover over the button, a description of the button's function displays.
198
Jan 16, 2015 @jessica0xdata Fixed images
199 ![Flow buttons](images/Flow_buttons.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
200
May 5, 2015 @jessica0xdata Update README.md
201 You can also use the menus at the top of the screen to edit the order of the cells, toggle specific format types (such as input or output), create models, or score models. You can also access troubleshooting information or obtain help with Flow.
Jan 16, 2015 @jessica0xdata Fixed images
202 ![Flow menus](images/Flow_menus.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
203
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
204 >**Note**: To disable the code input and use H2O Flow strictly as a GUI, click the **Cell** menu, then **Toggle Cell Input**.
Jan 15, 2015 @jessica0xdata 1st flow doc update
205
206 Now that you are familiar with the cell modes, let's import some data.
207
208 ---
209
210 <a name="ImportData"></a>
May 15, 2015 @jessica0xdata Batch docs update
211 # ... Importing Data
Jan 15, 2015 @jessica0xdata 1st flow doc update
212
Jun 24, 2016 Remove "h2oclassic" path for example datasets
213 If you don't have any data of your own to work with, you can find some example datasets at <a href="http://data.h2o.ai" target="_blank">http://data.h2o.ai</a>.
Mar 5, 2015 @jessica0xdata Added updated images for Getting Started with Flow doc and reformatte…
214
Jan 15, 2015 @jessica0xdata 1st flow doc update
215
216 There are multiple ways to import data in H2O flow:
217
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
218 - Click the **Assist Me!** button in the row of buttons below the menus, then click the **importFiles** link. Enter the file path in the auto-completing **Search** entry field and press **Enter**. Select the file from the search results and confirm it by clicking the **Add All** link.
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
219 ![Flow - Import Files Auto-Suggest](images/Flow_Import_AutoSuggest.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
220
Apr 16, 2015 @jessica0xdata Update README.md
221 - In a blank cell, select the CS format, then enter `importFiles ["path/filename.format"]` (where `path/filename.format` represents the complete file path to the file, including the full file name. The file path can be a local file path or a website address.
Apr 4, 2016 PUBDEV-2608 Clarify diff between upload_file and import_file
222
223 >**Note**: For S3 file locations, use the format `importFiles [ "s3n:/path/to/bucket/file/file.tab.gz" ]`
Dec 29, 2015 @jessica0xdata Added link to R import file example
224
Apr 4, 2016 PUBDEV-2608 Clarify diff between upload_file and import_file
225 - For an example of how to import a single file or a directory in R, refer to the following [example](https://github.com/h2oai/h2o-2/blob/master/R/tests/testdir_hdfs/runit_s3n_basic.R).
Jan 15, 2015 @jessica0xdata 1st flow doc update
226
227 After selecting the file to import, the file path displays in the "Search Results" section. To import a single file, click the plus sign next to the file. To import all files in the search results, click the **Add all** link. The files selected for import display in the "Selected Files" section.
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
228 ![Import Files](images/Flow_import.png)
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
229
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
230 >**Note**: If the file is compressed, it will only be read using a single thread. For best performance, we recommend uncompressing the file before importing, as this will allow use of the faster multithreaded distributed parallel reader during import. Please note that .zip files containing multiple files are not currently supported.
Jan 15, 2015 @jessica0xdata 1st flow doc update
231
May 13, 2015 @jessica0xdata Update README.md
232
Jan 15, 2015 @jessica0xdata 1st flow doc update
233
234 - To import the selected file(s), click the **Import** button.
235
236 - To remove all files from the "Selected Files" list, click the **Clear All** link.
237
238 - To remove a specific file, click the **X** next to the file path.
239
240 After you click the **Import** button, the raw code for the current job displays. A summary displays the results of the file import, including the number of imported files and their Network File System (nfs) locations.
241
Jan 16, 2015 @jessica0xdata Fixed images
242 ![Import Files - Results](images/Flow_import_results.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
243
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
244 ## Uploading Data
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
245
Apr 28, 2015 @jessica0xdata Update README.md
246 To upload a local file, click the **Data** menu and select **Upload File...**. Click the **Choose File** button, select the file, click the **Choose** button, then click the **Upload** button.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
247
248 ![File Upload Pop-Up](images/Flow_UploadDataset.png)
249
250 When the file has uploaded successfully, a message displays in the upper right and the **Setup Parse** cell displays.
251
252
253 ![File Upload Successful](images/Flow_FileUploadPass.png)
254
255 Ok, now that your data is available in H2O Flow, let's move on to the next step: parsing. Click the **Parse these files** button to continue.
Jan 15, 2015 @jessica0xdata 1st flow doc update
256
257 ---
258
259 <a name="ParseData"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
260 ## Parsing Data
Jan 15, 2015 @jessica0xdata 1st flow doc update
261
262 After you have imported your data, parse the data.
263
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
264 ![Flow - Parse options](images/Flow_parse_setup.png)
265
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
266 The read-only **Sources** field displays the file path for the imported data selected for parsing.
267
268 The **ID** contains the auto-generated name for the parsed data (by default, the file name of the imported file with `.hex` as the file extension). Use the default name or enter a custom name in this field.
269
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
270 Select the parser type (if necessary) from the drop-down **Parser** list. For most data parsing, H2O automatically recognizes the data type, so the default settings typically do not need to be changed. The following options are available:
Jan 15, 2015 @jessica0xdata 1st flow doc update
271
272 - Auto
May 19, 2015 @jessica0xdata Doc updates
273 - ARFF
Jan 15, 2015 @jessica0xdata 1st flow doc update
274 - XLS
May 19, 2015 @jessica0xdata Doc updates
275 - XLSX
Jan 15, 2015 @jessica0xdata 1st flow doc update
276 - CSV
277 - SVMLight
278
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
279 >**Note**: For SVMLight data, the column indices must be >= 1 and the columns must be in ascending order.
Jun 11, 2015 @jessica0xdata Add info re:SVMLight col. indices req (>= 1); update Deep Learning tu…
280
Jan 15, 2015 @jessica0xdata 1st flow doc update
281 If a separator or delimiter is used, select it from the **Separator** list.
282
283 Select a column header option, if applicable:
284
285 - **Auto**: Automatically detect header types.
286 - **First row contains column names**: Specify heading as column names.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
287 - **First row contains data**: Specify heading as data. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
288
289 Select any necessary additional options:
290
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
291 - **Enable single quotes as a field quotation character**: Treat single quote marks (also known as apostrophes) in the data as a character, rather than an enum. This option is not selected by default.
292 - **Delete on done**: Check this checkbox to delete the imported data after parsing. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
293
Jun 18, 2015 @jessica0xdata Doc changes for parsing changes (pagination)
294 A preview of the data displays in the "Edit Column Names and Types" section.
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
295
Jan 15, 2015 @jessica0xdata 1st flow doc update
296
Jun 18, 2015 @jessica0xdata Doc changes for parsing changes (pagination)
297 To change or add a column name, edit or enter the text in the column's entry field. In the screenshot below, the entry field for column 16 is highlighted in red.
298
299 ![Flow - Column Name Entry Field](images/Flow_ColNameEntry.png)
300
301 To change the column type, select the drop-down list to the right of the column name entry field and select the data type. The options are:
302
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
303 - Unknown
304 - Numeric
305 - Enum
306 - Time
307 - UUID
308 - String
309 - Invalid
310
Jun 18, 2015 @jessica0xdata Doc changes for parsing changes (pagination)
311 You can search for a column by entering it in the *Search by column name...* entry field above the first column name entry field. As you type, H2O displays the columns that match the specified search terms.
312
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
313 **Note**: Only custom column names are searchable. Default column names cannot be searched.
314
Jun 18, 2015 @jessica0xdata Doc changes for parsing changes (pagination)
315 To navigate the data preview, click the **<- Previous page** or **-> Next page** buttons.
316
317 ![Flow - Pagination buttons](images/Flow_PageButtons.png)
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
318
319 After making your selections, click the **Parse** button.
320
Jan 15, 2015 @jessica0xdata 1st flow doc update
321 After you click the **Parse** button, the code for the current job displays.
322
Jan 16, 2015 @jessica0xdata Fixed images
323 ![Flow - Parse code](images/Flow_parse_code_ex.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
324
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
325 Since we've submitted a couple of jobs (data import & parse) to H2O now, let's take a moment to learn more about jobs in H2O.
326
Jan 15, 2015 @jessica0xdata 1st flow doc update
327 ---
328
329 <a name="ViewJobs"></a>
May 14, 2015 @jessica0xdata Update README.md
330 ## Viewing Jobs
Jan 15, 2015 @jessica0xdata 1st flow doc update
331
Apr 28, 2015 @jessica0xdata Update README.md
332 Any command (such as `importFiles`) you enter in H2O is submitted as a job, which is associated with a key. The key identifies the job within H2O and is used as a reference.
Jan 15, 2015 @jessica0xdata 1st flow doc update
333
May 14, 2015 @jessica0xdata Update README.md
334 ### Viewing All Jobs
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
335
Apr 28, 2015 @jessica0xdata Update README.md
336 To view all jobs, click the **Admin** menu, then click **Jobs**, or enter `getJobs` in a cell in CS mode.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
337
338 ![View Jobs](images/Flow_getJobs.png)
339
340 The following information displays:
341
Apr 28, 2015 @jessica0xdata Update README.md
342 - Type (for example, `Frame` or `Model`)
343 - Link to the object
344 - Description of the job type (for example, `Parse` or `GBM`)
345 - Start time
346 - End time
347 - Run time
348
349 To refresh this information, click the **Refresh** button. To view the details of the job, click the **View** button.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
350
May 14, 2015 @jessica0xdata Update README.md
351 ### Viewing Specific Jobs
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
352
Apr 28, 2015 @jessica0xdata Update README.md
353 To view a specific job, click the link in the "Destination" column.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
354
355 ![View Job - Model](images/Flow_ViewJob_Model.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
356
357 The following information displays:
358
Apr 28, 2015 @jessica0xdata Update README.md
359 - Type (for example, `Frame`)
360 - Link to object (key)
361 - Description (for example, `Parse`)
362 - Status
Jan 15, 2015 @jessica0xdata 1st flow doc update
363 - Run time
Apr 28, 2015 @jessica0xdata Update README.md
364 - Progress
Jan 15, 2015 @jessica0xdata 1st flow doc update
365
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
366 >**NOTE**: For a better understanding of how jobs work, make sure to review the [Viewing Frames](#ViewFrames) section as well.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
367
368 Ok, now that you understand how to find jobs in H2O, let's submit a new one by building a model.
Jan 15, 2015 @jessica0xdata 1st flow doc update
369
370 ---
371
372 <a name="BuildModel"></a>
May 15, 2015 @jessica0xdata Batch docs update
373 # ... Building Models
Jan 15, 2015 @jessica0xdata 1st flow doc update
374
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
375 To build a model:
376
Jun 12, 2015 @jessica0xdata Updates for UI changes
377 - Click the **Assist Me!** button in the row of buttons below the menus and select **buildModel**
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
378
379 or
380
381 - Click the **Assist Me!** button, select **getFrames**, then click the **Build Model...** button below the parsed .hex data set
382
Apr 28, 2015 @jessica0xdata Update README.md
383 or
384
385 - Click the **View** button after parsing data, then click the **Build Model** button
386
387 or
388
389 - Click the drop-down **Model** menu and select the model type from the list
390
391
Jun 12, 2015 @jessica0xdata Updates for UI changes
392 The **Build Model...** button can be accessed from any page containing the .hex key for the parsed data (for example, `getJobs` > `getFrame`). The following image depicts the K-Means model type. Available options vary depending on model type.
May 19, 2015 @jessica0xdata Doc updates
393
394
395 ![Model Builder](images/Flow_ModelBuilder.png)
396
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
397
398 In the **Build a Model** cell, select an algorithm from the drop-down menu:
Jan 15, 2015 @jessica0xdata 1st flow doc update
399
400 <a name="Kmeans"></a>
Apr 28, 2015 @jessica0xdata Update README.md
401 - **K-means**: Create a K-Means model.
402
Jan 15, 2015 @jessica0xdata 1st flow doc update
403 <a name="GLM"></a>
Apr 28, 2015 @jessica0xdata Update README.md
404 - **Generalized Linear Model**: Create a Generalized Linear model.
Jan 15, 2015 @jessica0xdata 1st flow doc update
405
Apr 28, 2015 @jessica0xdata Update README.md
406 <a name="DRF"></a>
Apr 8, 2015 @jessica0xdata Update README.md
407 - **Distributed RF**: Create a distributed Random Forest model.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
408
Apr 28, 2015 @jessica0xdata Update README.md
409 <a name="NB"></a>
May 19, 2015 @jessica0xdata Doc updates
410 - **Naïve Bayes**: Create a Naïve Bayes model.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
411
Apr 28, 2015 @jessica0xdata Update README.md
412 <a name="PCA"></a>
Apr 8, 2015 @jessica0xdata Update README.md
413 - **Principal Component Analysis**: Create a Principal Components Analysis model for modeling without regularization or performing dimensionality reduction.
Jan 15, 2015 @jessica0xdata 1st flow doc update
414
415 <a name="GBM"></a>
Apr 8, 2015 @jessica0xdata Update README.md
416 - **Gradient Boosting Machine**: Create a Gradient Boosted model
Jan 15, 2015 @jessica0xdata 1st flow doc update
417
418 <a name="DL"></a>
Apr 28, 2015 @jessica0xdata Update README.md
419 - **Deep Learning**: Create a Deep Learning model.
Jan 15, 2015 @jessica0xdata 1st flow doc update
420
Jul 4, 2015 @jessica0xdata Update for build 26
421
Jan 15, 2015 @jessica0xdata 1st flow doc update
422 The available options vary depending on the selected model. If an option is only available for a specific model type, the model type is listed. If no model type is specified, the option is applicable to all model types.
423
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
424 - **model_id**: (Optional) Enter a custom name for the model to use as a reference. By default, H2O automatically generates an ID containing the model type (for example, `gbm-6f6bdc8b-ccbc-474a-b590-4579eea44596`).
Jan 15, 2015 @jessica0xdata 1st flow doc update
425
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
426 - **training_frame**: (Required) Select the dataset used to build the model.
Apr 28, 2015 @jessica0xdata Update README.md
427
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
428 - **validation_frame**: (Optional) Select the dataset used to evaluate the accuracy of the model.
Jan 15, 2015 @jessica0xdata 1st flow doc update
429
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
430 - **nfolds**: ([GLM](#GLM), [GBM](#GBM), [DL](#DL), [DRF](#DRF)) Specify the number of folds for cross-validation.
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
431
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
432 - **response_column**: (Required for [GLM](#GLM), [GBM](#GBM), [DL](#DL), [DRF](#DRF), [Naïve Bayes](#NB)) Select the column to use as the independent variable.
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
433
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
434 - **ignored_columns**: (Optional) Click the checkbox next to a column name to add it to the list of columns excluded from the model. To add all columns, click the **All** button. To remove a column from the list of ignored columns, click the X next to the column name. To remove all columns from the list of ignored columns, click the **None** button. To search for a specific column, type the column name in the **Search** field above the column list. To only show columns with a specific percentage of missing values, specify the percentage in the **Only show columns with more than 0% missing values** field. To change the selections for the hidden columns, use the **Select Visible** or **Deselect Visible** buttons.
Jan 15, 2015 @jessica0xdata 1st flow doc update
435
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
436 - **ignore\_const\_cols**: (Optional) Check this checkbox to ignore constant training columns, since no information can be gained from them. This option is selected by default.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
437
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
438 - **transform**: ([PCA](#PCA)) Select the transformation method for the training data: None, Standardize, Normalize, Demean, or Descale.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
439
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
440 - **pca_method**: ([PCA](#PCA)) Select the algorithm to use for computing the principal components:
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
441 - *GramSVD*: Uses a distributed computation of the Gram matrix, followed by a local SVD using the JAMA package
442 - *Power*: Computes the SVD using the power iteration method
Oct 22, 2015 @jessica0xdata Updates for Recent Changes; PUBDEV-2247; Timeline desc.; new "Randomi…
443 - *Randomized*: Uses randomized subspace iteration method
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
444 - *GLRM*: Fits a generalized low-rank model with L2 loss function and no regularization and solves for the SVD using local matrix algebra
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
445
Nov 4, 2015 @jessica0xdata Updates for multinomial in GLM
446 - **family**: ([GLM](#GLM)) Select the model type (Gaussian, Binomial, Multinomial, Poisson, Gamma, or Tweedie).
Jan 15, 2015 @jessica0xdata 1st flow doc update
447
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
448 - **solver**: ([GLM](#GLM)) Select the solver to use (AUTO, IRLSM, L\_BFGS, COORDINATE\_DESCENT\_NAIVE, or COORDINATE\_DESCENT). IRLSM is fast on on problems with a small number of predictors and for lambda-search with L1 penalty, while [L_BFGS](http://cran.r-project.org/web/packages/lbfgs/vignettes/Vignette.pdf) scales better for datasets with many columns. COORDINATE\_DESCENT is IRLSM with the covariance updates version of cyclical coordinate descent in the innermost loop. COORDINATE\_DESCENT\_NAIVE is IRLSM with the naive updates version of cyclical coordinate descent in the innermost loop. COORDINATE\_DESCENT\_NAIVE and COORDINATE\_DESCENT are currently experimental.
Jan 15, 2015 @jessica0xdata 1st flow doc update
449
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
450 - **link**: ([GLM](#GLM)) Select a link function (Identity, Family_Default, Logit, Log, Inverse, or Tweedie).
Jan 15, 2015 @jessica0xdata 1st flow doc update
451
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
452 - **alpha**: ([GLM](#GLM)) Specify the regularization distribution between L2 and L2.
Jan 15, 2015 @jessica0xdata 1st flow doc update
453
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
454 - **lambda**: ([GLM](#GLM)) Specify the regularization strength.
May 29, 2015 @jessica0xdata Doc update for Arno's changes (nbins_cats)
455
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
456 - **lambda_search**: ([GLM](#GLM)) Check this checkbox to enable lambda search, starting with lambda max. The given lambda is then interpreted as lambda min.
May 29, 2015 @jessica0xdata Doc update
457
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
458 - **non-negative**: ([GLM](#GLM)) To force coefficients to be non-negative, check this checkbox.
Jan 15, 2015 @jessica0xdata 1st flow doc update
459
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
460 - **standardize**: ([K-Means](#Kmeans), [GLM](#GLM)) To standardize the numeric columns to have mean of zero and unit variance, check this checkbox. Standardization is highly recommended; if you do not use standardization, the results can include components that are dominated by variables that appear to have larger variances relative to other attributes as a matter of scale, rather than true contribution. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
461
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
462 - **beta_constraints**: ([GLM](#GLM)) To use beta constraints, select a dataset from the drop-down menu. The selected frame is used to constraint the coefficient vector to provide upper and lower bounds.
May 29, 2015 @jessica0xdata Update for Arno's changes (binomial_double_trees); some cleanup
463
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
464 - **ntrees**: ([GBM](#GBM), [DRF](#DRF)) Specify the number of trees.
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
465
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
466 - **max\_depth**: ([GBM](#GBM), [DRF](#DRF)) Specify the maximum tree depth.
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
467
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
468 - **min\_rows**: ([GBM](#GBM), [DRF](#DRF)) Specify the minimum number of observations for a leaf ("nodesize" in R).
Jan 15, 2015 @jessica0xdata 1st flow doc update
469
Sep 29, 2015 @jessica0xdata Updates to readme & Docker doc
470 - **nbins**: ([GBM](#GBM), [DRF](#DRF)) (Numerical [real/int] only) Specify the minimum number of bins for the histogram to build, then split at the best point.
Apr 17, 2015 @jessica0xdata Update README.md
471
Oct 5, 2015 @jessica0xdata Added more info re:`nbins_cats`
472 - **nbins_cats**: ([GBM](#GBM), [DRF](#DRF)) (Categorical [factors/enums] only) Specify the maximum number of bins for the histogram to build, then split at the best point. Higher values can lead to more overfitting. The levels are ordered alphabetically; if there are more levels than bins, adjacent levels share bins. This value has a more significant impact on model fitness than **nbins**. Larger values may increase runtime, especially for deep trees and large clusters, so tuning may be required to find the optimal value for your configuration.
Apr 17, 2015 @jessica0xdata Update README.md
473
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
474 - **learn_rate**: ([GBM](#GBM)) Specify the learning rate. The range is 0.0 to 1.0.
Apr 17, 2015 @jessica0xdata Update README.md
475
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
476 - **distribution**: ([GBM](#GBM), [DL](#DL)) Select the distribution type from the drop-down list. The options are auto, bernoulli, multinomial, gaussian, poisson, gamma, or tweedie.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
477
Nov 5, 2015 @jessica0xdata Updates to docs
478 - **sample_rate**: ([GBM](#GBM), [DRF](#DRF)) Specify the row sampling rate (x-axis). The range is 0.0 to 1.0. Higher values may improve training accuracy. Test accuracy improves when either columns or rows are sampled. For details, refer to "Stochastic Gradient Boosting" ([Friedman, 1999](https://statweb.stanford.edu/~jhf/ftp/stobst.pdf)).
Oct 6, 2015 @jessica0xdata Added varimp for Py, `col_sample_rate` and `sample_rate`
479
Dec 17, 2015 @jessica0xdata Updates for `col_sample_rate_per_tree`
480 - **col\_sample_rate**: ([GBM](#GBM), [DRF](#DRF)) Specify the column sampling rate (y-axis). The range is 0.0 to 1.0. Higher values may improve training accuracy. Test accuracy improves when either columns or rows are sampled. For details, refer to "Stochastic Gradient Boosting" ([Friedman, 1999](https://statweb.stanford.edu/~jhf/ftp/stobst.pdf)).
Oct 6, 2015 @jessica0xdata Added varimp for Py, `col_sample_rate` and `sample_rate`
481
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
482 - **mtries**: ([DRF](#DRF)) Specify the columns to randomly select at each level. If the default value of `-1` is used, the number of variables is the square root of the number of columns for classification and p/3 for regression (where p is the number of predictors).
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
483
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
484 - **binomial\_double\_trees**: ([DRF](#DRF)) (Binary classification only) Build twice as many trees (one per class). Enabling this option can lead to higher accuracy, while disabling can result in faster model building. This option is disabled by default.
May 29, 2015 @jessica0xdata Doc update
485
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
486 - **score\_each\_iteration**: ([K-Means](#Kmeans), [DRF](#DRF), [Naïve Bayes](#NB), [PCA](#PCA), [GBM](#GBM), [GLM](#GLM)) To score during each iteration of the model training, check this checkbox.
Jan 15, 2015 @jessica0xdata 1st flow doc update
487
Nov 5, 2015 @jessica0xdata Updates to docs
488 - **k***: ([K-Means](#Kmeans), [PCA](#PCA)) For K-Means, specify the number of clusters. For PCA, specify the rank of matrix approximation.
May 29, 2015 @jessica0xdata Doc update
489
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
490 - **user_points**: ([K-Means](#Kmeans)) For K-Means, specify the number of initial cluster centers.
Apr 28, 2015 @jessica0xdata Update README.md
491
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
492 - **max_iterations**: ([K-Means](#Kmeans), [PCA](#PCA), [GLM](#GLM)) Specify the number of training iterations.
May 29, 2015 @jessica0xdata Doc update
493
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
494 - **init**: ([K-Means](#Kmeans)) Select the initialization mode. The options are Furthest, PlusPlus, Random, or User.
May 29, 2015 @jessica0xdata Doc update
495
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
496 >**Note**: If PlusPlus is selected, the initial Y matrix is chosen by the final cluster centers from the K-Means PlusPlus algorithm.
Jul 4, 2015 @jessica0xdata Update for build 26
497
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
498 - **tweedie_variance_power**: ([GLM](#GLM)) (Only applicable if *Tweedie* is selected for **Family**) Specify the Tweedie variance power.
Jul 4, 2015 @jessica0xdata Update for build 26
499
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
500 - **tweedie_link_power**: ([GLM](#GLM)) (Only applicable if *Tweedie* is selected for **Family**) Specify the Tweedie link power.
Jan 15, 2015 @jessica0xdata 1st flow doc update
501
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
502 - **activation**: ([DL](#DL)) Select the activation function (Tanh, TanhWithDropout, Rectifier, RectifierWithDropout, Maxout, MaxoutWithDropout). The default option is Rectifier.
Jan 15, 2015 @jessica0xdata 1st flow doc update
503
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
504 - **hidden**: ([DL](#DL)) Specify the hidden layer sizes (e.g., 100,100). For Grid Search, use comma-separated values: (10,10),(20,20,20). The default value is [200,200]. The specified value(s) must be positive.
Jan 15, 2015 @jessica0xdata 1st flow doc update
505
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
506 - **epochs**: ([DL](#DL)) Specify the number of times to iterate (stream) the dataset. The value can be a fraction.
Jan 15, 2015 @jessica0xdata 1st flow doc update
507
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
508 - **variable_importances**: ([DL](#DL)) Check this checkbox to compute variable importance. This option is not selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
509
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
510 - **laplace**: ([Naïve Bayes](#NB)) Specify the Laplace smoothing parameter.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
511
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
512 - **min\_sdev**: ([Naïve Bayes](#NB)) Specify the minimum standard deviation to use for observations without enough data.
Jan 15, 2015 @jessica0xdata 1st flow doc update
513
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
514 - **eps\_sdev**: ([Naïve Bayes](#NB)) Specify the threshold for standard deviation. If this threshold is not met, the **min\_sdev** value is used.
Jan 15, 2015 @jessica0xdata 1st flow doc update
515
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
516 - **min\_prob**: ([Naïve Bayes](#NB)) Specify the minimum probability to use for observations without enough data.
Jan 15, 2015 @jessica0xdata 1st flow doc update
517
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
518 - **eps\_prob**: ([Naïve Bayes](#NB)) Specify the threshold for standard deviation. If this threshold is not met, the **min\_sdev** value is used.
Jan 15, 2015 @jessica0xdata 1st flow doc update
519
Nov 5, 2015 @jessica0xdata Updates to docs
520 - **compute_metrics**: ([Naïve Bayes](#NB), [PCA](#PCA)) To compute metrics on training data, check this checkbox. The Naïve Bayes classifier assumes independence between predictor variables conditional on the response, and a Gaussian distribution of numeric predictors with mean and standard deviation computed from the training dataset. When building a Naïve Bayes classifier, every row in the training dataset that contains at least one NA will be skipped completely. If the test dataset has missing values, then those predictors are omitted in the probability calculation during prediction.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
521
Jan 15, 2015 @jessica0xdata 1st flow doc update
522 **Advanced Options**
523
Nov 5, 2015 @jessica0xdata Updates to docs
524 - **fold_assignment**: ([GLM](#GLM), [GBM](#GBM), [DL](#DL), [DRF](#DRF), [K-Means](#Kmeans)) (Applicable only if a value for **nfolds** is specified and **fold_column** is not selected) Select the cross-validation fold assignment scheme. The available options are Random or [Modulo](https://en.wikipedia.org/wiki/Modulo_operation).
Jan 15, 2015 @jessica0xdata 1st flow doc update
525
Nov 5, 2015 @jessica0xdata Updates to docs
526 - **fold_column**: ([GLM](#GLM), [GBM](#GBM), [DL](#DL), [DRF](#DRF), [K-Means](#Kmeans)) Select the column that contains the cross-validation fold index assignment per observation.
Jan 15, 2015 @jessica0xdata 1st flow doc update
527
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
528 - **offset_column**: ([GLM](#GLM), [DRF](#DRF), [GBM](#GBM)) Select a column to use as the offset.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
529 >*Note*: Offsets are per-row "bias values" that are used during model training. For Gaussian distributions, they can be seen as simple corrections to the response (y) column. Instead of learning to predict the response (y-row), the model learns to predict the (row) offset of the response column. For other distributions, the offset corrections are applied in the linearized space before applying the inverse link function to get the actual response values. For more information, refer to the following [link](http://www.idg.pl/mirrors/CRAN/web/packages/gbm/vignettes/gbm.pdf).
Jan 15, 2015 @jessica0xdata 1st flow doc update
530
Sep 17, 2015 @jessica0xdata Added info for `weights_column` (per Erin)
531 - **weights_column**: ([GLM](#GLM), [DL](#DL), [DRF](#DRF), [GBM](#GBM)) Select a column to use for the observation weights. The specified `weights_column` must be included in the specified `training_frame`. *Python only*: To use a weights column when passing an H2OFrame to `x` instead of a list of column names, the specified `training_frame` must contain the specified `weights_column`.
Nov 24, 2015 @jessica0xdata Updates to clarify weights, Sparkling Water, roxygen2 requirement
532 >*Note*: Weights are per-row observation weights and do not increase the size of the data frame. This is typically the number of times a row is repeated, but non-integer values are supported as well. During training, rows with higher weights matter more, due to the larger loss function pre-factor.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
533
Sep 29, 2015 @jessica0xdata Updates for booklets; created online versions; updates for MeanSquare…
534 - **loss**: ([DL](#DL)) Select the loss function. For DL, the options are Automatic, Quadratic, CrossEntropy, Huber, or Absolute and the default value is Automatic. Absolute, Quadratic, and Huber are applicable for regression or classification, while CrossEntropy is only applicable for classification. Huber can improve for regression problems with outliers.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
535
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
536 - **checkpoint**: ([DL](#DL), [DRF](#DRF), [GBM](#GBM)) Enter a model key associated with a previously-trained model. Use this option to build a new model as a continuation of a previously-generated model.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
537
Nov 5, 2015 @jessica0xdata Updates to docs
538 - **use\_all\_factor\_levels**: ([DL](#DL), [PCA](#PCA)) Check this checkbox to use all factor levels in the possible set of predictors; if you enable this option, sufficient regularization is required. By default, the first factor level is skipped. For Deep Learning models, this option is useful for determining variable importances and is automatically enabled if the autoencoder is selected.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
539
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
540 - **train\_samples\_per\_iteration**: ([DL](#DL)) Specify the number of global training samples per MapReduce iteration. To specify one epoch, enter 0. To specify all available data (e.g., replicated training data), enter -1. To use the automatic values, enter -2.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
541
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
542 - **adaptive_rate**: ([DL](#DL)) Check this checkbox to enable the adaptive learning rate (ADADELTA). This option is selected by default. If this option is enabled, the following parameters are ignored: `rate`, `rate_decay`, `rate_annealing`, `momentum_start`, `momentum_ramp`, `momentum_stable`, and `nesterov_accelerated_gradient`.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
543
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
544 - **input\_dropout\_ratio**: ([DL](#DL)) Specify the input layer dropout ratio to improve generalization. Suggested values are 0.1 or 0.2. The range is >= 0 to <1.
Jan 15, 2015 @jessica0xdata 1st flow doc update
545
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
546 - **l1**: ([DL](#DL)) Specify the L1 regularization to add stability and improve generalization; sets the value of many weights to 0.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
547
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
548 - **l2**: ([DL](#DL)) Specify the L2 regularization to add stability and improve generalization; sets the value of many weights to smaller values.
Jan 15, 2015 @jessica0xdata 1st flow doc update
549
Nov 24, 2015 @jessica0xdata Updates to clarify weights, Sparkling Water, roxygen2 requirement
550 - **balance_classes**: ([GBM](#GBM), [DL](#DL)) Oversample the minority classes to balance the class distribution. This option is not selected by default and can increase the data frame size. This option is only applicable for classification. Majority classes can be undersampled to satisfy the **Max\_after\_balance\_size** parameter.
Jan 15, 2015 @jessica0xdata 1st flow doc update
551
Dec 19, 2015 @jessica0xdata Added note to `balance_classes` parameter
552 >**Note**: `balance_classes` balances over just the target, not over all classes in the training frame.
553
Nov 5, 2015 @jessica0xdata Updates to docs
554 - **max\_confusion\_matrix\_size**: ([DRF](#DRF), [DL](#DL), [Naïve Bayes](#NB), [GBM](#GBM), [GLM](#GLM)) Specify the maximum size (in number of classes) for confusion matrices to be printed in the Logs.
Apr 17, 2015 @jessica0xdata Update README.md
555
Nov 21, 2015 @jessica0xdata Updates (multi-class --> multinomial) to clarify hit ratio
556 - **max\_hit\_ratio\_k**: ([DRF](#DRF), [DL](#DL), [Naïve Bayes](#NB), [GBM](#GBM), [GLM](#GLM)) Specify the maximum number (top K) of predictions to use for hit ratio computation. Applicable to multinomial only. To disable, enter 0.
Apr 17, 2015 @jessica0xdata Update README.md
557
Oct 14, 2016 @laurendiperna PUBDEV-3448: updated the GBM Booklet, the user guide, and the Flow re…
558 - **r2_stopping**: ([GBM](#GBM), [DRF](#DRF)) r2_stopping is no longer supported and will be ignored if set - please use stopping_rounds, stopping_metric and stopping_tolerance instead.
Jan 15, 2015 @jessica0xdata 1st flow doc update
559
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
560 - **build\_tree\_one\_node**: ([DRF](#DRF), [GBM](#GBM)) To run on a single node, check this checkbox. This is suitable for small datasets as there is no network overhead but fewer CPUs are used. The default setting is disabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
561
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
562 - **rate**: ([DL](#DL)) Specify the learning rate. Higher rates result in less stable models and lower rates result in slower convergence. Not applicable if **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
563
Nov 5, 2015 @jessica0xdata Updates to docs
564 - **rate_annealing**: ([DL](#DL)) Specify the learning rate annealing. The formula is rate/(1+rate\_annealing value \* samples). Not applicable if **adaptive\_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
565
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
566 - **momentum_start**: ([DL](#DL)) Specify the initial momentum at the beginning of training. A suggested value is 0.5. Not applicable if **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
567
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
568 - **momentum_ramp**: ([DL](#DL)) Specify the number of training samples for increasing the momentum. Not applicable if **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
569
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
570 - **momentum_stable**: ([DL](#DL)) Specify the final momentum value reached after the **momentum_ramp** training samples. Not applicable if **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
571
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
572 - **nesterov\_accelerated\_gradient**: ([DL](#DL)) Check this checkbox to use the Nesterov accelerated gradient. This option is recommended and selected by default. Not applicable is **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
573
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
574 - **hidden\_dropout\_ratios**: ([DL](#DL)) Specify the hidden layer dropout ratios to improve generalization. Specify one value per hidden layer, each value between 0 and 1 (exclusive). There is no default value. This option is applicable only if *TanhwithDropout*, *RectifierwithDropout*, or *MaxoutWithDropout* is selected from the **Activation** drop-down list.
Jan 15, 2015 @jessica0xdata 1st flow doc update
575
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
576 - **tweedie_power**: ([DL](#DL), [GBM](#GBM)) (Only applicable if *Tweedie* is selected for **Family**) Specify the Tweedie power. The range is from 1 to 2. For a normal distribution, enter `0`. For Poisson distribution, enter `1`. For a gamma distribution, enter `2`. For a compound Poisson-gamma distribution, enter a value greater than 1 but less than 2. For more information, refer to [Tweedie distribution](https://en.wikipedia.org/wiki/Tweedie_distribution).
Jan 15, 2015 @jessica0xdata 1st flow doc update
577
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
578 - **score_interval**: ([DL](#DL)) Specify the shortest time interval (in seconds) to wait between model scoring.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
579
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
580 - **score\_training\_samples**: ([DL](#DL)) Specify the number of training set samples for scoring. To use all training samples, enter 0.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
581
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
582 - **score\_validation\_samples**: ([DL](#DL)) (Requires selection from the **validation_frame** drop-down list) This option is applicable to classification only. Specify the number of validation set samples for scoring. To use all validation set samples, enter 0.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
583
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
584 - **score\_duty\_cycle**: ([DL](#DL)) Specify the maximum duty cycle fraction for scoring. A lower value results in more training and a higher value results in more scoring. The value must be greater than 0 and less than 1.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
585
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
586 - **autoencoder**: ([DL](#DL)) Check this checkbox to enable the Deep Learning autoencoder. This option is not selected by default.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
587 >**Note**: This option requires a loss function other than CrossEntropy. If this option is enabled, **use\_all\_factor\_levels** must be enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
588
589 **Expert Options**
590
Nov 5, 2015 @jessica0xdata Updates to docs
591 - **keep\_cross\_validation\_predictions**: ([GLM](#GLM), [GBM](#GBM), [DL](#DL), [DRF](#DRF), [K-Means](#Kmeans)) To keep the cross-validation predictions, check this checkbox.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
592
Nov 5, 2015 @jessica0xdata Updates to docs
593 - **class\_sampling\_factors**: ([DRF](#DRF), [GBM](#GBM), [DL](#DL)) Specify the per-class (in lexicographical order) over/under-sampling ratios. By default, these ratios are automatically computed during training to obtain the class balance. This option is only applicable for classification problems and when **balance_classes** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
594
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
595 - **overwrite\_with\_best\_model**: ([DL](#DL)) Check this checkbox to overwrite the final model with the best model found during training. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
596
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
597 - **target\_ratio\_comm\_to\_comp**: ([DL](#DL)) Specify the target ratio of communication overhead to computation. This option is only enabled for multi-node operation and if **train\_samples\_per\_iteration** equals -2 (auto-tuning).
Jan 15, 2015 @jessica0xdata 1st flow doc update
598
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
599 - **rho**: ([DL](#DL)) Specify the adaptive learning rate time decay factor. This option is only applicable if **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
600
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
601 - **epsilon**: ([DL](#DL)) Specify the adaptive learning rate time smoothing factor to avoid dividing by zero. This option is only applicable if **adaptive_rate** is enabled.
Jan 15, 2015 @jessica0xdata 1st flow doc update
602
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
603 - **max_w2**: ([DL](#DL)) Specify the constraint for the squared sum of the incoming weights per unit (e.g., for Rectifier).
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
604
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
605 - **initial\_weight\_distribution**: ([DL](#DL)) Select the initial weight distribution (Uniform Adaptive, Uniform, or Normal). If Uniform Adaptive is used, the **initial\_weight\_scale** parameter is not applicable.
Apr 10, 2015 @jessica0xdata Update README.md
606
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
607 - **initial\_weight\_scale**: ([DL](#DL)) Specify the initial weight scale of the distribution function for Uniform or Normal distributions. For Uniform, the values are drawn uniformly from initial weight scale. For Normal, the values are drawn from a Normal distribution with the standard deviation of the initial weight scale. If Uniform Adaptive is selected as the **initial\_weight\_distribution**, the **initial\_weight\_scale** parameter is not applicable.
Jan 15, 2015 @jessica0xdata 1st flow doc update
608
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
609 - **classification_stop**: ([DL](#DL)) (Applicable to discrete/categorical datasets only) Specify the stopping criterion for classification error fractions on training data. To disable this option, enter -1.
Jan 15, 2015 @jessica0xdata 1st flow doc update
610
Nov 21, 2015 @jessica0xdata Updates (multi-class --> multinomial) to clarify hit ratio
611 - **max\_hit\_ratio\_k**: ([DL](#DL), [GLM](#GLM)) (Classification only) Specify the maximum number (top K) of predictions to use for hit ratio computation (for multinomial only). To disable this option, enter 0.
Jan 15, 2015 @jessica0xdata 1st flow doc update
612
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
613 - **regression_stop**: ([DL](#DL)) (Applicable to real value/continuous datasets only) Specify the stopping criterion for regression error (MSE) on the training data. To disable this option, enter -1.
Jan 15, 2015 @jessica0xdata 1st flow doc update
614
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
615 - **diagnostics**: ([DL](#DL)) Check this checkbox to compute the variable importances for input features (using the Gedeon method). For large networks, selecting this option can reduce speed. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
616
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
617 - **fast_mode**: ([DL](#DL)) Check this checkbox to enable fast mode, a minor approximation in back-propagation. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
618
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
619 - **force\_load\_balance**: ([DL](#DL)) Check this checkbox to force extra load balancing to increase training speed for small datasets and use all cores. This option is selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
620
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
621 - **single\_node\_mode**: ([DL](#DL)) Check this checkbox to force H2O to run on a single node for fine-tuning of model parameters. This option is not selected by default.
Jul 4, 2015 @jessica0xdata Update for build 26
622
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
623 - **replicate\_training\_data**: ([DL](#DL)) Check this checkbox to replicate the entire training dataset on every node for faster training on small datasets. This option is not selected by default. This option is only applicable for clouds with more than one node.
Jan 15, 2015 @jessica0xdata 1st flow doc update
624
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
625 - **shuffle\_training\_data**: ([DL](#DL)) Check this checkbox to shuffle the training data. This option is recommended if the training data is replicated and the value of **train\_samples\_per\_iteration** is close to the number of nodes times the number of rows. This option is not selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
626
Jun 16, 2016 missing_values_handling option is available in GLM
627 - **missing\_values\_handling**: ([DL](#DL), [GLM](#GLM) Select how to handle missing values (Skip or MeanImputation).
Jan 15, 2015 @jessica0xdata 1st flow doc update
628
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
629 - **quiet_mode**: ([DL](#DL)) Check this checkbox to display less output in the standard output. This option is not selected by default.
Jan 15, 2015 @jessica0xdata 1st flow doc update
630
Oct 27, 2015 @jessica0xdata Updated definition for "sparse" for DL
631 - **sparse**: ([DL](#DL)) Check this checkbox to enable sparse data handling, which is more efficient for data with many zero values.
Jan 15, 2015 @jessica0xdata 1st flow doc update
632
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
633 - **col_major**: ([DL](#DL)) Check this checkbox to use a column major weight matrix for the input layer. This option can speed up forward propagation but may reduce the speed of backpropagation. This option is not selected by default.
634
635 >**Note**: This parameter has been deprecated.
Jan 15, 2015 @jessica0xdata 1st flow doc update
636
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
637 - **average_activation**: ([DL](#DL)) Specify the average activation for the sparse autoencoder. If **Rectifier** is selected as the **Activation** type, this value must be positive. For Tanh, the value must be in (-1,1).
Jan 15, 2015 @jessica0xdata 1st flow doc update
638
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
639 - **sparsity_beta**: ([DL](#DL)) Specify the sparsity-based regularization optimization. For more information, refer to the following [link](http://www.mit.edu/~9.520/spring09/Classes/class11_sparsity.pdf).
Jan 15, 2015 @jessica0xdata 1st flow doc update
640
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
641 - **max\_categorical\_features**: ([DL](#DL)) Specify the maximum number of categorical features enforced via hashing.
Jan 15, 2015 @jessica0xdata 1st flow doc update
642
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
643 - **reproducible**: ([DL](#DL)) To force reproducibility on small data, check this checkbox. If this option is enabled, the model takes more time to generate, since it uses only one thread.
Jan 15, 2015 @jessica0xdata 1st flow doc update
644
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
645 - **export\_weights\_and\_biases**: ([DL](#DL)) To export the neural network weights and biases as H2O frames, check this checkbox.
Jan 15, 2015 @jessica0xdata 1st flow doc update
646
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
647 - **max\_after\_balance\_size**: ([DRF](#DRF), [GBM](#GBM), [DL](#DL)) Specify the maximum relative size of the training data after balancing class counts (can be less than 1.0). Requires **balance\_classes**.
Jan 15, 2015 @jessica0xdata 1st flow doc update
648
Sep 29, 2015 @jessica0xdata Updates to readme & Docker doc
649 - **nbins\_top\_level**: ([DRF](#DRF), [GBM](#GBM)) (For numerical [real/int] columns only) Specify the maximum number of bins at the root level to use to build the histogram. This number will then be decreased by a factor of two per level.
Jan 15, 2015 @jessica0xdata 1st flow doc update
650
May 31, 2016 Added "seed" option to GLM
651 - **seed**: ([K-Means](#Kmeans), [GLM](#GLM), [GBM](#GBM), [DL](#DL), [DRF](#DRF)) Specify the random number generator (RNG) seed for algorithm components dependent on randomization. The seed is consistent for each H2O instance so that you can create models with the same starting conditions in alternative configurations.
Jan 15, 2015 @jessica0xdata 1st flow doc update
652
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
653 - **intercept**: ([GLM](#GLM)) To include a constant term in the model, check this checkbox. This option is selected by default.
May 29, 2015 @jessica0xdata Update for Arno's changes (binomial_double_trees); some cleanup
654
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
655 - **objective_epsilon**: ([GLM](#GLM)) Specify a threshold for convergence. If the objective value is less than this threshold, the model is converged.
Aug 7, 2015 @jessica0xdata Updates for booklets; updates for simons-5 (inc. `nbins_top_level`)
656
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
657 - **beta_epsilon**: ([GLM](#GLM)) Specify the beta epsilon value. If the L1 normalization of the current beta change is below this threshold, consider using convergence.
Jan 15, 2015 @jessica0xdata 1st flow doc update
658
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
659 - **gradient_epsilon**: ([GLM](#GLM)) (For L-BFGS only) Specify a threshold for convergence. If the objective value (using the L-infinity norm) is less than this threshold, the model is converged.
Jan 15, 2015 @jessica0xdata 1st flow doc update
660
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
661 - **prior**: ([GLM](#GLM)) Specify prior probability for y ==1. Use this parameter for logistic regression if the data has been sampled and the mean of response does not reflect reality.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
662
Aug 20, 2015 @jessica0xdata Fixed capitalization to match param. names
663 - **max\_active\_predictors**: ([GLM](#GLM)) Specify the maximum number of active predictors during computation. This value is used as a stopping criterium to prevent expensive model building with many predictors.
Apr 24, 2015 @jessica0xdata Update README.md
664
Jan 15, 2015 @jessica0xdata 1st flow doc update
665
666
667 ---
668
669 <a name="ViewModel"></a>
May 14, 2015 @jessica0xdata Update README.md
670 ## Viewing Models
Jan 15, 2015 @jessica0xdata 1st flow doc update
671
672 Click the **Assist Me!** button, then click the **getModels** link, or enter `getModels` in the cell in CS mode and press **Ctrl+Enter**. A list of available models displays.
673
Jan 16, 2015 @jessica0xdata Fixed images
674 ![Flow Models](images/Flow_getModels.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
675
Apr 28, 2015 @jessica0xdata Update README.md
676 To view all current models, you can also click the **Model** menu and click **List All Models**.
677
Jan 15, 2015 @jessica0xdata 1st flow doc update
678 To inspect a model, check its checkbox then click the **Inspect** button, or click the **Inspect** button to the right of the model name.
679
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
680 ![Flow Model](images/Flow_GetModel.png)
681
682 A summary of the model's parameters displays. To display more details, click the **Show All Parameters** button.
683
Apr 24, 2015 @jessica0xdata Update README.md
684 To delete a model, click the **Delete** button.
685
Jul 30, 2015 @jessica0xdata Doc updates
686 To generate a Plain Old Java Object (POJO) that can use the model outside of H2O, click the **Download POJO** button.
Jul 17, 2015 @jessica0xdata Updates for nightly build & other doc fixes
687
Jul 30, 2015 @jessica0xdata Doc updates
688 >**Note**: A POJO can be run in standalone mode or it can be integrated into a platform, such as [Hadoop's Storm](https://github.com/h2oai/h2o-training/blob/master/tutorials/streaming/storm/README.md). To make the POJO work in your Java application, you will also need the `h2o-genmodel.jar` file (available in `h2o-3/h2o-genmodel/build/libs/h2o-genmodel.jar`).
Apr 24, 2015 @jessica0xdata Update README.md
689
Jan 15, 2015 @jessica0xdata 1st flow doc update
690 ---
691
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
692 ### Exporting and Importing Models
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
693
694 **To export a built model:**
695
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
696 1. Click the **Model** menu at the top of the screen.
697 2. Select *Export Model...*
698 3. In the `exportModel` cell that appears, select the model from the drop-down *Model:* list.
699 4. Enter a location for the exported model in the *Path:* entry field.
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
700 >**Note**: If you specify a location that doesn't exist, it will be created. For example, if you only enter `test` in the *Path:* entry field, the model will be exported to `h2o-3/test`.
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
701 5. To overwrite any files with the same name, check the *Overwrite:* checkbox.
702 6. Click the **Export** button. A confirmation message displays when the model has been successfully exported.
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
703
704 ![Export Model](images/ExportModel.png)
705
706
707 **To import a built model:**
708
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
709 1. Click the **Model** menu at the top of the screen.
710 2. Select *Import Model...*
711 3. Enter the location of the model in the *Path:* entry field.
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
712 >**Note**: The file path must be complete (e.g., `Users/h2o-user/h2o-3/exported_models`). Do not rename models while importing.
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
713 4. To overwrite any files with the same name, check the *Overwrite:* checkbox.
714 5. Click the **Import** button. A confirmation message displays when the model has been successfully imported. To view the imported model, click the **View Model** button.
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
715
716 ![Import Model](images/ImportModel.png)
717
718 ---
719
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
720 ### Using Grid Search
Nov 3, 2015 @jessica0xdata Added "Grid Search in Flow" section
721
722
723 To include a parameter in a grid search in Flow, check the checkbox in the *Grid?* column to the right of the parameter name (highlighted in red in the image below).
724
725 ![Grid Search Column](images/Flow_GridSearch.png)
726
727
728 - If the parameter selected for grid search is Boolean (T/F or Y/N), both values are included when the *Grid?* checkbox is selected.
729 - If the parameter selected for grid search is a list of values, the values display as checkboxes when the *Grid?* checkbox is selected. More than one option can be selected.
730 - If the parameter selected for grid search is a numerical value, use a semicolon (;) to separate each additional value.
731 - To view a list of all grid searches, select the **Model** menu, then click **List All Grid Search Results**, or click the **Assist Me** button and select **getGrids**.
732
733
734 ---
735
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
736 ### Checkpointing Models
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
737
738 Some model types, such as DRF, GBM, and Deep Learning, support checkpointing. A checkpoint resumes model training so that you can iterate your model. The dataset must be the same. The following model parameters must be the same when restarting a model from a checkpoint:
739
740
741 Must be the same as in checkpoint model | | |
742 --------------------|------------------|-----------------------|
743 `drop_na20_cols` | `response_column` | `activation` |
744 `use_all_factor_levels` | `adaptive_rate` | `autoencoder` |
745 `rho` | `epsilon` | `sparse` |
746 `sparsity_beta` | `col_major` | `rate` |
747 `rate_annealing` | `rate_decay` | `momentum_start` |
748 `momentum_ramp` | `momentum_stable` | `nesterov_accelerated_gradient`|
749 `ignore_const_cols`| `max_categorical_features` |`nfolds`|
750 `distribution` | `tweedie_power` | |
751
752
753 The following parameters can be modified when restarting a model from a checkpoint:
754
755 Can be modified | | |
756 ----------------|-|-|
757 `seed` | `checkpoint`| `epochs` |
758 `score_interval`| `train_samples_per_iteration`| `target_ratio_comm_to_comp`
759 `score_duty_cycle`| `score_training_samples`| `score_validation_samples`
760 `score_validation_sampling`| `classification_stop`| `regression_stop`
761 `quiet_mode` | `max_confusion_matrix_size`| `max_hit_ratio_k`
762 `diagnostics` | `variable_importances`| `initial_weight_distribution`
763 `initial_weight_scale` | `force_load_balance` | `replicate_training_data`
764 `shuffle_training_data`| `single_node_mode` | `fast_mode`
765 `l1`|`l2`| `max_w2`
766 `input_dropout_ratio`| `hidden_dropout_ratios` | `loss`
767 `overwrite_with_best_model`| `missing_values_handling` | `average_activation`
768 `reproducible` | `export_weights_and_biases`| `elastic_averaging`
769 `elastic_averaging_moving_rate`| `elastic_averaging_regularization`| `mini_batch_size`
770
771
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
772 1. After building your model, copy the `model_id`. To view the `model_id`, click the **Model** menu then click **List All Models**.
773 2. Select the model type from the drop-down **Model** menu.
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
774 >**Note**: The model type must be the same as the checkpointed model.
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
775 3. Paste the copied `model_id` in the *checkpoint* entry field.
776 4. Click the **Build Model** button. The model will resume training.
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
777
778
779 ---
780
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
781 ### Interpreting Model Results
Jul 30, 2015 @jessica0xdata Doc updates
782
Nov 26, 2015 @jessica0xdata Updated Model Results, added PySparkling and Sparkling Water info, ad…
783 **Scoring history**: [GBM](#GBM), [DL](#DL) Represents the error rate of the model as it is built. Typically, the error rate will be higher at the beginning (the left side of the graph) then decrease as the model building completes and accuracy improves. Can include mean squared error (MSE) and deviance.
Jul 30, 2015 @jessica0xdata Doc updates
784
785 ![Scoring History example](images/Flow_ScoringHistory.png)
786
Nov 5, 2015 @jessica0xdata Updates for variable importances, PCA
787 **Variable importances**: [GBM](#GBM), [DL](#DL) Represents the statistical significance of each variable in the data in terms of its affect on the model. Variables are listed in order of most to least importance. The percentage values represent the percentage of importance across all variables, scaled to 100%. The method of computing each variable's importance depends on the algorithm. To view the scaled importance value of a variable, use your mouse to hover over the bar representing the variable.
Jul 30, 2015 @jessica0xdata Doc updates
788
789 ![Variable Importances example](images/Flow_VariableImportances.png)
790
791 **Confusion Matrix**: [DL](#DL) Table depicting performance of algorithm in terms of false positives, false negatives, true positives, and true negatives. The actual results display in the columns and the predictions display in the rows; correct predictions are highlighted in yellow. In the example below, `0` was predicted correctly 902 times, while `8` was predicted correctly 822 times and `0` was predicted as `4` once.
792
793 ![Confusion Matrix example](images/Flow_ConfusionMatrix.png)
794
Nov 26, 2015 @jessica0xdata Updated Model Results, added PySparkling and Sparkling Water info, ad…
795 **ROC Curve**: [DL](#DL), [GLM](#GLM), [DRF](#DRF) Graph representing the ratio of true positives to false positives. To view a specific threshold, select a value from the drop-down **Threshold** list. To view any of the following details, select it from the drop-down **Criterion** list:
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
796
Jul 30, 2015 @jessica0xdata Doc updates
797 - Max f1
798 - Max f2
799 - Max f0point5
800 - Max accuracy
801 - Max precision
Sep 18, 2015 @jessica0xdata Updates for docs for Slater 3.2.0.2
802 - Max absolute MCC (the threshold that maximizes the absolute Matthew's Correlation Coefficient)
Jul 30, 2015 @jessica0xdata Doc updates
803 - Max min per class accuracy
804
805 The lower-left side of the graph represents less tolerance for false positives while the upper-right represents more tolerance for false positives. Ideally, a highly accurate ROC resembles the following example.
806
807 ![ROC Curve example](images/Flow_ROC.png)
808
Nov 26, 2015 @jessica0xdata Updated Model Results, added PySparkling and Sparkling Water info, ad…
809 **Hit Ratio**: [GBM](#GBM), [DRF](#DRF), [NaiveBayes](#NB), [DL](#DL), [GLM](#GLM) (Multinomial Classification only) Table representing the number of times that the prediction was correct out of the total number of predictions.
810
811 ![Hit Ratio Table](images/HitRatioTable.png)
812
813
814 **Standardized Coefficient Magnitudes** [GLM](#GLM) Bar chart representing the relationship of a specific feature to the response variable. Coefficients can be positive (orange) or negative (blue). A positive coefficient indicates a positive relationship between the feature and the response, where an increase in the feature corresponds with an increase in the response, while a negative coefficient represents a negative relationship between the feature and the response where an increase in the feature corresponds with a decrease in the response (or vice versa).
815
816 ![Standardized Coefficient Magnitudes](images/SCM.png)
817
818
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
819 To learn how to make predictions, continue to the next section.
Jul 30, 2015 @jessica0xdata Doc updates
820
821 ---
822
Sep 11, 2015 @jessica0xdata Changes for Slater 3.2.0.1
823
Jan 15, 2015 @jessica0xdata 1st flow doc update
824 <a name="Predict"></a>
May 15, 2015 @jessica0xdata Batch docs update
825 # ... Making Predictions
Jan 15, 2015 @jessica0xdata 1st flow doc update
826
Apr 28, 2015 @jessica0xdata Update README.md
827 After creating your model, click the key link for the model, then click the **Predict** button.
Jun 11, 2015 @jessica0xdata Misc. typo fixes
828 Select the model to use in the prediction from the drop-down **Model:** menu and the data frame to use in the prediction from the drop-down **Frame:** menu, then click the **Predict** button.
Jan 15, 2015 @jessica0xdata 1st flow doc update
829
Jan 16, 2015 @jessica0xdata Fixed images
830 ![Making Predictions](images/Flow_makePredict.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
831
Apr 28, 2015 @jessica0xdata Update README.md
832
Jan 15, 2015 @jessica0xdata 1st flow doc update
833 ---
834
835 <a name="ViewPredict"></a>
May 14, 2015 @jessica0xdata Fix faulty headings in readme
836 ## Viewing Predictions
Jan 15, 2015 @jessica0xdata 1st flow doc update
837
838 Click the **Assist Me!** button, then click the **getPredictions** link, or enter `getPredictions` in the cell in CS mode and press **Ctrl+Enter**. A list of the stored predictions displays.
839 To view a prediction, click the **View** button to the right of the model name.
840
Jan 16, 2015 @jessica0xdata Fixed images
841 ![Viewing Predictions](images/Flow_getPredict.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
842
Jul 30, 2015 @jessica0xdata Doc updates
843 You can also view predictions by clicking the drop-down **Score** menu and selecting **List All Predictions**.
844
845
846
Apr 28, 2015 @jessica0xdata Update README.md
847
Jan 15, 2015 @jessica0xdata 1st flow doc update
848 ---
849
850 <a name="ViewFrame"></a>
May 14, 2015 @jessica0xdata Update README.md
851 ## Viewing Frames
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
852
Jun 12, 2015 @jessica0xdata Updates for UI changes
853 To view a specific frame, click the "Key" link for the specified frame, or enter `getFrameSummary "FrameName"` in a cell in CS mode (where `FrameName` is the name of a frame, such as `allyears2k.hex`).
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
854
855 ![Viewing specified frame](images/Flow_getFrame.png)
856
Apr 28, 2015 @jessica0xdata Update README.md
857
May 19, 2015 @jessica0xdata Doc updates
858 From the `getFrameSummary` cell, you can:
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
859
860 - view a truncated list of the rows in the data frame by clicking the **View Data** button
May 19, 2015 @jessica0xdata Doc updates
861 - split the dataset by clicking the **Split...** button
862 - view the columns, data, and factors in more detail or plot a graph by clicking the **Inspect** button
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
863 - create a model by clicking the **Build Model** button
864 - make a prediction based on the data by clicking the **Predict** button
865 - download the data as a .csv file by clicking the **Download** button
866 - view the characteristics or domain of a specific column by clicking the **Summary** link
867
May 19, 2015 @jessica0xdata Doc updates
868 When you view a frame, you can "drill-down" to the necessary level of detail (such as a specific column or row) using the **Inspect** button or by clicking the links. The following screenshot displays the results of clicking the **Inspect** button for a frame.
Jan 15, 2015 @jessica0xdata 1st flow doc update
869
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
870 ![Inspecting Frames](images/Flow_inspectFrame.png)
871
May 19, 2015 @jessica0xdata Doc updates
872 This screenshot displays the results of clicking the **columns** link.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
873
874 ![Inspecting Columns](images/Flow_inspectCol.png)
875
876
Apr 28, 2015 @jessica0xdata Update README.md
877 To view all frames, click the **Assist Me!** button, then click the **getFrames** link, or enter `getFrames` in the cell in CS mode and press **Ctrl+Enter**. You can also view all current frames by clicking the drop-down **Data** menu and selecting **List All Frames**.
878
879 A list of the current frames in H2O displays that includes the following information for each frame:
Jan 15, 2015 @jessica0xdata 1st flow doc update
880
881
May 19, 2015 @jessica0xdata Doc updates
882 - Link to the frame (the "key")
Jan 15, 2015 @jessica0xdata 1st flow doc update
883 - Number of rows and columns
884 - Size
885
886
887 For parsed data, the following information displays:
888
889 - Link to the .hex file
890 - The **Build Model**, **Predict**, and **Inspect** buttons
891
Jan 16, 2015 @jessica0xdata Fixed images
892 ![Parsed Frames](images/Flow_getFrames.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
893
894 To make a prediction, check the checkboxes for the frames you want to use to make the prediction, then click the **Predict on Selected Frames** button.
895
Apr 16, 2015 @jessica0xdata Update README.md
896 ---
897
May 14, 2015 @jessica0xdata Update README.md
898 ### Splitting Frames
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
899
May 19, 2015 @jessica0xdata Doc updates
900 Datasets can be split within Flow for use in model training and testing.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
901
Apr 17, 2015 @jessica0xdata Update README.md
902 ![splitFrame cell](images/Flow_splitFrame.png)
903
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
904 1. To split a frame, click the **Assist Me** button, then click **splitFrame**.
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
905
906 >**Note**: You can also click the drop-down **Data** menu and select **Split Frame...**.
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
907 2. From the drop-down **Frame:** list, select the frame to split.
908 3. In the second **Ratio** entry field, specify the fractional value to determine the split. The first **Ratio** field is automatically calculated based on the values entered in the second **Ratio** field.
Apr 17, 2015 @jessica0xdata Update README.md
909
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
910 >**Note**: Only fractional values between 0 and 1 are supported (for example, enter `.5` to split the frame in half). The total sum of the ratio values must equal one. H2O automatically adjusts the ratio values to equal one; if unsupported values are entered, an error displays.
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
911 4. In the **Key** entry field, specify a name for the new frame.
912 5. (Optional) To add another split, click the **Add a split** link. To remove a split, click the `X` to the right of the **Key** entry field.
913 6. Click the **Create** button.
Apr 16, 2015 @jessica0xdata Update README.md
914
915 ---
May 14, 2015 @jessica0xdata Update README.md
916 ### Creating Frames
Apr 29, 2015 @jessica0xdata Update README.md
917
918 To create a frame with a large amount of random data (for example, to use for testing), click the drop-down **Admin** menu, then select **Create Synthetic Frame**. Customize the frame as needed, then click the **Create** button to create the frame.
919
Jun 12, 2015 @jessica0xdata Updates for UI changes
920 ![Flow - Creating Frames](images/Flow_CreateFrame.png)
921
Apr 29, 2015 @jessica0xdata Update README.md
922 ---
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
923
May 14, 2015 @jessica0xdata Update README.md
924 ### Plotting Frames
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
925
926 To create a plot from a frame, click the **Inspect** button, then click the **Plot** button.
927
May 19, 2015 @jessica0xdata Doc updates
928 Select the type of plot (point, path, or rect) from the drop-down **Type** menu, then select the x-axis and y-axis from the following options:
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
929
930 - label
May 19, 2015 @jessica0xdata Doc updates
931 - type
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
932 - missing
933 - zeros
May 19, 2015 @jessica0xdata Doc updates
934 - +Inf
935 - -Inf
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
936 - min
937 - max
938 - mean
939 - sigma
940 - cardinality
941
942 Select one of the above options from the drop-down **Color** menu to display the specified data in color, then click the **Plot** button to plot the data.
943
Aug 7, 2015 @jessica0xdata Fixed typo in SiteIntro that was causing broken image
944 ![Flow - Plotting Frames](images/Flow_plot.png)
Jun 12, 2015 @jessica0xdata Updates for UI changes
945
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
946 >**Note**: Because H2O stores enums internally as numeric then maps the integers to an array of strings, any `min`, `max`, or `mean` values for categorical columns are not meaningful and should be ignored. Displays for categorical data will be modified in a future version of H2O.
May 12, 2015 @jessica0xdata Update README.md
947
Jan 15, 2015 @jessica0xdata 1st flow doc update
948 ---
949
May 14, 2015 @jessica0xdata Docs update for website
950 <a name="Flows"></a>
951
May 15, 2015 @jessica0xdata Batch docs update
952 # ... Using Flows
May 14, 2015 @jessica0xdata Docs update for website
953
954 You can use and modify flows in a variety of ways:
955
956 - Clips allow you to save single cells
957 - Outlines display summaries of your workflow
958 - Flows can be saved, duplicated, loaded, or downloaded
959
960 ---
961
962
Jan 15, 2015 @jessica0xdata 1st flow doc update
963 <a name="Clips"></a>
964
May 14, 2015 @jessica0xdata Docs update for website
965 ## Using Clips
Jan 15, 2015 @jessica0xdata 1st flow doc update
966
967 Clips enable you to save cells containing your workflow for later reuse. To save a cell as a clip, click the paperclip icon to the right of the cell (highlighted in the red box in the following screenshot).
Jan 16, 2015 @jessica0xdata Fixed images
968 ![Paperclip icon](images/Flow_clips_paperclip.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
969
970 To use a clip in a workflow, click the "Clips" tab in the sidebar on the right.
971
Jan 16, 2015 @jessica0xdata Fixed images
972 ![Clips tab](images/Flow_clips.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
973
974 All saved clips, including the default system clips (such as `assist`, `importFiles`, and `predict`), are listed. Clips you have created are listed under the "My Clips" heading. To select a clip to insert, click the circular button to the left of the clip name. To delete a clip, click the trashcan icon to right of the clip name.
975
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
976 >**NOTE**: The default clips listed under "System" cannot be deleted.
Jan 15, 2015 @jessica0xdata 1st flow doc update
977
978 Deleted clips are stored in the trash. To permanently delete all clips in the trash, click the **Empty Trash** button.
979
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
980 >**NOTE**: Saved data, including flows and clips, are persistent as long as the same IP address is used for the cluster. If a new IP is used, previously saved flows and clips are not available.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
981
Jan 15, 2015 @jessica0xdata 1st flow doc update
982 ---
983
984 <a name="Outline"></a>
May 14, 2015 @jessica0xdata Docs update for website
985 ## Viewing Outlines
Jan 15, 2015 @jessica0xdata 1st flow doc update
986
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
987 The **Outline** tab in the sidebar displays a brief summary of the cells currently used in your flow; essentially, a command history.
Apr 16, 2015 @jessica0xdata Update README.md
988
989 - To jump to a specific cell, click the cell description.
990 - To delete a cell, select it and press the X key on your keyboard.
Jan 15, 2015 @jessica0xdata 1st flow doc update
991
Jan 16, 2015 @jessica0xdata Image fix
992 ![View Outline](images/Flow_outline.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
993
994 ---
995
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
996 <a name="SaveFlow"></a>
May 14, 2015 @jessica0xdata Docs update for website
997 ## Saving Flows
Jan 15, 2015 @jessica0xdata 1st flow doc update
998
Jun 12, 2015 @jessica0xdata Updates for UI changes
999 You can save your flow for later reuse. To save your flow as a notebook, click the "Save" button (the first button in the row of buttons below the flow name), or click the drop-down "Flow" menu and select "Save Flow."
Jan 15, 2015 @jessica0xdata 1st flow doc update
1000 To enter a custom name for the flow, click the default flow name ("Untitled Flow") and type the desired flow name. A pencil icon indicates where to enter the desired name.
1001
Jan 16, 2015 @jessica0xdata Fixed images
1002 ![Renaming Flows](images/Flow_rename.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
1003
1004 To confirm the name, click the checkmark to the right of the name field.
1005
Jan 16, 2015 @jessica0xdata Fixed images
1006 ![Confirm Name](images/Flow_rename2.png)
Jan 15, 2015 @jessica0xdata 1st flow doc update
1007
1008 To reuse a saved flow, click the "Flows" tab in the sidebar, then click the flow name. To delete a saved flow, click the trashcan icon to the right of the flow name.
1009
Jan 16, 2015 @jessica0xdata Image fix
1010 ![Flows](images/Flow_flows.png)
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1011
May 14, 2015 @jessica0xdata Docs update for website
1012 ### Finding Saved Flows on your Disk
Mar 20, 2015 @tomkraljevic Tweak flow saving instructions.
1013
1014 By default, flows are saved to the `h2oflows` directory underneath your home directory. The directory where flows are saved is printed to stdout:
1015
Mar 20, 2015 @jessica0xdata Update README.md
1016 ```
1017 03-20 14:54:20.945 172.16.2.39:54323 95667 main INFO: Flow dir: '/Users/<UserName>/h2oflows'
1018 ```
Mar 20, 2015 @tomkraljevic Tweak flow saving instructions.
1019
Mar 20, 2015 @jessica0xdata Update README.md
1020 To back up saved flows, copy this directory to your preferred backup location.
Mar 20, 2015 @jessica0xdata Add location of saved flows to "Getting Started" doc
1021
Mar 20, 2015 @tomkraljevic Tweak flow saving instructions.
1022 To specify a different location for saved flows, use the command-line argument `-flow_dir` when launching H2O:
1023
1024 `java -jar h2o.jar -flow_dir /<New>/<Location>/<For>/<Saved>/<Flows>`
1025
Mar 24, 2015 @jessica0xdata Update README.md
1026 where `/<New>/<Location>/<For>/<Saved>/<Flows>` represents the specified location. If the directory does not exist, it will be created the first time you save a flow.
Mar 20, 2015 @jessica0xdata Update README.md
1027
May 14, 2015 @jessica0xdata Docs update for website
1028 ### Saving Flows on a Hadoop cluster
Mar 20, 2015 @jessica0xdata Update README.md
1029
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
1030 If you are running H2O Flow on a Hadoop cluster, H2O will try to find the HDFS home directory to use as the default directory for flows. If the HDFS home directory is not found, flows cannot be saved unless a directory is specified while launching using `-flow_dir`:
Mar 20, 2015 @jessica0xdata Update README.md
1031
Jun 5, 2015 @jessica0xdata Changed Hadoop mapperXmx value to 6 (per Ray)
1032 `hadoop jar h2odriver.jar -nodes 1 -mapperXmx 6g -output hdfsOutputDirName -flow_dir hdfs:///<Saved>/<Flows>/<Location>`
Mar 20, 2015 @jessica0xdata Update README.md
1033
Mar 24, 2015 @jessica0xdata Update README.md
1034 The location specified in `flow_dir` may be either an hdfs or regular filesystem directory. If the directory does not exist, it will be created the first time you save a flow.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1035
May 19, 2015 @jessica0xdata Doc updates
1036 ### Copying Flows
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1037
Jun 12, 2015 @jessica0xdata Updates for UI changes
1038 To create a copy of the current flow, select the **Flow** menu, then click **Make a Copy**. The name of the current flow changes to `Copy of <FlowName>` (where `<FlowName>` is the name of the flow). You can save the duplicated flow using this name by clicking **Flow** > **Save Flow**, or [rename it](#SaveFlow) before saving.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1039
1040
May 14, 2015 @jessica0xdata Docs update for website
1041 ### Downloading Flows
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1042
Jun 12, 2015 @jessica0xdata Updates for UI changes
1043 After saving a flow as a notebook, click the **Flow** menu, then select **Download this Flow**. A new window opens and the saved flow is downloaded to the default downloads folder on your computer. The file is exported as `<filename>.flow`, where `<filename>` is the name specified when the flow was saved.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1044
May 19, 2015 @jessica0xdata Doc updates
1045 **Caution**: You must have an active internet connection to download flows.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1046
May 14, 2015 @jessica0xdata Docs update for website
1047 ### Loading Flows
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1048
1049 To load a saved flow, click the **Flows** tab in the sidebar at the right. In the pop-up confirmation window that appears, select **Load Notebook**, or click **Cancel** to return to the current flow.
1050
1051 ![Confirm Replace Flow](images/Flow_confirmreplace.png)
1052
1053 After clicking **Load Notebook**, the saved flow is loaded.
1054
Jun 12, 2015 @jessica0xdata Updates for UI changes
1055 To load an exported flow, click the **Flow** menu and select **Open Flow...**. In the pop-up window that appears, click the **Choose File** button and select the exported flow, then click the **Open** button.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1056
1057 ![Open Flow](images/Flow_Open.png)
1058
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
1059 >**Notes**:
1060
1061 >- Only exported flows using the default .flow filetype are supported. Other filetypes will not open.
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1062 - If the current notebook has the same name as the selected file, a pop-up confirmation appears to confirm that the current notebook should be overwritten.
1063
1064 ---
1065
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1066 <a name="Troubleshooting"></a>
May 15, 2015 @jessica0xdata Batch docs update
1067 # ...Troubleshooting Flow
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1068
1069 To troubleshoot issues in Flow, use the **Admin** menu. The **Admin** menu allows you to check the status of the cluster, view a timeline of events, and view or download logs for issue analysis.
1070
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
1071 >**NOTE**: To view the current H2O Flow version, click the **Help** menu, then click **About**.
Jan 27, 2015 @jessica0xdata Updated Recent Changes & Flow README docs
1072
Mar 11, 2015 @lo5 Dedent level 2 headings by one level
1073 ## Viewing Cluster Status
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1074
1075 Click the **Admin** menu, then select **Cluster Status**. A summary of the status of the cluster (also known as a cloud) displays, which includes the same information:
1076
1077 - Cluster health
1078 - Whether all nodes can communicate (consensus)
1079 - Whether new nodes can join (locked/unlocked)
May 19, 2015 @jessica0xdata Doc updates
1080
Jun 17, 2015 @jessica0xdata Revised Note format; added info re: sidebar, variables, search auto-s…
1081 >**Note**: After you submit a job to H2O, the cluster does not accept new nodes.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1082 - H2O version
Mar 4, 2015 @jessica0xdata Updated Getting Started with Flow doc; added new Parse image, info on…
1083 - Number of used and available nodes
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1084 - When the cluster was created
1085
1086 ![Cluster Status](images/Flow_CloudStatus.png)
1087
1088
Jan 27, 2015 @jessica0xdata Updated Recent Changes & Flow README docs
1089 The following information displays for each node:
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1090
1091 - IP address (name)
1092 - Time of last ping
1093 - Number of cores
1094 - Load
1095 - Amount of data (used/total)
1096 - Percentage of cached data
1097 - GC (free/total/max)
1098 - Amount of disk space in GB (free/max)
1099 - Percentage of free disk space
1100
1101 To view more information, click the **Show Advanced** button.
1102
1103 ---
1104
Mar 11, 2015 @lo5 Dedent level 2 headings by one level
1105 ## Viewing CPU Status (Water Meter)
Jan 27, 2015 @jessica0xdata Updated Recent Changes & Flow README docs
1106
1107 To view the current CPU usage, click the **Admin** menu, then click **Water Meter (CPU Meter)**. A new window opens, displaying the current CPU use statistics.
1108
1109 ---
1110
Mar 11, 2015 @lo5 Dedent level 2 headings by one level
1111 ## Viewing Logs
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1112 To view the logs for troubleshooting, click the **Admin** menu, then click **Inspect Log**.
1113
1114 ![Inspect Log](images/Flow_viewLog.png)
1115
1116 To view the logs for a specific node, select it from the drop-down **Select Node** menu.
1117
1118 ---
1119
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
1120 <a name="DL_Logs"></a>
Mar 11, 2015 @lo5 Dedent level 2 headings by one level
1121 ## Downloading Logs
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1122
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
1123 To download the logs for further analysis, click the **Admin** menu, then click **Download Log**. A new window opens and the logs download to your default download folder. You can close the new window after downloading the logs. Send the logs to [h2ostream](mailto:h2ostream@googlegroups.com) or [file a JIRA ticket](#ReportIssue) for issue resolution.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1124
1125 ---
1126
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1127 ## Viewing Stack Trace Information
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1128
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1129 To view the stack trace information, click the **Admin** menu, then click **Stack Trace**.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1130
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1131 ![Stack Trace](images/Flow_stacktrace.png)
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1132
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1133 To view the stack trace information for a specific node, select it from the drop-down **Select Node** menu.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1134
1135 ---
1136
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1137 ## Viewing Network Test Results
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1138
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1139 To view network test results, click the **Admin** menu, then click **Network Test**.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1140
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1141 ![Network Test Results](images/Flow_NetworkTest.png)
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1142
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1143 ---
1144
1145 ## Accessing the Profiler
1146
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
1147 The Profiler looks across the cluster to see where the same stack trace occurs, and can be helpful for identifying activity on the current CPU.
Mar 14, 2015 @jessica0xdata Updates for Getting Started with Flow doc
1148 To view the profiler, click the **Admin** menu, then click **Profiler**.
1149
1150 ![Profiler](images/Flow_profiler.png)
1151
1152 To view the profiler information for a specific node, select it from the drop-down **Select Node** menu.
1153
1154 ---
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1155
1156
Mar 11, 2015 @lo5 Dedent level 2 headings by one level
1157 ## Viewing the Timeline
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1158
1159 To view a timeline of events in Flow, click the **Admin** menu, then click **Timeline**. The following information displays for each event:
1160
1161 - Time of occurrence (HH:MM:SS:MS)
1162 - Number of nanoseconds for duration
1163 - Originator of event ("who")
1164 - I/O type
1165 - Event type
1166 - Number of bytes sent & received
1167
1168 ![Timeline](images/Flow_timeline.png)
1169
Jan 27, 2015 @jessica0xdata Updated Recent Changes & Flow README docs
1170 To obtain the most recent information, click the **Refresh** button.
Jan 17, 2015 @jessica0xdata Updates to H2O Flow doc for build 0.2.26
1171
Feb 17, 2015 @jessica0xdata Updated Getting Started Flow doc; updated DL, GLM, GBM tutorials for …
1172 ---
1173
Sep 1, 2015 @jessica0xdata Working on first Skolem build.
1174 <a name="ReportIssue"></a>
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1175 ## Reporting Issues
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
1176
1177 If you experience an error with Flow, you can submit a JIRA ticket to notify our team.
1178
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1179 1. First, click the **Admin** menu, then click **Download Logs**. This will download a file contains information that will help our developers identify the cause of the issue.
1180 2. Click the **Help** menu, then click **Report an issue**. This will open our JIRA page where you can file your ticket.
1181 3. Click the **Create** button at the top of the JIRA page.
1182 4. Attach the log file from the first step, write a description of the error you experienced, then click the **Create** button at the bottom of the page. Our team will work to resolve the issue and you can track the progress of your ticket in JIRA.
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
1183
1184 ---
1185
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1186 ## Requesting Help
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
1187
1188 If you have a Google account, you can submit a request for assistance with H2O on our Google Groups page, [H2Ostream](https://groups.google.com/forum/#!forum/h2ostream).
1189
1190 To access H2Ostream from Flow:
1191
1192 0. Click the **Help** menu.
1193 0. Click **Forum/Ask a question**.
1194 0. Click the red **New topic** button.
1195 0. Enter your question and click the red **Post** button. If you are requesting assistance for an error you experienced, be sure to include your [logs](#DL_Logs).
1196
Jun 24, 2015 @jessica0xdata FAQ and README updates
1197
1198 You can also email your question to [h2ostream@googlegroups.com](mailto:h2ostream@googlegroups.com).
1199
Jun 23, 2017 @angela0xdata PUBDEV-4213: Updates to markdown syntax (#1307)
1200 Or, you can post your question on [Stack Overflow](https://stackoverflow.com/questions/tagged/h2o) using the "h2o" tag.
1201
Jun 24, 2015 @jessica0xdata FAQ and README updates
1202
Jun 17, 2015 @jessica0xdata Update Recent Changes; update docs for `build_tree_one_node` addition…
1203 ---
1204
Mar 11, 2015 @lo5 Dedent level 2 headings by one level
1205 ## Shutting Down H2O
Feb 17, 2015 @jessica0xdata Updated Getting Started Flow doc; updated DL, GLM, GBM tutorials for …
1206
1207 To shut down H2O, click the **Admin** menu, then click **Shut Down**. A *Shut down complete* message displays in the upper right when the cluster has been shut down.
Aug 12, 2015 @jessica0xdata Updates for simons-7; some other fixes
1208
1209 ---
1210