#Recent Changes
##H2O-Dev
###0.1.26.1051 - 2/13/15
####New Features
These features have been added since the last release:
- Flow: Display alternate UI for splitFrames() (PUBDEV-399)
####Enhancements
These changes are improvements to existing features (which includes changed default values):
#####System
- Embedded H2O config can now provide flat file (needed for Hadoop) (github)
- Don't log GET requests for individual jobs, to avoid filling up the logs (github)
#####Algorithms
- Increase GBM/DRF factor binning back to historical levels. It had accidentally been capped at nbins (typically 20), but was intended to support a much higher cap. (github)
- Tweaked rho heuristic in glm (github)
- Enable variable importances for autoencoders (github)
- Removed `group_split` option from GBM
- Flow: display varimp for GBM output (PUBDEV-398)
- variable importance for GBM (github)
####Bug Fixes
These changes are to resolve incorrect software behavior:
#####Algorithms
- Fixed bug in GLM exception handling causing GLM jobs to hang (github)
- Fixed a bug in kmeans input parameter schema where init was always being set to Furthest (github)
- Fixed mean computation in GLM (github)
- Fixed kmeans.R (github)
- Flow: Building GBM model fails with Error executing javascript (PUBDEV-396)
#####System
###0.1.26.1032 - 2/6/15
####New Features
#####General Improvements
- better model output
- support for Python client
- support for Maven
- support for Sparkling Water
- support for REST API schema
- support for Hadoop CDH5 (github)
#####UI
- Display summary visualizations by default in column summary output cells (PUBDEV-337)
- Display AUC curve by default in binomial prediction output cells (PUBDEV-338)
- Flow: Implement About H2O/Flow with version information (PUBDEV-111)
- Add UI for CreateFrame (PUBDEV-218)
- Flow: Add ability to cancel running jobs (PUBDEV-373)
- Flow: warn when user navigates away while having unsaved content (PUBDEV-322)
#####Algorithms
- Implement splitFrame() in Flow (PUBDEV-356)
- Variable importance graph in Flow for GLM (PUBDEV-360)
- Flow: Implement model building form init and validation (PUBDEV-102)
- Added a shuffle-and-split-frame function; Use it to build a saner model on time-series data (github)
- Added binomial model metrics (github)
- Run KMeans from R (HEXDEV-105)
- Be able to create a new GLM model from an existing one with updated coefficients (HEXDEV-48)
- Run KMeans from Python (HEXDEV-106)
- Run Deep Learning Binomial from Flow (HEXDEV-83)
- Run KMeans from Flow (HEXDEV-104)
- Run Deep Learning from Python (HEXDEV-85)
- Run Deep Learning from R (HEXDEV-84)
- Run Deep Learning Multinomial from Flow (HEXDEV-108)
- Run Deep Learning Regression from Flow (HEXDEV-109)
#####API
- Flow: added REST API documentation to the web ui (PUBDEV-60)
- Flow: Implement visualization API (PUBDEV-114)
#####System
- Dataset inspection from Flow (HEXDEV-66)
- Basic data munging (Rapids) from R (HEXDEV-70)
- Implement stack operator/stacking in Lightning (HEXDEV-128)
####Enhancements
#####UI
- Added a better message when `h2o.init()` has not yet been called (`No active connection to an H2O cluster. Try calling "h2o.init()"`) (github)
#####Algorithms
- Updated the loss behavior for GBM. When loss is set to AUTO, if the response is an integer with 2 levels, then bernoulli (rather than gaussian) behavior is chosen. As a result, the `do_classification` flag is no longer necessary in Flow, since the loss completely specifies the desired behavior, and R users no longer need to use `as.factor()` on their response to get the desired bernoulli behavior.
- Updated column-based gradient task to use sparse interface (github)
- Updated LBFGS (added progress monitor interface, updated some default params), added progress and job support to GLM lbfgs (github)
- Added pretty print (github)
- Added AutoEncoder to R model categories (github)
- Added Coefficients table to GLM model (github)
- Updated glm lbfgs to allow for efficient lambda-search (l2 penalty only) (github)
- Removed splitframe shuffle parameter (github)
- Simplified model builders and added deeplearning model builder (github)
- Add DL model outputs to Flow (PUBDEV-372)
- Flow: Deep Learning: Expert Mode (PUBDEV-284)
- Flow: Display multinomial and regression DL model outputs (PUBDEV-383)
- Display varimp details for DL models (PUBDEV-381)
- Make binomial response "0" and "1" by default (github)
- Add Coefficients table to GLM model (github)
- Update R GBM demos to reflect new input parameter names (github)
- Rename GLM variable importance to normalized coefficient magnitudes (github)
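
The AUTO loss rule for GBM noted in this list can be pictured with a small sketch. It only illustrates the rule as stated in the entry and is not H2O's implementation: the function name `resolve_gbm_loss` and its arguments are hypothetical, and any response that is not an integer column with 2 levels is assumed to fall back to gaussian.

```python
def resolve_gbm_loss(loss, response):
    """Hypothetical helper: pick the effective loss for a GBM response column.

    `loss` is the requested loss name; `response` is a list of numeric
    response values. Only the AUTO case applies the changelog rule.
    """
    # An explicitly chosen loss is passed through unchanged.
    if loss.upper() != "AUTO":
        return loss.lower()

    # Changelog rule: an integer response with exactly 2 levels is bernoulli.
    all_integers = all(float(value).is_integer() for value in response)
    levels = set(response)
    if all_integers and len(levels) == 2:
        return "bernoulli"

    # Assumption: everything else is treated as gaussian in this sketch.
    return "gaussian"
```

Under these assumptions, a 0/1 response passed with `"AUTO"` resolves to `"bernoulli"` without any explicit factor conversion, which is the behavior the entry describes for R users.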
#####API
- Changed `key` to `destination_key` (github)
- Cleaned up REST API schema interface (github)
- Changed method name, cleaned setup, added a pyunit runner (github)
#####System
- Allow changing column types during parse-setup (PUBDEV-376)
- Display %NAs in model builder column lists (PUBDEV-375)
- Figure out how to add H2O to PyPI (PUBDEV-178)
####Bug Fixes
#####UI
- Flow: Parse => 1m.svm hangs at 42% (PUBDEV-345)
- cup98 Dataset has columns that prevent validation/prediction (PUBDEV-349)
- Flow: predict step failed to function (PUBDEV-217)
- Flow: Arrays of numbers (ex. hidden in deeplearning) require brackets (PUBDEV-303)
- Flow v.0.1.26.1030: StackTrace was broken (PUBDEV-371)
- Flow: Import files -> Search -> Parse these files -> null pointer exception (PUBDEV-170)
- Flow: "getJobs" not working (PUBDEV-320)
- Thresholds x Metrics and Max Criteria x Metrics tables were flipped in flow (HEXDEV-155)
- Flow v.0.1.26.1030: StackTrace is broken (PUBDEV-348)
- flow: getJobs always shows "Your H2O cloud has no jobs" (PUBDEV-243)
- Flow: First and last characters deleted from ignored columns (PUBDEV-300)
- Sparkling water => Flow => Menu buttons for cell do not show up (PUBDEV-294)
#####Algorithms
- Flow: Build K Means model with default K value gives error "Required field k not specified" (PUBDEV-167)
- Slicing out a specific data point is broken (PUBDEV-280)
- Flow: SplitFrame and grep in algorithms for flow and loops back onto itself (PUBDEV-272)
- Fixed the predict method (github)
- Refactor ModelMetrics into a different class for Binomial (github)
- /Predictions.json did not cache predictions (HEXDEV-119)
- Flow, DL: Error after changing hidden layer size (PUBDEV-323)
- Error in node$h2o#node: $ operator is invalid for atomic vectors (PUBDEV-348)
- Fixed K-means predict (PUBDEV-321)
- Flow: DL build mode fails => as it's missing adding quotes to parameter (PUBDEV-301)
- Flow: Build K means model with training/validation frames => unknown error (PUBDEV-185)
- Flow: Build quantile mode=> Click goes in loop (PUBDEV-188)
#####API
- Sparkling Water/Flow: Failed to find version for schema (PUBDEV-367)
- Cloud.json returns odd node name (PUBDEV-259)
#####System
- guesser needs to send types to parse (PUBDEV-279)
- Got h2o.clusterStatus function working in R. (github)
- Parse: Using R => java.lang.NullPointerException (PUBDEV-380)
- Flow: Jobs => click on destination key => unimplemented: Unexpected val class for Inspect: class water.fvec.DataFrame (PUBDEV-363)
- Column assignment in R exposes NullPointerException in Rollup (PUBDEV-155)
- import from hdfs doesn't add files (PUBDEV-260)
- AssertionError: ERROR: got tcp resend with existing in-progress task (PUBDEV-219)
- HDFS parse fails when H2O launched on Spark CDH5 (PUBDEV-138)
- Flow: Parse failure => java.lang.ArrayIndexOutOfBoundsException (PUBDEV-296)
- "predict" step is not working in flow (PUBDEV-202)
- Flow: Frame finishes parsing but comes up as null in flow (PUBDEV-270)
- scala >flightsToORD.first() fails with "not serializable result" (PUBDEV-304)
- DL throws NPE for bad column names (PUBDEV-15)
- Flow: Build model: Not able to build KMeans/Deep Learning model (PUBDEV-297)
- Flow: Col summary for NA/Y cols breaks (PUBDEV-325)
- Sparkling Water : util.SparkUncaughtExceptionHandler: Uncaught exception in thread Thread NanoHTTPD Session,9,main (PUBDEV-346)
###0.1.20.1019 - 1/19/15
####New Features
#####UI
- Added various documentation links to the build page (github)
#####Algorithms
- Ported matrix multiply over and connected it to rapids (github)
####Enhancements
#####UI
- Allow user to specify (the log of) the number of rows per chunk for a new constant chunk; use this new function in CreateFrame (github)
- Make CreateFrame non-blocking, now displays progress bar in Flow (github)
- Add row and column count to H2OFrame show method (github)
- Admin watermeter page (PUBDEV-234)
- Admin stack trace (PUBDEV-228)
- Admin profile (PUBDEV-227)
- Flow: Add download logs in UI (PUBDEV-204)
- Need shutdown, minimally like h2o (PUBDEV-74)
#####API
- Changed 2 to 3 for JSON requests (github)
- Rename some more fields for consistency (`max_iters` changed to `max_iterations`, `_iters` to `_iterations`, `_ncats` to `_categorical_column_count`, `_centersraw` to `centers_raw`, `_avgwithinss` to `avg_within_ss`, `_withinmse` to `within_mse`) (github)
- Changed K-Means output parameters (`withinmse` to `within_mse`, `avgss` to `avg_ss`, `avgbetweenss` to `avg_between_ss`) (github)
- Remove default field values from DeepLearning parameters schema, since they come from the backing class (github)
- Add @API help annotation strings to JSON model output (PUBDEV-216)
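
For client code that still reads the old field names, the renames listed in this section can be collected into a lookup table. The sketch below is an illustration only: the helper `migrate_field_names` is hypothetical and not part of H2O, and it assumes a flat dictionary of fields rather than the real nested JSON responses.

```python
# Old-to-new field names, taken from the rename entries in this section.
FIELD_RENAMES = {
    "max_iters": "max_iterations",
    "_iters": "_iterations",
    "_ncats": "_categorical_column_count",
    "_centersraw": "centers_raw",
    "_avgwithinss": "avg_within_ss",
    "_withinmse": "within_mse",
    "withinmse": "within_mse",
    "avgss": "avg_ss",
    "avgbetweenss": "avg_between_ss",
}


def migrate_field_names(payload):
    """Hypothetical helper: return a copy of a flat dict with old keys renamed.

    Keys that are not in FIELD_RENAMES are kept unchanged.
    """
    return {FIELD_RENAMES.get(key, key): value for key, value in payload.items()}
```

A call such as `migrate_field_names({"max_iters": 10, "k": 3})` leaves `k` untouched and renames only the key that appears in the table.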
#####Algorithms
- Minor fix in rapids matrix multiplication (github)
- Updated sparse chunk to cut off binary search for prefix/suffix zeros (github)
- Updated L_BFGS for GLM - warm-start solutions during lambda search, correctly pass current lambda value, added column-based gradient task (github)
- Fix model parameters' default values in the metadata (github)
- Set default value of k = number of clusters to 1 for K-Means (PUBDEV-251)
#####System
- Reject any training data with non-numeric values from KMeans model building (github)
####Bug Fixes
#####API
- Fixed isSparse call for constant chunks (github)
- Fixed sparse interface of constant chunks (no nonzero if const 1= 0) (github)
#####System
- Typeahead for folder contents apparently requires trailing "/" (github)
- Fix build and instructions for R install.packages() style of installation; Note we only support source installs now (github)
- Fixed R test runner h2o package install issue that caused it to fail to install on dev builds (github)
###0.1.18.1013 - 1/14/15
####New Features
#####UI
- Admin timeline (PUBDEV-226)
- Admin cluster status (PUBDEV-225)
- Markdown cells should auto run when loading a saved Flow notebook (PUBDEV-87)
- Complete About page to include info about the H2O version (PUBDEV-223)
####Enhancements
#####Algorithms
- Flow: Implement model output for GBM (PUBDEV-119)
###0.1.20.1016 - 12/28/14
- Added ip_port field in node json output for Cloud query (github)