H2O v3.24.0.4 Release Notes

  • Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-yates/4/index.html

    Bug

    • [PUBDEV-4305] - Fixed an error that occurred when applying as.matrix() to an H2O dataframe with numeric values of size ~600K x 300.
    • [PUBDEV-5937] - Introduced a new xgboost.predict.native.enable property, which ensures that H2OXGBoostEstimator no longer always predicts the same value.
    • [PUBDEV-6440] - Users can now parse files from S3 using S3's directory URL with the s3 protocol.
    • [PUBDEV-6475] - Fixed an issue that caused h2o.getModelTree to produce an "invalid object for slot nas" error when XGBoost produced a root-node only decision tree.
    • [PUBDEV-6476] - Improved the performance of H2OXGBoost on OS X.
    • [PUBDEV-6479] - In Stacked Ensembles, fixed a categorical encoding mismatch error when building the ensemble. Users can now use Stacked Ensembles on top of base models that are trained with categorical encoding.
    • [PUBDEV-6483] - In Isolation Forest, you can now set mtries equal to the number of features.
    • [PUBDEV-6488] - Fixed an issue that caused XGBoost to produce a tree whose split features were all NA.
    • [PUBDEV-6489] - In h2o.getModelTree, when retrieving a threshold for values that are all NAs, updated the description to state that the "Split value is NA."
    • [PUBDEV-6490] - Fixed an issue that caused trivial features with NAs to be given inflated importance when monotonicity constraints were enabled. As a result, variable importance values were incorrect.
    • [PUBDEV-6491] - Fixed an NPE at water.init.HostnameGuesser that occurred when trying to launch a Sparkling Water cluster.
    • [PUBDEV-6496] - Removed internal_cv_weights from h2o.predict_contributions() output when the prediction was used on a fold column from a model run with nfolds.
    • [PUBDEV-6521] - Models that use Label Encoding no longer predict incorrectly on test data.
    • [PUBDEV-6523] - Predictions now work correctly on a subset of training features when using categorical_encoding.
    • [PUBDEV-6532] - Fixed an issue with how XGBoost formats non-integer numbers (doubles, floats): they are now formatted using Locale.ENGLISH to ensure that a decimal point "." is used instead of a comma ",". This locale setting grouped large numbers by thousands and split the groups with ",", which was unparseable to XGBoost.
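
    The locale issue in PUBDEV-6532 can be illustrated with plain Java. The following is a minimal sketch, not the actual H2O code: the class name LocaleFormatDemo and its helper methods are hypothetical and exist only for this illustration. It shows that the same double can be rendered with either "." or "," as the decimal separator depending on the Locale passed to String.format, and that a strict parser such as Double.parseDouble rejects the comma form.

```java
import java.util.Locale;

// Hypothetical demo class; it is not part of H2O or XGBoost.
class LocaleFormatDemo {

    // Formats a double with six decimal places using the given locale's
    // decimal separator. The "%f" conversion is locale-sensitive.
    static String format(Locale locale, double value) {
        return String.format(locale, "%f", value);
    }

    // Returns true if the text is accepted by Double.parseDouble, which
    // expects "." as the decimal separator and no grouping separators.
    static boolean parses(String text) {
        try {
            Double.parseDouble(text);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        double value = 1234.5;

        String english = format(Locale.ENGLISH, value); // "1234.500000"
        String german = format(Locale.GERMANY, value);  // "1234,500000"

        System.out.println(english + " parses: " + parses(english));
        System.out.println(german + " parses: " + parses(german));

        // The "," flag explicitly requests thousands grouping, which also
        // yields text that a strict parser rejects.
        String grouped = String.format(Locale.ENGLISH, "%,f", 1234567.5);
        System.out.println(grouped + " parses: " + parses(grouped));
    }
}
```

    Passing an explicit Locale instead of relying on the JVM default keeps the output stable regardless of the machine's regional settings, which is the general idea behind the fix described above.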

    New Feature

    • [PUBDEV-6478] - Added support for CDH 6.2.
    • [PUBDEV-6503] - Users can now specify an external IP for h2odriver callback.

    Improvement

    • [PUBDEV-6519] - Added a "toCategoricalCol" helper function for column type conversion.
    • [PUBDEV-6522] - Renamed "Generic Models" to "MOJO Import" in the documentation.

    Docs

    • [PUBDEV-6486] - Added CDH 6.2 to the list of supported Hadoop platforms.
    • [PUBDEV-6511] - Added the import_hive_table() and import_mojo() functions to the R HTML documentation.