H2O v3.32.1.1 Release Notes

  • πŸš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-zipf/1/index.html

    πŸ› Bug

    [PUBDEV-6356] - GBM histograms now ignore rows with NA responses. [PUBDEV-7606] - Variable Importances added to GLM Generic model. πŸ›  [PUBDEV-7782] - Fixed the ArrayIndexOutOfBoundsException issue with GLM CV. 🐎 [PUBDEV-7825] - CoxPH performance no longer fails when a factor is used for the event_column. [PUBDEV-7841] - Existing frame no longer overwritten when data with the same query is loaded. πŸ›  [PUBDEV-7909] - Fixed how gain is calculated in XGBFI for GBM. [PUBDEV-7934] - Improved the error messages for save_to_hive_table. βœ… [PUBDEV-7963] - Added missing argument ’test’ for h2o.explain_row(). πŸ–¨ [PUBDEV-7979] - All trees now supported for XGBoost Print MOJO in Java. [PUBDEV-7987] - CoxPH prediction no longer fails when offset_column is specified. [PUBDEV-7998] - Added keys for Individual Conditional Expectation (ICE) plot in H2OExplanation class. [PUBDEV-8013] - model@model$parameters$x now reports actual feature names instead of names. [PUBDEV-8016] - h2o.explain no longer errors when AutoML object is trained with a fold_column. πŸ›  [PUBDEV-8046] - Fixed issues with python’s explanation plots not displaying fully.

    πŸ†• New Feature

    [PUBDEV-7706] - Ignored columns that are actually used for model training are unignored and no longer prevent model training to start in Flow. [PUBDEV-7735] - Added baseline hazard function estimate to CoxPH model. πŸ‘ [PUBDEV-7748] - Target Encoding now supports feature interactions. [PUBDEV-7805] - Added CoxPH concordance to both Flow and R/Python CoxPH summaries. [PUBDEV-7820] - Added a topbasemodel attribute to AutoML. [PUBDEV-7831] - Added new learning curve plotting function to R/Python. [PUBDEV-7854] - Added script for estimating the memory usage of a dataset. [PUBDEV-7859] - Added fault protections to grid search allowing saving of data and parameters, model checkpointing, and auto-recovery. πŸ‘ [PUBDEV-7884] - Added support for Java 15. πŸ‘ [PUBDEV-7969] - Added CDP7.1 support. πŸ–¨ [PUBDEV-7978] - Added support for XGBoost to Print MOJO as JSON. πŸ‘ [PUBDEV-8021] - Added support for refreshing HDFS delegation tokens. βͺ [PUBDEV-8035] - Reverted XGBoost categorical encodings for contributions.

    Task

    [PUBDEV-7637] - max_hit_ratio_k deprecated and removed. πŸ“¦ [PUBDEV-7894] - Added upper bound cap to supported Java version in H2O CRAN package requirements.

    πŸ‘Œ Improvement

    [PUBDEV-7473] - Users now allowed to include categorical column name in beta constraints. [PUBDEV-7579] - Multinomial PDP can now be plotted for more than one target class in Flow. [PUBDEV-7736] - Sped up CoxPH concordance score by using tree instead of the direct approach. [PUBDEV-7819] - XGBoost no longer fails when specifying custom fold_column. [PUBDEV-7843] - XGBoost CV models now built on multiple GPUs in parallel. [PUBDEV-7968] - Missing metrics added to GLM scoring history. [PUBDEV-8017] - Added validation checks for sampling rates for XGBoost for the R/Python clients. [PUBDEV-8024] -
    No longer errors when trying to use a fold column where not all folds are represented. [PUBDEV-8032] - Added the metalearner_transform option to Stacked Ensemble. [PUBDEV-8057] - GBM main model now built in parallel to the CV models. 🚚 [PUBDEV-8060] - Removed redundant extraction weights from GBM/DRF histogram. [PUBDEV-8061] - GBM now avoids scoring the last iteration twice when early stopping is enabled. [PUBDEV-8063] - POJO predictions for XGBoost now even closer to in-H2O predictions. [PUBDEV-8064] - Double-scoring of CV models in AutoML now avoided thus speeding up AutoML. [PUBDEV-8070] - AutoML now uses fewer neurons in DL grids and has improved the metalearner for Stacked Ensemble.

    Technical task

    [PUBDEV-7860] - Thin plate regression splines added to GAM.

    πŸ“„ Docs

    [PUBDEV-7917] - Added checkpoint description to GLM. πŸ“š [PUBDEV-7976] - Added thin plate regression spline documentation to GAM algorithm page. [PUBDEV-7988] - Added missing parameters to XGBoost algorithm page. 🌲 [PUBDEV-7992] - Added more information about log files to User Guide.