H2O v3.22.1.5 Release Notes

  • ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-xu/5/index.html

    Bug

    ๐Ÿ›  [PUBDEV-6283] - Fixed an issue that caused stratified_split to fail when run on same column twice. ๐Ÿ›  [PUBDEV-6290] - Fixed an error that occurred when retreiving AutoML leader model with max_models = 1 in R. ๐Ÿ›  [PUBDEV-6292] - Fixed an issue that ersulted in an extra NA row in the GLM variable importance frame. [PUBDEV-6298] - h2odriver now works correctly on MapR. [PUBDEV-6300] - Flow no longer displays an error when searching for a file without first providing a path. [PUBDEV-6303] - GBM monotonicity constraints now correctly preserves the exact monotonicity. โš  [PUBDEV-6304] - Fixed the warning message that displays for categorical data with more then 10,000,000 values. ๐Ÿ”Š [PUBDEV-6305] - Users can now download logs from R after connecting via Steam. [PUBDEV-6313] - In AutoML, created new partition rules for generating new validation and leaderboard frames when cross validation is disabled and validation/leaderboard frames are not provided: If only the validation frame is missing: training/validation = 90/10. If only the leaderboard frame is missing: training/leaderboard = 90/10. If both the validation and leaderboard frames are missing: training/validation/leaderboard = 80/10/10. ๐Ÿ“ฆ [PUBDEV-6321] - Fixed resolution of spark-shell --packages "ai.h2o:h2o-algos:<vesion>" by Spark Ivy resolver. ๐Ÿ”ง [PUBDEV-6333] - Fixed an issue that caused h2o driver to fail to start when Hive was not configured.

    Improvement

    ๐Ÿ›  [PUBDEV-6271] - In Isolation Forest, fixed an issue that caused the minimum and maximum path length to not be correctly calculated when there are no OOB observations. [PUBDEV-6294] - A check_constant_response option is available in DRF and GBM. When enabled (default), then an exception is thrown if the response column is a constant value.

    ๐Ÿ“„ Docs

    [PUBDEV-5554] - When running XGBoost on Hadoop, recommend that users set -extramempercent to 120. [PUBDEV-6287] - Added the new check_constant_response option to the GBM and DRF chapters. Also added an example usage to the Parameters Appendix. ๐ŸŽ [PUBDEV-6301] - Added a description of the AUCPR metric to the Model Performance section in the User Guide. ๐Ÿ›  [PUBDEV-6314] - Fixed the Random Grid Search in Python example in the Grid Search chapter.