All Versions
184
Latest Version
Avg Release Cycle
13 days
Latest Release
-

Changelog History
Page 8

  • v3.22.0.5 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-xia/5/index.html

    Bug

    ๐Ÿš€ [PUBDEV-6198] - Fixed an H2O hang issue in Sparkling Water deployments.

  • v3.22.0.4 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-xia/4/index.html

    Bug

    ๐Ÿš€ [PUBDEV-6109] - In Flow, fixed an issue that caused POJOs, MOJOs, and genmodel.jar to fail to download. This occurred when Flow was launched via Enterprise Steam and in any deployment where user_context was specified. [PUBDEV-6166] - On the external backedn, H2O now explicitly passes the timestamp from the Spark Driver node.

  • v3.22.0.3 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-xia/3/index.html

    Bug

    ๐Ÿ›  [PUBDEV-5829] - Fixed an issue with the REST API. Calling "get model" no longer returns 0 for the timestamp of the model. [PUBDEV-5959] - The PySparking client no longer hangs after re-connecting to the H2O external backend. ๐Ÿ›  [PUBDEV-5990] - Fixed an OOM issue in h2o.arrange. ๐Ÿ›  [PUBDEV-6059] - Fixed an issue that caused importing Pargue files with large Double data to fail. [PUBDEV-6076] - After applying group_by to a time stamped column, the original time stamp format is now retained. 0๏ธโƒฃ [PUBDEV-6079] - In AutoML, cross-validation metrics are now used for early stopping by default. Because of this, the validation_frame argument is now ignored unless nfolds==0 and, in that case, will be used for early stopping. ๐Ÿ›  [PUBDEV-6098] - Fixed an issue that caused the MOJO visualizer to fail for Isolation Forest models. [PUBDEV-6101] - StackedEnsembleMojoModel is now serializable. ๐Ÿ›  [PUBDEV-6107] - In the R client, fixed an error that occurrred when running getModelTree. ๐Ÿš€ [PUBDEV-6109] - In Flow, fixed an issue that caused POJOs, MOJOs, and genmodel.jar to fail to download. This occurred when Flow was launched via Enterprise Steam and in any deployment where user_context was specified. ๐Ÿ›  [PUBDEV-6111] - Fixed the formula used for calculating L2 distance. [PUBDEV-6117] - The Python client now allows users to enable XGBoost compare with any H2O frame. The convert_H2OFrame_2_DMatrix method accepts any H2O frame and can convert it to valid data for native XGBoost. [PUBDEV-6120] - H2O XGBoost now reports correct variable importances. The variable importances are computed from the gains of their respective loss functions during tree construction. [PUBDEV-6122] - Users can now save PDP plots. ๐Ÿ›  [PUBDEV-6123] - Fixed an issue that resulted in a SQL exception when connecting H2O to a SQL server and importing a table. ๐Ÿ›  [PUBDEV-6137] - Fixed an issue with GCS support on Hadoop environments.

    New Feature

    [PUBDEV-1984] - Added monotonic variables for GBM. [PUBDEV-6030] - EasyPredictModelWrapper now calculates reconstruction errors for AutoEncoder. [PUBDEV-6091] - When running a grid search, a timesteamp column was added that shows when each model was added to the grid summary table.

    Improvement

    [PUBDEV-5865] - In GBM, users can now specify the monotone_constraints parameter. [PUBDEV-6106] - Prediction contributions from each tree from MOJO to easywrapper are now exposed. โšก๏ธ [PUBDEV-6110] - Updated Gradle to version 5.0. ๐Ÿ›  [PUBDEV-6115] - Fixed the output of rankTsv in the AutoML leaderboard.

    ๐Ÿ“„ Docs

    โšก๏ธ [PUBDEV-4377] - Updated the Prediction section to include information on how the prediction threshold is selected for classification problems. โšก๏ธ [PUBDEV-6105] - Updated the description of enum_limited to indicate that T=1024. [PUBDEV-6148] - In the GBM chapter, added monotone_constraints to list of available parameters.

  • v3.22.0.2 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-xia/2/index.html

    Bug

    ๐Ÿ“œ [PUBDEV-3281] - Fixed an issue that caused ARFF parser to parse some file incorrectly. ๐Ÿ›  [PUBDEV-4737] - When performing a grid search in Python, fixed an issue that caused all models to return a model.type of "supervised." [PUBDEV-5352] - When running DRF in the Python client, checkpointing on new data now works correctly. ๐Ÿ›  [PUBDEV-5869] - Fixed an issue that caused the confusion matrix recall and precision values to be switched. ๐Ÿ›  [PUBDEV-6036] - In the Python client, fixed an issue that caused the offset_column parameter to be ignored when it was passed in the GLM train statement. [PUBDEV-6042] - The H2O Tree Handler now works correctly on Isolation Forest models. ๐Ÿ›  [PUBDEV-6046] - When running AutoML, fixed an issue that resulted in a "Failed to get metric: auc from ModelMetrics type BinomialGLM" message. [PUBDEV-6050] - In Flow, Precision and Recall definitions are no longer inverted in the confusion matrix. ๐Ÿ›  [PUBDEV-6052] - Fixed the error message that displays when converting from a pandas dataframe to an h2oframe in Python 3.6. ๐Ÿ›  [PUBDEV-6054] - In XGBoost, fixed an issue that resulted in a "Maximum amount of file descriptors hit" message. ๐Ÿ›  [PUBDEV-6060] - Fixed the description of sample_rate in Isolation Forest. 0๏ธโƒฃ [PUBDEV-6063] - Cross validation models are no longer deleted by default. ๐Ÿ›  [PUBDEV-6065] - When viewing an AutoML leaderboard, fixed an issue that resulted in an ArrayIndexOutOfBoundsException if sort_metric was specified but no model was built.

    New Feature

    [PUBDEV-5766] - Added monotonicity constraints to H2O XGBoost.

    Task

    [PUBDEV-6039] - When generating MOJOs, h2o-genmodel.jar now includes a check for MOJO version 1.3 to determine whether the ho2-genmodel.jar and the MOJO version can work together. Prior versions of h2o-3 did not include MOJO 1.3, and as a result, MOJOs silently returned predicted values executed on an empty vector.

    Improvement

    ๐Ÿ“œ [PUBDEV-5705] - With a new skipped_columns option, users can now specify to drop specific columns before parsing. Note that this functionality is not supported for SVMLight or Avro file formats. [PUBDEV-6062] - The GLM multinomial coefficient table now includes the original levels as column names.

    ๐Ÿ“„ Docs

    ๐ŸŽ [PUBDEV-3216] - Created new Performance & Prediction and Variable Importance sections in the User Guide. 0๏ธโƒฃ [PUBDEV-5313] - Updatd the default value of categorical_encoding for XGBoost. This defaults to Auto (which is one_hot_encoding). โšก๏ธ [PUBDEV-6012] - In the parameter entry for weights_column, updated the example to exclude the weight column in the list of predictors. โšก๏ธ [PUBDEV-6016] - In the DRF FAQ, updated the "What happens when you try to predict on a categorical level not seen during training?" question. ๐Ÿ“„ [PUBDEV-6025] - TargetingEncoder is now included in the Python module docs. ๐Ÿ“š [PUBDEV-6041] - In GLM, updated the documentation to indicate that coordinate_descent is no longer experimental. [PUBDEV-6064] - Added default values for max_depth, sample_size, and sample_rate. Also added a parameter description entry for sample_size, showing an Isolation Forest example. [PUBDEV-6086] - Added the new monotone_constraints option to the XGBoost chapter.

  • v3.22.0.1 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-xia/1/index.html

    Bug

    [PUBDEV-5023] - In Python, the metalearner method is only available for Stacked Ensembles. โœ… [PUBDEV-5658] - Fixed an issue that caused micro benchmark tests to fail to run in the jmh directory. ๐Ÿ›  [PUBDEV-5663] - Fixed an issue that caused H2O to fail to export dataframes to S3. [PUBDEV-5745] - Added the keep_cross_validation_models argument to Grid Search. [PUBDEV-5746] - Improved efficiency of the keep_cross_validation_models parameter in AutoML [PUBDEV-5777] - Simplified the comparison of H2OXGBoost with native XGBoost when using the Python client. ๐Ÿ›  [PUBDEV-5780] - Fixed JDBC ingestion for Teradata databases. [PUBDEV-5824] - In the Python client and the Java API, multiple runs of the same AutoML instance no longer fail training new "Best Of Family" SE models that would include the newly generated models. ๐Ÿ›  [PUBDEV-5873] - Fixed an issue that resulted in an AssertionError when calling cbind from the Python client. [PUBDEV-5881] - AutoML now enforces case for the sort_metric option when using the Java API. โš™ [PUBDEV-5903] - In AutoML, StackEnsemble models are now always trained, even if we reached max_runtime_secs limit. ๐Ÿ“š [PUBDEV-5904] - In the R client, added documentation for helper functions. [PUBDEV-5922] - Renamed x to X in the H2O-sklearn fit method to be consistent with the sklearn API. ๐Ÿ”€ [PUBDEV-5924] - Merging datasets now works correctly. ๐Ÿ— [PUBDEV-5931] - Building on Maven with h2o-ext-xgboost on versions later than 3.18.0.11 no longer results in a dependency error. ๐Ÿ“œ [PUBDEV-5933] - Fixed a Java 11 ORC file parsing failure. โฌ†๏ธ [PUBDEV-5954] - Upgraded the version of the lodash package used in H2O Flow. [PUBDEV-5967] - -ip localhost now works correctly on WSL. ๐Ÿ“œ [PUBDEV-5971] - CSV/ARFF Parser no longer treats blank lines as data lines with NAs. [PUBDEV-5976] - Starting h2o-3 from the Python Client no longer fails on Java 10.0.2. ๐Ÿ›  [PUBDEV-5995] - Fixed an issue that caused StackedEnsemble MOJO model to return an "IllegalArgumentException: categorical value out of range" message. ๐Ÿšš [PUBDEV-5996] - Removed the "nclasses" parameter from tree traversal routines. [PUBDEV-5998] - Exposed H2OXGBoost parameters used to train a model to the Python API. Previously, this information was visible in the Java backend but was not passed back to the Python API. ๐Ÿšš [PUBDEV-5999] - Removed "illegal reflective access" warnings when starting H2O-3 with Java 10. [PUBDEV-6004] - In Stacked Ensembles, changes made to data during scoring now apply to all models. โšก๏ธ [PUBDEV-6005] - When running AutoML in Flow, updated the list of algorithms that can ber selected in the "Exclude These Algorithms" section.

    New Feature

    [PUBDEV-5170] - Individual predictions of GBM trees are now exposed in the MOJO API. [PUBDEV-5378] - Exposed target encoding in the Java API. [PUBDEV-5399] - The keep_cross_validation_fold_assignment option is now available in AutoML. ๐Ÿ‘ [PUBDEV-5609] - Added support for the Isolation Forest algorithm in H2O-3. Note that this is a Beta version of the algorithm. [PUBDEV-5668] - Added the keep_cross_validation_fold_assignment option to AutoML in Flow. ๐Ÿ”– [PUBDEV-5681] - h2o.connect no longer ignores strict_version_check=FALSE when connecting to a Steam cluster. ๐Ÿ‘ท [PUBDEV-5695] - Created an R demo for CoxPH. This is available here. [PUBDEV-5775] - It is now possible to combine two models into one MOJO, with the second model using the prediction from the first model as a feature. These models can be from any algorithm or combination of algorithms except Word2Vec. [PUBDEV-5852] - Implemented h2oframe.fillna(method='backward'). [PUBDEV-5977] - Improved speed-up of AutoML training on smaller datesets in client mode (Sparkling Water). [PUBDEV-5979] - Exposed Java Target Encoding in the Python client. ๐Ÿšš [PUBDEV-5988] - Users can now specify a -features parameter when starting h2o from the command line. This allows users to remove experimental or beta algorithms when starting H2O-3. Available options for this parameter include beta, stable, and experimental.

    Task

    [PUBDEV-4507] - Added XGBoost to AutoML. [PUBDEV-5696] - Added an option to allow users to use a user-specified JDBC driver. [PUBDEV-5722] - Exposed pr_auc to areas where you can find AUC, including scoring_history, model summary. Also added h2o.pr_auc() in R. ๐Ÿ‘ [PUBDEV-5901] - Added support for Java 11. ๐Ÿ“š [PUBDEV-6001] - Improved the AutoML documentation in the User Guide.

    Improvement

    [PUBDEV-5590] - Added a MAX_USR_CONNECTIONS_KEY argument to limit number of sessions for import_sql_table. ๐ŸŽ [PUBDEV-5669] - Improved performance gap when importing data using Hive2. [PUBDEV-5719] - Improved and cleaned up output for the h2o.mojo_predict_csv and h2o.mojo_predict_df functions. [PUBDEV-5743] - Users can now visualize XGBoost trees when running predictions. [PUBDEV-5761] - Added weights to partial depenced plots. Also added a level for missing values. [PUBDEV-5822] - Users can now download the genmodel.jar in Flow for completed models. [PUBDEV-5886] - In AutoML, changed the default for keep_cross_validation_models and keep_cross_validation_predictions from True to False. ๐Ÿ‘ [PUBDEV-5888] - Added support for predicting using the XGBoost Predictor. โšก๏ธ [PUBDEV-5909] - In XGBoost, optimized the matrix exchange between Java and native C++ code. [PUBDEV-5913] - Improved the h2o-3 README for installing in R and IntelliJ IDEA. [PUBDEV-5927] - Introduced a simple "streaming" mode that allows H2O to read from a table using basic SQL:92 constructs. [PUBDEV-5929] - In AutoML, stopping_metric is now based on sort_metric. [PUBDEV-5952] - The requirements.txt file now includes the Colorama version. [PUBDEV-5961] - In lockable.java, delete is now final in order to prevent inconsistent overrides. โช [PUBDEV-5964] - Reverted AutoML naming change from Auto.Algo to Auto.algo. [PUBDEV-6000] - In AutoML, automatic partitioning of the valiation frame now uses 10% of the training data instead of 20%. [PUBDEV-6002] - Changed model and grid indexing in autogenerated model names in AutoML to be 1 instead of 0 indexed. [PUBDEV-6017] - Allow public access to H2O instances started from R/Python. This can be done with the new bind_to_localhost (Boolean) parameter, which can be specified in h2o.init().

    ๐Ÿ“„ Docs

    ๐Ÿ— [PUBDEV-4505] - Added Scala and Java examples to the Building and Extracting a MOJO topic. [PUBDEV-4590] - Added a Scala example to the Stacked Ensembles topic. ๐Ÿ“š [PUBDEV-5949] - Added Tree class method to the Python module documentation. ๐Ÿ“š [PUBDEV-5641] - Removed references to UDP in the documentation. ๐Ÿšš [PUBDEV-5664] - Removed Sparkling Water topics from H2O-3 User Guide. These are in the Sparkling Water User Guide. ๐Ÿ”Š [PUBDEV-5674] - Added a Resources section to the Overview and included links to the awesome-h2o repository, H2O.ai blogs, and customer use cases. ๐Ÿ“š [PUBDEV-5693] - Updated GCP Installation documentation with infomation about quota limits. ๐Ÿ“š [PUBDEV-5709] - Updated Gains/Lift documentation. 16 groups are now used by default. [PUBDEV-5756] - Added Python examples to the Cross-Validation topic in the User Guide. [PUBDEV-5762] - Added loss_by_col and loss_by_col_idx to list of GLRM parameters. [PUBDEV-5810] - Updated documentation for class_sampling_factors. balance_classes must be enabled when using class_sampling_factors. ๐Ÿณ [PUBDEV-5839] - Added a Python example for initializing and starting h2o-3 in Docker. ๐Ÿ“š [PUBDEV-5857] - Updated the Admin menu documentation in Flow after adding "Download Gen Model" option. ๐Ÿ‘ [PUBDEV-5905] - In GBM and DRF, enum_limited is a supported option for categorical_encoding. ๐Ÿ’ป [PUBDEV-5962] - Added the -notify_local flag to list of flags available when starting H2O-3 from the command line. ๐Ÿ“š [PUBDEV-5982] - Added documentation for Isolation Forest (beta).

  • v3.20.0.9 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-wright/9/index.html

    Bug

    ๐Ÿ›  [PUBDEV-5930] - Fixed an issue that caused H2O to fail when loading a GLRM model.

    Improvement

    [PUBDEV-5938] - log4j.properties can be loaded from classpath. ๐Ÿ”ง [PUBDEV-5939] - Buffer configuration is now available for http/https connections.

  • v3.20.0.8 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-wright/8/index.html

    Bug

    ๐Ÿ“œ [PUBDEV-5855] - Fixed an issue that occurred when parsing columns that include double quotation. โš™ [PUBDEV-5880] - The max_runtime_secs option is no longer ignored when using the Python client. โœ… [PUBDEV-5906] - Fixed an XGBoost Sparsity detection test to make it deterministic. ๐Ÿ“œ [PUBDEV-5907] - Hadoop driver class no longer fails to parse new Java version string.

    New Feature

    [PUBDEV-5861] - Added a GBM/DRF Tree walker API in the R client. [PUBDEV-5862] - The R API for obtaining and traversing model trees in GBM/DRF is available in Python.

    Improvement

    ๐Ÿ‘ [PUBDEV-5706] - Added support for user defined split points in partial dependence plots. [PUBDEV-5748] - Confusion matrices can now be generated in Flow. [PUBDEV-5900] - Java version error messages now reference versions 7 and 8 instead of 1.7 and 1.8. [PUBDEV-5902] - A Python tree traversal demo is available at https://github.com/h2oai/h2o-3/blob/master/h2o-py/demos/tree_demo.ipynb.

  • v3.20.0.7 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-wright/7/index.html

    Bug

    ๐Ÿ›  [PUBDEV-5826] - Fixed an issue that caused a mismatch between GLRM MOJO predict and GLRM predict. ๐Ÿ›  [PUBDEV-5841] - Fixed an issue that caused H2O XGBoost grid search to fail even when sizing the sessions 4xs the data size and using extramempercent of 150. [PUBDEV-5848] - When performing multiple AutoML runs using the H2O R client, viewing the first AutoML leaderboard no longer results in an error. [PUBDEV-5864] - H2O now only binds to the local interface when started from R/Python. ๐Ÿ›  [PUBDEV-5871] - Fixed an issue that caused DeepLearning and XGBoost MOJOs to get a corrupted input row. This occurred when GenModel's helper functions that perform 1-hot encoding failed to take correctly into considerations cases where useAllFactorLevels = false and corrupted the first categorical value in the input row. [PUBDEV-5872] - Added gamma, tweedie, and poisson objective functions to the XGBoost Java Predictor. ๐Ÿ›  [PUBDEV-5877] - Fixed an issue in HDFS file import. In rare cases the import could fail due to temporarily inconsistent state of H2O distributed memory.

  • v3.20.0.6 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-wright/6/index.html

    Bug

    ๐Ÿ“œ [PUBDEV-5724] - H2oApi.frameColumn in h2o-bindings.jar now correctly parses responses. ๐Ÿš€ [PUBDEV-5751] - biz.k11i:xgboost-predictor:0.3.0 is now ported to the h2oai repo and released to Maven Central. This allows for easier deployment of H2O and Sparkling Water. [PUBDEV-5786] - In GLM, the coordinate descent solver is now only disabled for when family=multinomial. ๐Ÿ“œ [PUBDEV-5792] - Fixed an issue that caused the H2O parser to hang when reading a Parquet file. ๐Ÿ›  [PUBDEV-5803] - Fixed an issue that resulted in an AutoML "Unauthorized" Error when running through Enterprise Steam via R. [PUBDEV-5818] - Leaf Node assignment no longer produces the wrong paths for degenerated trees. ๐Ÿš€ [PUBDEV-5823] - Updated the list of Python dependencies on the release download page and in the User Guide. ๐Ÿ›  [PUBDEV-5826] - Fixed an issue that resulted in a mismatch between GLRM predict and GLRM MOJO predict. [PUBDEV-5844] - Launching H2O on a machine with greater than 2TB no longer results in an integer overflow error. ๐Ÿ“œ [PUBDEV-5847] - The HTTP parser no longer reads fewer rows when the data is compressed. [PUBDEV-5851] - AstFillNA Rapids expression now returns H2O.unimp() on backward methods.

    New Feature

    [PUBDEV-5735] - In GBM and DRF, tree traversal and information is now accessible from the R and Python clients. This can be done using the new h2o.getModelTree function. [PUBDEV-5779] - In GBM, added a new staged_predict_proba function. [PUBDEV-5812] - MOJO output now includes terminal node IDs. [PUBDEV-5832] - GBM/DRF, the H2OTreeClass function now allows you to specify categorical levels.

    Task

    โšก๏ธ [PUBDEV-5845] - Updated the XGBoost dependency to ai.h2o:xgboost-predictor:0.3.1.

    Improvement

    [PUBDEV-5837] - Terminal node IDs can now be retrieved in the predict_leaf_node_assignment function.

    ๐Ÿ“„ Docs

    ๐Ÿ‘ [PUBDEV-5836] - The User Guide now indicates that only Hive versions 2.2.0 or greater are supported for JDBC drivers. Hive 2.1 is not currently supported. ๐Ÿ“š [PUBDEV-5838] - In GLM, the documentation for the Coordinate Descent solver now notes that Coordinate Descent is not available when family=multinomial.

  • v3.20.0.5 Changes

    ๐Ÿš€ Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-wright/5/index.html

    Bug

    โœ… [PUBDEV-5543] - Hive smoke tests no longer time out on HDP. [PUBDEV-5793] - AutoML now correctly ignores columns specified in Flow. [PUBDEV-5794] - In Flow, the Import SQL Table button now works correctly. [PUBDEV-5806] - XGBoost cross validation now works correctly. [PUBDEV-5811] - Fixed an issue that caused AutoML to fail in Flow due to the keep_cross_validation_fold_assignment option. [PUBDEV-5814] - Multinomial Stacked Ensemble no longer fails when either XGBoost or Naive Bayes is the base model. ๐Ÿ›  [PUBDEV-5816] - Fixed an issue that caused XGBoost to generate the wrong metrics for multinomial cases. [PUBDEV-5819] - Increased the client_disconnect_timeout value when ClientDisconnectCheckThread searches for connected clients.

    Improvement

    โœ… [PUBDEV-5813] - Added automated Flow test for AutoML.