Apache Hivemall 0.5.2 釋出,可擴充套件的機器學習庫
Apache Hivemall 0.5.2 釋出了,Apache Hivemall 基於 Hive UDF/UDAF/UDTF,是一個可擴充套件的機器學習庫,執行基於 Hadoop 的資料處理框架,特別是 Apache Hive、Apache Spark 和 Apache Pig。
更新主要內容包括:
New Feature
[HIVEMALL-145] - Merge brickhouse functions
Improvement
[HIVEMALL-24] - Fix the prediction logic of Field-aware Factorization Machines more scalable
[HIVEMALL-46] - Make it more simpler to upgrade Spark versions
[HIVEMALL-172] - Change tree_predict 3rd argument to accept string options
[HIVEMALL-179] - Support Spark 2.3
[HIVEMALL-180] - Drop the Spark-2.0 support
[HIVEMALL-191] - Add Kryo serialization tests and remove existing workaround lazy instantiation code
[HIVEMALL-193] - Implement a tool for generating a list of Hivemall UDFs
[HIVEMALL-201] - Evaluate, fix and document FFM so Hivemall produces comparable accuracy to LIBFFM
[HIVEMALL-203] - Relocate Jackson package for to_json/from_json
[HIVEMALL-212] - Fix Classifier/Regressor not to forward zero weighted values
[HIVEMALL-215] - [DOC] Add step-by-step tutorial on the document
[HIVEMALL-222] - Introduce Gradient Clipping to avoid exploding gradient to General Classifier/Regressor
[HIVEMALL-223] - Add `-kv_map` and `-vk_map` option to to_ordered_list UDAF
詳情檢視更新日誌。
下載地址:http://hivemall.incubator.apache.org/download.html