sklearn 可視化模型的訓練測試收斂情況和特征重要性

阿新 • • 發佈：2018-08-15

object 畫出 ted stat mea 重要模型 error nbsp

show the code:

# Plot training deviance
def plot_training_deviance(clf, n_estimators, X_test, y_test):
    # compute test set deviance
    test_score = np.zeros((n_estimators,), dtype=np.float64)
    for i, y_pred in enumerate(clf.staged_predict(X_test)):
        test_score[i] = clf.loss_(y_test, y_pred)
    plt.figure(figsize 
=(12, 6))
    plt.subplot(1, 2, 1)
    plt.title(‘Deviance‘)
    train_score = clf.train_score_
    logging.info("len(train_score): %s" % len(train_score))
    logging.info(train_score)
    logging.info("len(test_score): %s" % len(test_score))
    logging.info(test_score)
    plt.plot(np.arange(n_estimators)  
+ 1, train_score, ‘b-‘,
             label=‘Training Set Deviance‘)
    plt.plot(np.arange(n_estimators) + 1, test_score, ‘r*‘, label=‘Test Set Deviance‘)
    plt.legend(loc=‘upper right‘)
    plt.xlabel(‘Boosting Iterations‘)
    plt.ylabel(‘Deviance‘)
    plt.show()


# Plot feature importance 

def plot_feature_importance(clf, feature_names):
    feature_importance = clf.feature_importances_
    # make importances relative to max importance
    feature_importance = 100.0 * (feature_importance / feature_importance.max())
    sorted_idx = np.argsort(feature_importance)
    pos = np.arange(sorted_idx.shape[0]) + .5
    plt.subplot(1, 2, 2)
    plt.barh(pos, feature_importance[sorted_idx], align=‘center‘)
    # plt.yticks(pos, feature_names[sorted_idx])
    plt.yticks(pos, [feature_names[idx] for idx in sorted_idx])
    plt.xlabel(‘Relative Importance‘)
    plt.title(‘Variable Importance‘)
    plt.show()


class Train(object):
    def __init__(self, data_file):
        self.data_file = data_file
        self.x_fields = ["xxx", "xxx", "xxx"]
        self.x_features, self.y_labels = self.load_data()

    def load_data(self):
        x_features, y_labels = [], []
        # ......
        return x_features, y_labels

    def train_model(self):
        model = GradientBoostingRegressor(random_state=42)
        model.fit(self.x_features, self.y_labels)
        y_pred = model.predict(self.x_features)
        logging.info("mean_squared_error: %.6f" % mean_squared_error(self.y_labels, y_pred))
        logging.info("mean_squared_log_error: %.6f" % mean_squared_log_error(self.y_labels, y_pred))

        plot_training_deviance(clf=model, n_estimators=model.get_params()["n_estimators"], X_test=self.x_features, y_test=self.y_labels)
                               
        # 輸出feature重要性
        logging.info("feature_importances_: %s" % model.feature_importances_)
        plot_feature_importance(clf=model, feature_names=self.x_fields)

參考的是sklearn中的樣例: Gradient Boosting regression — scikit-learn 0.19.2 documentation

畫出的圖如下所示：

技術分享圖片

sklearn 可視化模型的訓練測試收斂情況和特征重要性

object 畫出 ted stat mea 重要模型 error nbsp show the code: # Plot training deviance def plot_training_deviance(clf, n_estimators, X_tes

sklearn中xgboost模塊中plot_importance函數（特征重要性）

sklearn spl dict target hub datasets 目的 features 特征 # -*- coding: utf-8 -*- """ ########################################################

sklearn中樹模型可視化的方法

方法 ron 問題 style 業界們的 graphviz 還需要 plus 在機器學習的過程中，我們常常會用到樹模型的方式來解決我們的問題。在工業界，我們不僅要針對某個問題利用機器學習的方法來解決問題，而且還需要能力解釋其中的原理或原因。今天主要在這裏記錄一下樹模型是怎

matplotlib.pyplot可視化訓練結果

tensorflowmatplotlib.pyplot可視化訓練結果註：程序和數據來自上篇blog #定義激勵函數並定義一個添加神經層函數 import tensorflow as tf import numpy as np import matplotlib.pyplot as plt de

LDA模型數據的可視化

好的 strip pan remove 從大到小 ems open 可視化 except 1 """ 2 執行lda2vec.ipnb中的代碼 3 模型LDA 4 功能：訓練好後模型數據的可視化 5 """ 6 7 from lda

利用sklearn獲取手寫數字數據集，並進行可視化

字數 size pre code http text 添加 col sha %matplotlib inline from sklearn import datasets from matplotlib import pyplot as plt #獲取數據集 digits

DeepTracker: Visualizing the Training Process of Convolutional Neural Networks（對卷積神經網絡訓練過程的可視化）

training ces ini net mini 個人 src works con \ 裏面主要的兩個算法比較難以贅述，miniset主要就是求最小公共子集。（個人認為）DeepTracker: Visualizing the Train

tensorflow加載embedding模型進行可視化

labels model 代碼 worker shape ++ -c glob gin 1.功能采用python的gensim模塊訓練的word2vec模型，然後采用tensorflow讀取模型可視化embedding向量 ps:采用C++版本訓練的w2v模型，pyt

R數據可視化----ggplot2之標度、坐標軸和圖例詳解

abs 調整所有不同的 size n) 默認表達 idt 標度控制著數據到圖形屬性的映射，當有需要時，ggplot2會自動添加一個默認的標度。我們確實可以在不了解標度運行原理的情況下畫出許多圖形，但理解標度並學會如何操縱它們則將賦予我們對圖形更強的控制能力。每一種圖

Regexper可視化正則表達式工具

正則表達式正則工具Regexper可視化正則表達式工具Enter Javascript-style regular expression to dispalyhttps://regexper.com/http://www.regexpal.com/正則表達式30分鐘教程 https://deerchao.n

如何將枯燥的大數據呈現為可視化的圖？

大數據可視化將數據轉化成可視化圖表/形，其實一個工具就能完成，礙於工具太多，按照使用場景，暫且將已成熟應用的分為三個層次：第一層：數據報告、信息圖這裏統稱信息圖。信息圖是把數據、信息或知識可視化，必須要有一個清楚準確的解釋或表達甚為復雜且大量的信息。代表人物是新聞界的David McCandles

第三篇：數據可視化 - ggplot2

strong 保存轉換成特征散點圖說明 pdf格式 ota 目的前言 R語言的強大之處在於統計和作圖。其中統計部分的內容很多很強大，因此會在以後的實例中逐步介紹；而作圖部分的套路相對來說是比較固定的，現在可以先對它做一個總體的認識。

第二篇：數據可視化 - 基本API

數據挖掘 idt 示例 iyu 大小 blue .com sof 個性化前言數據可視化是數據挖掘非常重要的一個環節，它不單在查閱了解數據環節使用到，在整個數據挖掘的流程中都會使用到。因為數據可視化不單可以形象地展示數據，讓你對數據有更好

Docker可視化界面（Consul+Shipyard+Swarm+Service Discover）部署記錄

agen net 映射 control pro doc labs 容器默認賬戶前面一篇說到了Docker管理工具-Swarm部署記錄，基於這個環境，下面記錄下Docker可視化界面部署過程： 1）下載相關驚喜 manager-node節點（182.48.115.

87、使用TensorBoard進行可視化學習

哈哈哈 tput sco 而在封裝結果 average 實現 machine 1、還是以手寫識別為類，至於為什麽一直用手寫識別這個例子，原因很簡單，因為書上只給出了這個類子呀，哈哈哈，好神奇下面是可視化學習的標準函數 ‘‘‘ Created on 2017年5月23

三角網格表面高斯曲率的計算與可視化

綠色調試運行即將簡單坐標 com 框架搭建 alt 建立好久沒有寫代碼了，最近拿計算三角網格表面的高斯曲率練了練手，並實現了高斯曲率的可視化，復習了一點微分幾何的知識。感覺有時候還是要自己把代碼寫出來，調試運行，結合試驗結果，才能對相應的知識有更深的了解。所謂曲

Dionaea蜜罐IP數據地圖可視化

pmap git numpy file py3 tree 所有 open 結果 #關於如何簡單搭建Dionaea低交互式蜜罐，詳見博文 #前言. 　　以我在洛杉磯租用某臺VPS上搭建的Dionaea蜜罐，在5.26晚23.58至5.28日17.10時間段(41h)

數據可視化入門之show me the numbers

推薦有趣的好的 style blank 分享 span 需要 width 數據的可視化一直是自己瞎玩著學，近來想系統的學數據可視化的東西，於是搜索資料時看到有人推薦《show me the numbers》作為入門。由於搜不到具體的書籍內容，只能搜到一個

mysqlbinlog 可視化查看sql語句

sql語句 mysqlbinlog 可視化查看直接mysqlbinlog導出來的文件，執行sql部分的sql語句顯示為base64編碼格式，無法正常閱讀。所以生成sql記錄的時候，不能用常規的辦法去生成，需要加上相應的參數才能顯示出sql語句--base64-output=decode-rows

詳解Redis 可視化圖形監控界面 RedisLive

redis作為一款開源的 Redis 圖形化監控工具，RedisLive 提供對 Redis 實例的內存使用情況，接收的客戶端命令，接收的請求數量以及鍵進行監控。RedisLive 的工作原理基於 Redis 的 INFO 和 MONITOR 命令，通過向 Redis 實例發送 INFO 和 MONITOR

sklearn 可視化模型的訓練測試收斂情況和特征重要性

相關推薦