特徵重要度展示
阿新 • 發佈:2018-12-13
RF評價特徵重要度,畫出特徵排行
"""Train a RandomForest classifier, report feature importances, and plot the top ranks.

Loads a pickled DataFrame with a binary "KILLED" target, fits a random forest,
prints a classification report, and draws a horizontal bar chart of the most
important features.
"""
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.metrics import classification_report


def read_data():
    """Load the pickled dataset and return it with a 70/30 train/test split.

    Returns:
        (df, X_train, X_test, y_train, y_test) where "KILLED" is the target
        column and the remaining columns are the features.
    """
    df = pd.read_pickle("./output/killed_collision_normal2class.pkl")
    # random_state fixed so the split (and downstream scores) are reproducible.
    X_train, X_test, y_train, y_test = train_test_split(
        df.drop(columns=["KILLED"]), df["KILLED"], test_size=0.3, random_state=0
    )
    return df, X_train, X_test, y_train, y_test


# Load the dataset at import time (side effect kept from the original script);
# the fitted splits are shared module-level state used by feature_importance().
pd_data, X_train, X_test, y_train, y_test = read_data()


def feature_importance(features_num=20):
    """Fit a random forest and print/plot the top `features_num` importances.

    Args:
        features_num: how many of the highest-ranked features to display.
            If it exceeds the number of available features, a message is
            printed and nothing is computed.
    """
    if features_num > X_train.shape[1]:
        print("the features num is too big for the trainData")
        return
    # Cap max_features at the actual feature count: a hard-coded 20 would make
    # fit() raise on datasets with fewer than 20 columns.
    forest = RandomForestClassifier(
        n_estimators=500,
        random_state=0,
        n_jobs=-1,
        max_features=min(20, X_train.shape[1]),
    )
    forest.fit(X_train, y_train)
    y_true, y_pred = y_test, forest.predict(X_test)
    print(classification_report(y_true, y_pred))

    importance = forest.feature_importances_
    # argsort ascending, reversed -> indices of features from most to least important.
    indices = np.argsort(importance)[::-1]
    print("----the importance of features and its importance_score------")

    features_names = []
    im_list = []
    for rank, i in enumerate(indices[:features_num], start=1):
        f_name = X_train.columns.values[i]
        print(rank, f_name, importance[i])
        features_names.append(f_name)
        im_list.append(importance[i])
    draw_importance(features_names, im_list)


def draw_importance(features, importances):
    """Draw a horizontal bar chart of feature importances, least to greatest.

    Args:
        features: sequence of feature names.
        importances: matching sequence of importance scores.
    """
    # Ascending order so the largest bar ends up at the top of the barh chart.
    order = np.argsort(importances)
    print(order)
    print(features)
    plt.title('Feature Importances')
    plt.barh(range(len(order)), np.array(importances)[order],
             color='b', align='center')
    plt.yticks(range(len(order)), np.array(features)[order])
    plt.xlabel('Relative Importance')
    plt.show()


if __name__ == "__main__":
    feature_importance()