170720 混淆矩陣繪製+pandas讀取資料（有點亂，後面抽空再整理）

阿新 • • 發佈：2019-01-07

E:\Backup\validation confusion matrix_final2

# -*- coding: utf-8 -*-
"""
Created on Fri May 19 11:17:12 2017

@author: Bruce Lau
"""
#%%
import itertools
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats

from sklearn.metrics import confusion_matrix
from p4_1_ds_competition import 
 idscnn2
# load the data
def reverse_onehot(onehot):
    y = []
    for i in onehot:
        i = i.tolist()
        y.append(i.index(max(i)))
    return y
#%%
def cal_acc(m1,m2):
    a1 = np.array(reverse_onehot(m1))
    re = sum(a1==m2)/len(m2)
#    print(re)
    return re
#%%

def plot_confusion_matrix 
(cm, classes,
                          normalize=False,
                          title='Confusion matrix',
                          cmap=plt.cm.Blues):
    """
    This function prints and plots the confusion matrix.
    Normalization can be applied by setting `normalize=True`.
    """
#    plt.figure(facecolor='w') 


    im = plt.imshow(cm, interpolation='nearest', cmap=cmap)
    plt.title(title)
#    plt.colorbar()
    plt.colorbar(im,fraction=0.046, pad=0.04)

    tick_marks = np.arange(len(classes))
    plt.xticks(tick_marks, classes) #, rotation=45
    plt.yticks(tick_marks, classes)

    if normalize:
        cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
        print("Normalized confusion matrix")
    else:
        print('Confusion matrix, without normalization')

#    print(cm)

    thresh = cm.max() / 2.
    for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])):
        cm[i,j]=round(cm[i,j],3)
        plt.text(j, i, cm[i, j],
                 horizontalalignment="center",
                 color="white" if cm[i, j] > thresh else "black")

#    plt.tight_layout()
    plt.ylabel('True fault type')
    plt.xlabel('CNN prediction fault type')


#%%
def cm_plot(y_,y,name,idx):
    class_names=np.array(['0','1','2','3','4','5','6','7','8','9'])
    cnf_matrix = confusion_matrix(y_, y)
    np.set_printoptions(precision=2)

    # Plot non-normalized confusion matrix
    #plt.figure(facecolor='w')
    #plot_confusion_matrix(cnf_matrix, classes=class_names,
    #                      title='Confusion matrix, without normalization')

    # Plot normalized confusion matrix
    plt.subplot(1,3,idx)
    plot_confusion_matrix(cnf_matrix, classes=class_names, normalize=False,
                          title='Normalized confusion matrix')
    plt.title(name)
    if idx == 3:
        plt.savefig(str(idx)+'.png',dpi=300)
        plt.show()

#%%
def papershow(de_c,fe_c):   
    # load the prediction data 9-17
    labels = np.load('labels.npy')
    # de_accuracy and fe_accuracy
    de_acc = cal_acc(de_c,labels)
    fe_acc = cal_acc(fe_c,labels)
    # ids fusion process
    me = np.ones((2500,10,2))
    me[:,:,0]=de_c
    me[:,:,1]=fe_c
    re = np.ones((2500,10))
    for i in range(2500):
        stack = me[i,:,:]
        re[i]=idscnn2(stack.T)
    # ids result
    fusion_result = cal_acc(re,labels)
    return np.array([de_acc, fe_acc, fusion_result]), re

#%%

def save_cm(path1,path2):
    de = np.load(path1)
    fe = np.load(path2)
    acc, me = papershow(de,fe)
    y1_ = de
    y2_ = fe
    y3_ = me
    y4_ = np.load('labels.npy')
    y1_ = reverse_onehot(y1_)
    y2_ = reverse_onehot(y2_)
    y3_ = reverse_onehot(y3_) 
    y4_ = y4_ 
    #
    plt.figure(facecolor='w',figsize=(16,4))
    cm_plot(y4_,y1_,'# 7 CNN model @ drive end\n accuracy=83.8%',1)

    cm_plot(y4_,y2_,'# 25 CNN model @ fan end\n accuracy=79.2%',2)

    cm_plot(y4_,y3_,'Fused model for # 7 and # 25\n accuracy=92.4%',3)

#109
#112
#%%
accuracy =  np.ones((20,3))
for i in np.arange(1,21):
    print(i)
    de = np.load('cnn_pre_pro2/pre_pro_'+str(i)+'/CA/107_de.npy')
    fe = np.load('cnn_pre_pro2/pre_pro_'+str(i)+'/CA/109_fe.npy')
    acc,re =  papershow(de,fe)
    accuracy[i-1,:] = acc
#%%
path1 = 'cnn_pre_pro2/pre_pro_1//CA/107_de.npy'
path2 = 'cnn_pre_pro2/pre_pro_1//CA/109_fe.npy'
#
save_cm(path1,path2)
#%% t-test
t_de = accuracy[:,0]
t_fe = accuracy[:,1]
t_me = accuracy[:,2]

t_de_me = stats.ttest_ind(t_de,t_me,equal_var=False)
t_fe_me = stats.ttest_ind(t_fe,t_me,equal_var=False)

print('t-test significant difference between de and me is: %f'%t_de_me[1])
print('t-test significant difference between fe and me is: %f'%t_de_me[1])
print("The averages are: ",np.mean(accuracy,axis=0))
avg = np.mean(accuracy,axis=0)
#%%
import pandas as pd
data = pd.read_excel('statistical-analysis2.xlsx',sheetname=1,skiprows=1)
data_array = data.values
print("The mean values of DS and IDS are \n", data.mean())
print('\n')
print("The std values of DS and IDS are \n", data.std())

170720 混淆矩陣繪製+pandas讀取資料（有點亂，後面抽空再整理）

E:\Backup\validation confusion matrix_final2 # -*- coding: utf-8 -*- """ Created on Fri May 19 11:17:12 2017 @author: Bruce

R語言讀取資料（Practical Data Science with R 第二章）

1、用R語言讀取檔案中的資料 1.1、用R語言讀取結構化資料以University of California Irvine Machine Learning Repository (http://archive.ics.uci.edu/ml/)的car資料為例： u

從resource中的raw資料夾中獲取檔案並讀取資料（資原始檔只能讀不能寫）

轉載：http://blog.sina.com.cn/s/blog_4d25c9870100qpax.html 一、從resource中的raw資料夾中獲取檔案並讀取資料（資原始檔只能讀不能寫） String res = ""; try{ InputStre

使用pandas模組從資料庫讀取資料（轉）

轉自：http://www.tuicool.com/articles/ZVzEz2N Python中用Pandas進行資料分析,最常用的就是Dataframe資料結構，之前寫過一篇文章介紹Pandas的基本用法，後來有些朋友問Pandas怎麼從資料庫中讀取資料，怎麼從檔

Pandas讀取檔案（read_csv與read_table 的區別）

pandas 載入檔案方式：注意，read_csv和read_table都是是載入帶分隔符的資料，每一個分隔符作為一個數據的標誌，但二者讀出來的資料格式還是不一樣的，read_table是以製表符 \t 作為資料的標誌，也就是以行為單位進行儲存。 read_cs

pandas 讀取資料集

1. read_excel pandas.read_excel(io, sheet_name=0, header=0, skiprows=None, skipfooter=0, **kwds) io: 輸入檔案的路徑名 shee

spark從mysql讀取資料（redis/mongdb/hbase等類似，換成各自RDD即可）

package com.ws.jdbc import java.sql.DriverManager import org.apache.spark.rdd.JdbcRDD import org.apache.spark.{SparkConf, SparkCont

從PCD檔案中讀取點雲資料（Reading Point Cloud data from PCD files）

在本教程中，我們將學習如何從PCD檔案中讀取點雲資料。 #程式碼首先，在你最喜歡的編輯器中建立一個名為pcd_read.cpp的檔案，並在其中放置下面的程式碼： #include <iostream> #include <pcl/io/pcd

java工具類之Excel檔案匯入、讀取資料（支援xls、和xlsx）

所需的jar包：poi的jar包儘量保持一致，不然會報版本不一致的錯誤下面是程式碼：package ReadExcel; import org.apache.poi.hssf.usermodel.HSSFWorkbook; import org.apache.poi.ss.

ffmpeg 從記憶體中讀取資料（或將資料輸出到記憶體）

原文見雷大神部落格：http://blog.csdn.net/leixiaohua1020/article/details/12980423 更新記錄（2014.7.24）： 1.為了使本文更通俗易懂，更新了部分內容，將例子改為從記憶體中開啟。 2.增加了將資料輸出

python使用pandas讀取資料檔案

可以使用pandas來方便的讀取csv檔案，免去自己處理csv時的瑣屑問題。安裝 sudo pip install pandas 或者直接使用pycharm的Setting->Interpreter->Tool直接安裝讀取csv檔案

小程式填坑之路—讀取使用者資訊、快取其資料、讀取其資料（button、wx.setStorage、wx.getStorage）

深深以為，遇見一個好的文章不容易，希望自己也能用心填坑。首先來說讀取使用者資訊，之前是用getUserInfo()，但在2018年4月30日之後，該介面不適用於開發版和測試版，正式上線的小程式不受影響。很不幸，我就是4月30後後的這批。出於

兩種方法實現STM32F103向串列埠一直髮送資料（程式原始碼，已測試)

串列埠是STM32最為重要的資源，在平時的硬體除錯和軟體除錯中都是不可或缺的工具，最近在測試一塊板子的通訊功能是否正常，我打算用板子A的串列埠USART1一直向串列埠傳送資料，用板子B的串列埠1接收資料，並將接收到的資料經過處理後顯示在LCD

習題 14.3 學校的人事部門儲存了有關學生的部分資料（學號、姓名、年齡、住址），教務部門也儲存了學生的另外一些資料（學號、姓名、性別、成績），兩個部門分別編寫了本部門的學生資料管理程式，其中都用。。

C++程式設計（第三版）譚浩強習題14.3 個人設計習題 14.3 學校的人事部門儲存了有關學生的部分資料（學號、姓名、年齡、住址），教務部門也儲存了學生的另外一些資料（學號、姓名、性別、成績），兩個部門分別編寫了本部門的學生資料管理程式，其中都用了Student作為類名。現在

【SSH網上商城專案實戰15】執行緒、定時器同步首頁資料（類似於部落格定期更新排名）

轉自：https://blog.csdn.net/eson_15/article/details/51387378 上一節我們做完了首頁UI介面，但是有個問題：如果我在後臺添加了一個商品，那麼我必須重啟一下伺服器才能重新同步後臺資料，然後重新整理首頁才能同步資

C 按行讀取檔案（但是最後一行會多輸出一行）

#include <stdio.h> int main() { char filename[] = "E:\\data_test\\commands.txt"; //檔名 &nb

在螢幕繪製兩個三角形（平面著色模式和Gouraud著色模式）

該例程有三個檔案：d3dUtility.cpp，colorTriangle.cpp，d3dUtility.h 關於d3dUtility.cpp以及d3dUtility.h兩個檔案裡面內容在部落格：Direct3D初始化例程中有詳細的解釋以及拿來就能用的原始碼但是在初始化以及繪製普通的三

分享6個月java基礎+進階精簡資料（視訊+原始碼+就業專案+面試報裝）

每天都有初學者詢問該如何學習，如何快速學習，因精力有限不能一一回復請見諒，現系統整理一套java初學者最佳的學習方法、路線、大綱及視訊資料，並對一些過期的知識點進行剔除！如Struts2，hibernate等舊框架！完全不需要在新手期進行學習，因為外面公司基本不再使用！希望

TCP 帶外資料（即緊急模式的傳送和接受）

首先給出OSI 參考模型與TCP/IP協議模型圖： 1. 概述：首先，我們需要知道的是資料分為兩種，一種是帶內資料，一種是帶外資料。帶內資料就是我們平常傳輸或者說是口頭叫的資料。帶外資料就是我們接下來講的內容。許多的傳輸層都具有帶

初識大資料（IDEA註冊，java基礎）

IDEA service破解：一、http://idea.lanyus.com/ 網站上下載http://idea.lanyus.com/JetbrainsCrack-2.10-release-en

170720 混淆矩陣繪製+pandas讀取資料（有點亂，後面抽空再整理）

相關推薦