CS231n Assignment 1: KNN

阿新 • Published: 2019-02-19

I. KNN

1. Loading the data

import numpy as np
import random
from cs231n.data_utils import  load_CIFAR10
import matplotlib.pyplot as plt
import os

plt.rcParams['figure.figsize']=(10.0,8.0)
plt.rcParams['image.interpolation']='nearest'
plt.rcParams['image.cmap']='gray'
# load the raw CIFAR-10 data (adjust both paths to your own checkout)
os.chdir('E://Python//deep learning CS231n//assignment1')
cifar10_dir='E://Python//deep learning CS231n//assignment1//cs231n//datasets'
X_train,y_train,X_test,y_test=load_CIFAR10(cifar10_dir)
print('Training data shape:',X_train.shape)
print("Training labels shape:",y_train.shape)
print('Test data shape:',X_test.shape)
print('Test labels shape:',y_test.shape)

The output is:

Training data shape: (50000, 32, 32, 3)
Training labels shape: (50000,)
Test data shape: (10000, 32, 32, 3)
Test labels shape: (10000,)

2. Displaying some samples

enumerate() yields (index, item) pairs from an iterable.

For the classes list used here it yields (0, 'plane'), (1, 'car'), (2, 'bird'), ...

classes=['plane', 'car', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck']
num_classes=len(classes)
samples_per_class=7
for y,cls in enumerate(classes):
    idxs=np.flatnonzero(y_train==y)  # indices in y_train whose label equals y
    idxs=np.random.choice(idxs,samples_per_class,replace=False)  # pick 7 images at random, without repeats
 
    for i,idx in enumerate(idxs):
        plt_idx=i* num_classes+y+1
        plt.subplot(samples_per_class,num_classes,plt_idx)
        plt.imshow(X_train[idx].astype('uint8'))
        plt.axis('off')
        if i==0:
            plt.title(cls)
plt.show()
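
The subplot index arithmetic in the loop above can be checked without drawing anything. The sketch below is only an illustration: it uses a shortened class list, and subplot_index is a hypothetical helper that is not part of the assignment. It shows that each class ends up in its own column and each sample in its own row.

```python
# Shortened class list, for illustration only (the real one has 10 entries).
classes = ['plane', 'car', 'bird']
num_classes = len(classes)
samples_per_class = 2


def subplot_index(row, col, num_cols):
    """1-based, row-major index, the numbering plt.subplot expects."""
    return row * num_cols + col + 1


grid = {}
for y, cls in enumerate(classes):        # y: class index, used as the column
    for i in range(samples_per_class):   # i: sample number, used as the row
        grid[(i, y)] = subplot_index(i, y, num_classes)

print(list(enumerate(classes)))
print(grid)
```

Row 0 holds indices 1 to 3 and row 1 holds indices 4 to 6, which is why the title is only set when i == 0.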

3. Subsampling the dataset

# subsample the dataset so the code runs faster
num_training=5000
mask=range(num_training)
X_train=X_train[mask]
y_train=y_train[mask]

num_test=500
mask=range(num_test)
X_test=X_test[mask]
y_test=y_test[mask]
# flatten each image into a single row vector
X_train=np.reshape(X_train,(X_train.shape[0],-1))
X_test=np.reshape(X_test,(X_test.shape[0],-1))
print (X_train.shape,X_test.shape)

X_train now has shape (5000, 3072) and X_test has shape (500, 3072); each row is one flattened 32*32*3 image.
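
A minimal sketch of what the reshape with -1 does, using two tiny made-up "images" instead of CIFAR-10 (the array names are illustrative only): the first axis is kept and NumPy infers the size of the second one.

```python
import numpy as np

# Two fake 4x4 RGB "images"; the values are just a running counter.
images = np.arange(2 * 4 * 4 * 3).reshape(2, 4, 4, 3)

# Keep the sample axis, let NumPy infer the rest: 4 * 4 * 3 = 48.
flat = np.reshape(images, (images.shape[0], -1))

print(images.shape, '->', flat.shape)
```

Each row of flat is the same data as the corresponding image read in row-major order, so no pixel values are changed by this step.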

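4. Implementing the KNN class

Distances are computed with the L2 (Euclidean) distance.

np.argsort() returns the indices that would sort an array in ascending order. We pick the k training images with the smallest distances and then count how many of them belong to each class.
np.bincount(x) returns, at each index i, the number of times i occurs in x. For example, with a=np.array([1,1,2,3,4,6]), np.bincount(a) gives [0 (count of 0), 2 (count of 1), 1, 1, 1, 0, 1].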
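
The two functions combine into the voting step as sketched below. The distances and labels are toy values chosen for illustration, not CIFAR-10 data.

```python
import numpy as np

# Toy distances from one test point to five training points, and their labels.
dists_row = np.array([4.0, 0.5, 3.0, 1.0, 2.0])
labels = np.array([2, 1, 0, 1, 2])
k = 3

nearest = np.argsort(dists_row)[:k]   # indices of the k smallest distances
closest_y = labels[nearest]           # labels of those neighbours
votes = np.bincount(closest_y)        # votes[c] = number of neighbours with label c
prediction = np.argmax(votes)         # most common label

print(nearest, closest_y, votes, prediction)
```

np.argmax returns the first position of the maximum, so a tie between two labels is resolved in favour of the smaller label, which is what the assignment asks for.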

class KNearestNeighbor:  # a class that implements the kNN classifier
    """ a kNN classifier with L2 distance """

    def __init__(self):
        pass

    def train(self, X, y):
        """
        Train the classifier. For k-nearest neighbors this is just
        memorizing the training data.
        Inputs:
        - X: A numpy array of shape (num_train, D) containing the training data
          consisting of num_train samples each of dimension D.
        - y: A numpy array of shape (N,) containing the training labels, where
             y[i] is the label for X[i].
        """
        self.X_train = X
        self.y_train = y

    def predict(self, X, k=1, num_loops=0):
        """
        Predict labels for test data using this classifier.
        Inputs:
        - X: A numpy array of shape (num_test, D) containing test data consisting
             of num_test samples each of dimension D.
        - k: The number of nearest neighbors that vote for the predicted labels.
        - num_loops: Determines which implementation to use to compute distances
          between training points and testing points.
        Returns:
        - y: A numpy array of shape (num_test,) containing predicted labels for the
          test data, where y[i] is the predicted label for the test point X[i].
        """
        if num_loops == 0:
            dists = self.compute_distances_no_loops(X)
        elif num_loops == 1:
            dists = self.compute_distances_one_loop(X)
        elif num_loops == 2:
            dists = self.compute_distances_two_loops(X)
        else:
            raise ValueError('Invalid value %d for num_loops' % num_loops)

        return self.predict_labels(dists, k=k)

    def compute_distances_two_loops(self, X):
        """
        Compute the distance between each test point in X and each training point
        in self.X_train using a nested loop over both the training data and the
        test data.
        Inputs:
        - X: A numpy array of shape (num_test, D) containing test data.
        Returns:
        - dists: A numpy array of shape (num_test, num_train) where dists[i, j]
          is the Euclidean distance between the ith test point and the jth training
          point.
        """
        num_test = X.shape[0]  # number of test samples
        num_train = self.X_train.shape[0]  # number of training samples
        dists = np.zeros((num_test, num_train))
        for i in range(num_test):
            for j in range(num_train):
                #####################################################################
                # TODO:                                                             #
                # Compute the l2 distance between the ith test point and the jth    #
                # training point, and store the result in dists[i, j]. You should   #
                # not use a loop over dimension.                                    #
                #####################################################################
                # two loops: one (test, train) pair at a time
                diff = X[i] - self.X_train[j]
                dists[i, j] = np.sqrt(np.dot(diff, diff))
                #####################################################################
                #                       END OF YOUR CODE                            #
                #####################################################################
        return dists

    def compute_distances_one_loop(self, X):
        """
        Compute the distance between each test point in X and each training point
        in self.X_train using a single loop over the test data.
        Input / Output: Same as compute_distances_two_loops
        """
        num_test = X.shape[0]
        num_train = self.X_train.shape[0]
        dists = np.zeros((num_test, num_train))
        for i in range(num_test):
            #######################################################################
            # TODO:                                                               #
            # Compute the l2 distance between the ith test point and all training #
            # points, and store the result in dists[i, :].                        #
            #######################################################################
            # X[i, :] broadcasts against every row of self.X_train
            dists[i, :] = np.sqrt(np.sum(np.square(self.X_train - X[i, :]), axis=1))
            #######################################################################
            #                         END OF YOUR CODE                            #
            #######################################################################
        return dists

    def compute_distances_no_loops(self, X):
        """
        Compute the distance between each test point in X and each training point
        in self.X_train using no explicit loops.
        Input / Output: Same as compute_distances_two_loops
        """
        num_test = X.shape[0]
        num_train = self.X_train.shape[0]
        dists = np.zeros((num_test, num_train))
        #########################################################################
        # TODO:                                                                 #
        # Compute the l2 distance between all test points and all training      #
        # points without using any explicit loops, and store the result in      #
        # dists.                                                                #
        #                                                                       #
        # You should implement this function using only basic array operations; #
        # in particular you should not use functions from scipy.                #
        #                                                                       #
        # HINT: Try to formulate the l2 distance using matrix multiplication    #
        #       and two broadcast sums.                                         #
        #########################################################################
        sq_train = np.sum(np.square(self.X_train), axis=1)  # (num_train,)
        sq_test = np.sum(np.square(X), axis=1)  # (num_test,)
        mul = np.multiply(np.dot(X, self.X_train.T), -2)  # (num_test, num_train)
        # ||a-b||^2 = ||a||^2 + ||b||^2 - 2*a.b; sq_test is turned into a column
        # so it broadcasts along rows while sq_train broadcasts along columns
        dists = sq_test[:, np.newaxis] + sq_train + mul
        # clip tiny negative values caused by round-off before taking the root
        dists = np.sqrt(np.maximum(dists, 0))
        #########################################################################
        #                         END OF YOUR CODE                              #
        #########################################################################
        return dists

    def predict_labels(self, dists, k=1):
        """
        Given a matrix of distances between test points and training points,
        predict a label for each test point.
        Inputs:
        - dists: A numpy array of shape (num_test, num_train) where dists[i, j]
          gives the distance between the ith test point and the jth training point.
        Returns:
        - y: A numpy array of shape (num_test,) containing predicted labels for the
          test data, where y[i] is the predicted label for the test point X[i].
        """
        num_test = dists.shape[0]
        y_pred = np.zeros(num_test)
        for i in range(num_test):
            # A list of length k storing the labels of the k nearest neighbors to
            # the ith test point.
            closest_y = []
            #########################################################################
            # TODO:                                                                 #
            # Use the distance matrix to find the k nearest neighbors of the ith    #
            # test point, and use self.y_train to find the labels of these          #
            # neighbors. Store these labels in closest_y.                           #
            # Hint: Look up the function numpy.argsort.                             #
            #########################################################################
            sort = np.argsort(dists[i, :])  # indices sorted by ascending distance
            index = sort[0:k]  # the k training points with the smallest distances
            closest_y = self.y_train[index]  # index the label array (not a call)
            #########################################################################
            # TODO:                                                                 #
            # Now that you have found the labels of the k nearest neighbors, you    #
            # need to find the most common label in the list closest_y of labels.   #
            # Store this label in y_pred[i]. Break ties by choosing the smaller     #
            # label.                                                                #
            #########################################################################
            y_pred[i] = np.argmax(np.bincount(closest_y))
            #########################################################################
            #                           END OF YOUR CODE                            #
            #########################################################################
        return y_pred
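
The vectorized distance rests on the identity ||a-b||^2 = ||a||^2 + ||b||^2 - 2*a.b. The following sketch, written as standalone code on small random arrays (the _toy names are hypothetical and not part of the assignment), checks that the loop form and the broadcast form give the same matrix.

```python
import numpy as np

rng = np.random.default_rng(0)
X_test_toy = rng.normal(size=(4, 6))    # 4 "test" points of dimension 6
X_train_toy = rng.normal(size=(5, 6))   # 5 "training" points of dimension 6

# Reference: explicit double loop, one pair at a time.
d_loops = np.zeros((4, 5))
for i in range(4):
    for j in range(5):
        diff = X_test_toy[i] - X_train_toy[j]
        d_loops[i, j] = np.sqrt(np.dot(diff, diff))

# Vectorized: one matrix product plus two broadcast sums.
sq_test = np.sum(X_test_toy ** 2, axis=1)[:, np.newaxis]   # shape (4, 1)
sq_train = np.sum(X_train_toy ** 2, axis=1)                # shape (5,)
cross = X_test_toy @ X_train_toy.T                         # shape (4, 5)
d_vec = np.sqrt(np.maximum(sq_test + sq_train - 2 * cross, 0))

print(np.allclose(d_loops, d_vec))
```

The column shape (4, 1) is what makes the sum broadcast to (4, 5); adding two 1-D arrays of different lengths would raise an error instead.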

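5. Training and prediction

np.linalg.norm() computes a vector or matrix norm; it is used below to measure how far apart two distance matrices are.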

classifier=KNearestNeighbor()
classifier.train(X_train,y_train)
# compute distances with the two-loop version
dists=classifier.compute_distances_two_loops(X_test)
print (dists.shape)

y_test_pred=classifier.predict_labels(dists,k=1)
num_correct=np.sum(y_test_pred==y_test)
accuracy=float(num_correct)/num_test
print("Got %d/ %d correct => accuracy: %f" %(num_correct,num_test,accuracy))

# compute distances with the one-loop version
dists_one=classifier.compute_distances_one_loop(X_test)

# check that the two distance matrices agree
difference=np.linalg.norm(dists-dists_one,ord='fro')
print("Difference was: %f" % difference)
if difference<0.001:
    print('Good! The distance matrices are the same')
else:
    print("Uh-oh! The distance matrices are different")

# fully vectorized version
dists_two=classifier.compute_distances_no_loops(X_test)
# check that the distance matrices agree
difference = np.linalg.norm(dists - dists_two, ord='fro')
print ('Difference was: %f' % (difference, ))
if difference < 0.001:
    print ('Good! The distance matrices are the same')
else:
    print ('Uh-oh! The distance matrices are different')
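
For reference, the Frobenius norm of a difference is just the square root of the sum of squared element-wise differences. A small sketch with two made-up 2x2 matrices (A and B are illustrative names) shows this, together with np.allclose as an alternative equality check.

```python
import numpy as np

A = np.array([[1.0, 2.0], [3.0, 4.0]])
B = np.array([[1.0, 2.0], [3.0, 6.0]])   # differs from A in one entry, by 2

fro = np.linalg.norm(A - B, ord='fro')   # Frobenius norm of the difference
manual = np.sqrt(np.sum((A - B) ** 2))   # same quantity written out by hand
same = np.allclose(A, B)                 # element-wise comparison with tolerance

print(fro, manual, same)
```

The norm is zero only when the two matrices are identical, so a value below a small threshold such as 0.001 indicates the implementations agree up to round-off.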

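6. Timing the different implementations

time_function takes a function as its first argument, followed by all of the arguments to pass to that function.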

def time_function(f,*args):
    import time
    tic=time.time()
    f(*args)
    toc=time.time()
    return toc-tic

two_loop_time=time_function(classifier.compute_distances_two_loops,X_test)
print("Two loop version took %f seconds" %two_loop_time)

one_loop_time=time_function(classifier.compute_distances_one_loop,X_test)
print("One loop version took %f seconds" %one_loop_time)

no_loop_time=time_function(classifier.compute_distances_no_loops,X_test)
print("No loop version took %f seconds" %no_loop_time)
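
A possible variant of the timing helper, sketched under the assumption that you also want the return value and keyword arguments forwarded. time_call is a hypothetical name; time.perf_counter is the standard-library clock intended for measuring short intervals.

```python
import time


def time_call(f, *args, **kwargs):
    """Call f(*args, **kwargs) once and return (elapsed_seconds, result)."""
    tic = time.perf_counter()
    result = f(*args, **kwargs)
    toc = time.perf_counter()
    return toc - tic, result


# Usage with a built-in function standing in for a distance computation.
elapsed, total = time_call(sum, range(1000))
print("sum took %f seconds" % elapsed)
```

Returning the result alongside the elapsed time avoids computing the distance matrix a second time when it is needed afterwards.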

7. Choosing k by cross-validation

np.array_split(x, 3) splits x into 3 parts; the length of x does not have to divide evenly.
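
A short sketch of that behaviour on a 10-element array (x and parts are illustrative names): np.array_split tolerates the uneven division, while np.split raises an error for the same input.

```python
import numpy as np

x = np.arange(10)

parts = np.array_split(x, 3)        # a list of 3 arrays
sizes = [len(p) for p in parts]     # the leading parts absorb the remainder

try:
    np.split(x, 3)                  # requires an exact division
    split_failed = False
except ValueError:
    split_failed = True

print(sizes, split_failed)
```

With 5000 training samples and 5 folds the division is exact, so every fold has 1000 rows.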

num_folds=5
k_choices=[1,3,5,8,10,12,15,20,50,100]

X_train_folds=[]
y_train_folds=[]
################################################################################
# TODO:                                                                        #
# Split up the training data into folds. After splitting, X_train_folds and    #
# y_train_folds should each be lists of length num_folds, where                #
# y_train_folds[i] is the label vector for the points in X_train_folds[i].     #
# Hint: Look up the numpy array_split function.                                #
################################################################################
X_train_folds=np.array_split(X_train,num_folds)  # split into num_folds folds; the result is a list of arrays
y_train_folds=np.array_split(y_train,num_folds)
################################################################################
#                                 END OF YOUR CODE                             #
################################################################################
# A dictionary holding the accuracies for different values of k that we find
# when running cross-validation. After running cross-validation,
# k_to_accuracies[k] should be a list of length num_folds giving the different
# accuracy values that we found when using that value of k.
k_to_accuracies = {}


################################################################################
# TODO:                                                                        #
# Perform k-fold cross validation to find the best value of k. For each        #
# possible value of k, run the k-nearest-neighbor algorithm num_folds times,   #
# where in each case you use all but one of the folds as training data and the #
# last fold as a validation set. Store the accuracies for all fold and all     #
# values of k in the k_to_accuracies dictionary.                               #
################################################################################
for k in k_choices:
    k_to_accuracies[k]=np.zeros(num_folds)

    for i in range(num_folds):
        # train on every fold except the i-th, validate on the i-th
        Xtr=np.concatenate(X_train_folds[:i]+X_train_folds[i+1:])
        ytr=np.concatenate(y_train_folds[:i]+y_train_folds[i+1:])
        Xte=X_train_folds[i]
        yte=y_train_folds[i]

        classifier.train(Xtr,ytr)
        yte_pred=classifier.predict(Xte,k=k)
        num_correct=np.sum(yte_pred==yte)
        accuracy=float(num_correct)/len(yte)
        k_to_accuracies[k][i]=accuracy


# print out the computed accuracies
for k in sorted(k_to_accuracies):
    for accuracy in k_to_accuracies[k]:
        print ('k = %d, accuracy = %f' % (k, accuracy))
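
Once the dictionary is filled, one reasonable way to pick k is to take the value with the highest mean accuracy across folds. The sketch below assumes that criterion; best_k is a hypothetical helper, and the numbers in toy are placeholders for illustration, not results from CIFAR-10.

```python
import numpy as np


def best_k(k_to_accuracies):
    """Return (k, means): the k with the highest mean accuracy, smaller k on ties."""
    means = {k: float(np.mean(acc)) for k, acc in k_to_accuracies.items()}
    # max() keeps the first maximal element, so iterating in sorted order
    # resolves ties in favour of the smaller k.
    return max(sorted(means), key=lambda k: means[k]), means


# Placeholder accuracies, two folds per k; NOT measured values.
toy = {1: [0.20, 0.30], 5: [0.30, 0.40], 10: [0.30, 0.40]}
k_star, means = best_k(toy)
print(k_star, means)
```

With the real dictionary from the loop above, the call would be best_k(k_to_accuracies), after which the classifier can be retrained on the full training set with the chosen k.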
