cs231n作業：assignment1 - knn

阿新 • • 發佈：2018-11-09

title: cs231n作業：assignment1 - knn
id: cs231n-1h-1
tags:

cs231n
homework
categories:
AI
Deep Learning
date: 2018-09-26 12:41:15

GitHub地址：https://github.com/ZJUFangzh/cs231n
個人部落格：fangzh.top
使用KNN演算法來完成影象識別，資料集用的是cifar10。

首先看一下資料集的維度

# Load the raw CIFAR-10 data.
cifar10_dir = 
 'cs231n/datasets/cifar-10-batches-py'
X_train, y_train, X_test, y_test = load_CIFAR10(cifar10_dir)

# As a sanity check, we print out the size of the training and test data.
print('Training data shape: ', X_train.shape)
print('Training labels shape: ', y_train.shape)
print('Test data shape: ', X_test.shape) 

print('Test labels shape: ', y_test.shape)

可以看到，每一張圖片是 $32×32×3$ ，訓練集有50000張，測試集有10000張

Training data shape:  (50000, 32, 32, 3)
Training labels shape:  (50000,)
Test data shape:  (10000, 32, 32, 3)
Test labels shape:  (10000,)

為了更夠更快的計算，就選5000張做訓練，500張做測試就好了

# Subsample the data for more efficient code execution in this exercise
num_training = 5000
mask = list(range(num_training))
X_train = X_train[mask]
y_train = y_train[mask]

num_test = 500
mask = list(range(num_test))
X_test = X_test[mask]
y_test = y_test[mask]

而後把畫素拉成3072的行向量

# Reshape the image data into rows
X_train = np.reshape(X_train, (X_train.shape[0], -1))
X_test = np.reshape(X_test, (X_test.shape[0], -1))
print(X_train.shape, X_test.shape)

因為knn不需要訓練，所以先存入資料：

from cs231n.classifiers import KNearestNeighbor

# Create a kNN classifier instance. 
# Remember that training a kNN classifier is a noop: 
# the Classifier simply remembers the data and does no further processing 
classifier = KNearestNeighbor()
classifier.train(X_train, y_train)

然後要修改k_nearest_neighbor.py中的compute_distances_two_loops

這裡套了兩層迴圈，也就是比較訓練集和測試集的每一張圖片的間距：

def compute_distances_two_loops(self, X):
    """
    Compute the distance between each test point in X and each training point
    in self.X_train using a nested loop over both the training data and the 
    test data.

    Inputs:
    - X: A numpy array of shape (num_test, D) containing test data.

    Returns:
    - dists: A numpy array of shape (num_test, num_train) where dists[i, j]
      is the Euclidean distance between the ith test point and the jth training
      point.
    """
    num_test = X.shape[0]
    num_train = self.X_train.shape[0]
    dists = np.zeros((num_test, num_train))
    for i in xrange(num_test):
      for j in xrange(num_train):
        #####################################################################
        # TODO:                                                             #
        # Compute the l2 distance between the ith test point and the jth    #
        # training point, and store the result in dists[i, j]. You should   #
        # not use a loop over dimension.                                    #
        #####################################################################
        dists[i][j] = np.sqrt(np.sum(np.square(X[i,:] - self.X_train[j,:])))
        #####################################################################
        #                       END OF YOUR CODE                            #
        #####################################################################
    return dists

得到了一個 $(500,5000)$ 的dists矩陣。

然後修改predict_labels函式

def predict_labels(self, dists, k=1):
    """
    Given a matrix of distances between test points and training points,
    predict a label for each test point.

    Inputs:
    - dists: A numpy array of shape (num_test, num_train) where dists[i, j]
      gives the distance betwen the ith test point and the jth training point.

    Returns:
    - y: A numpy array of shape (num_test,) containing predicted labels for the
      test data, where y[i] is the predicted label for the test point X[i].  
    """
    num_test = dists.shape[0]
    y_pred = np.zeros(num_test)
    for i in xrange(num_test):
      # A list of length k storing the labels of the k nearest neighbors to
      # the ith test point.
      closest_y = []
      #########################################################################
      # TODO:                                                                 #
      # Use the distance matrix to find the k nearest neighbors of the ith    #
      # testing point, and use self.y_train to find the labels of these       #
      # neighbors. Store these labels in closest_y.                           #
      # Hint: Look up the function numpy.argsort.                             #
      #########################################################################
      #找到每一個測試圖片中對應的5000張訓練集圖片，距離最近的前k個
      closest_y = self.y_train[np.argsort(dists[i])[:k]]
      #########################################################################
      # TODO:                                                                 #
      # Now that you have found the labels of the k nearest neighbors, you    #
      # need to find the most common label in the list closest_y of labels.   #
      # Store this label in y_pred[i]. Break ties by choosing the smaller     #
      # label.                                                                #
      #########################################################################
      #然後將這K個圖片進行投票，得票數最多的就是預測值
      y_pred[i] = np.argmax(np.bincount(closest_y))
      #########################################################################
      #                           END OF YOUR CODE                            # 
      #########################################################################

    return y_pred

預測一下：

# Now implement the function predict_labels and run the code below:
# We use k = 1 (which is Nearest Neighbor).
y_test_pred = classifier.predict_labels(dists, k=1)

# Compute and print the fraction of correctly predicted examples
num_correct = np.sum(y_test_pred == y_test)
accuracy = float(num_correct) / num_test
print('Got %d / %d correct => accuracy: %f' % (num_correct, num_test, accuracy))

結果是0.274

再試試k=5的結果，是0.278

然後再修改compute_distances_one_loop函式，這次爭取只用一個迴圈

  def compute_distances_one_loop(self, X):
    """
    Compute the distance between each test point in X and each training point
    in self.X_train using a single loop over the test data.

    Input / Output: Same as compute_distances_two_loops
    """
    num_test = X.shape[0]
    num_train = self.X_train.shape[0]
    dists = np.zeros((num_test, num_train))
    for i in xrange(num_test):
      #######################################################################
      # TODO:                                                               #
      # Compute the l2 distance between the ith test point and all training #
      # points, and store the result in dists[i, :].                        #
      #######################################################################
      #利用python的廣播，一次性算出每一張圖片與5000張圖片的距離
      dists[i, :] = np.sqrt(np.sum(np.square(self.X_train - X[i, :]),axis=1))
      #######################################################################
      #                         END OF YOUR CODE                            #
      #######################################################################
    return dists

驗證一下間距是

Difference was: 0.000000
Good! The distance matrices are the same

然後爭取不用迴圈compute_distances_no_loops，這一步比較難，想法是利用平方差公式 $(x-y)^2 = x^2 + y^2 - 2xy$ ，使用矩陣乘法和二次廣播，直接算出距離，注意矩陣的維度

  def compute_distances_no_loops(self, X):
    """
    Compute the distance between each test point in X and each training point
    in self.X_train using no explicit loops.

    Input / Output: Same as compute_distances_two_loops
    """
    num_test = X.shape[0]
    num_train = self.X_train.shape[0]
    dists = np.zeros((num_test, num_train)) 
    #########################################################################
    # TODO:                                                                 #
    # Compute the l2 distance between all test points and all training      #
    # points without using any explicit loops, and store the result in      #
    # dists.                                                                #
    #                                                                       #
    # You should implement this function using only basic array operations; #
    # in particular you should not use functions from scipy.                #
    #                                                                       #
    # HINT: Try to formulate the l2 distance using matrix multiplication    #
    #       and two broadcast sums.                                         #
    #########################################################################
    temp_2xy = np.dot(X,self.X_train.T) * (-2)
    temp_x2 = np.sum(np.square(X),axis=1,keepdims=True)
    temp_y2 = np.sum(np.square(self.X_train),axis=1)
    dists = temp_x2 + temp_2xy + temp_y2
    dists = np.sqrt(dists)
    #########################################################################
    #                         END OF YOUR CODE                              #
    #########################################################################
    return dists

對比一下三種方法的時間，我這裡不知道為什麼two比one短，理論上是迴圈越少時間越短：

Two loop version took 24.510484 seconds
One loop version took 56.412211 seconds
No loop version took 0.183508 seconds

交叉驗證

用交叉驗證來找到最好的k

num_folds = 5
k_choices = [1, 3, 5, 8, 10, 12, 15, 20, 50, 100]

X_train_folds = []
y_train_folds = []
################################################################################
# TODO:                                                                        #
# Split up the training data into folds. After splitting, X_train_folds and    #
# y_train_folds should each be lists of length num_folds, where                #
# y_train_folds[i] is the label vector for the points in X_train_folds[i].     #
# Hint: Look up the numpy array_split function.                                #
################################################################################
X_train_folds = np.array_split(X_train, num_folds)
y_train_folds = np.array_split(y_train, num_folds)


################################################################################
#                                 END OF YOUR CODE                             #
################################################################################

# A dictionary holding the accuracies for different values of k that we find
# when running cross-validation. After running cross-validation,
# k_to_accuracies[k] should be a list of length num_folds giving the different
# accuracy values that we found when using that value of k.
k_to_accuracies = {}


################################################################################
# TODO:                                                                        #
# Perform k-fold cross validation to find the best value of k. For each        #
# possible value of k, run the k-nearest-neighbor algorithm num_folds times,   #
# where in each case you use all but one of the folds as training data and the #
# last fold as a validation set. Store the accuracies for all fold and all     #
# values of k in the k_to_accuracies dictionary.                               #
################################################################################
classifier = KNearestNeighbor()
for k in k_choices:
    accuracies = []
    for fold in range(num_folds):
        temp_X = X_train_folds[:]
        temp_y = y_train_folds[:]
        X_val_fold = temp_X.pop(fold)
        y_val_fold = temp_y.pop(fold)
        temp_X = np.array([y for x in temp_X for y in x])
        temp_y = np.array([y for x in temp_y for y in x])
        classifier.train(temp_X,temp_y)
        y_val_pred = classifier.predict(X_val_fold,k=k)
        num_correct = np.sum(y_val_fold == y_val_pred)
        accuracies.append(num_correct / y_val_fold.shape[0])
    k_to_accuracies[k] = accuracies
    
################################################################################
#                                 END OF YOUR CODE                             #
################################################################################

# Print out the computed accuracies
for k in sorted(k_to_accuracies):
    for accuracy in k_to_accuracies[k]:
        print('k = %d, accuracy = %f' % (k, accuracy))

畫個圖：

# plot the raw observations
for k in k_choices:
    accuracies = k_to_accuracies[k]
    plt.scatter([k] * len(accuracies), accuracies)

# plot the trend line with error bars that correspond to standard deviation
accuracies_mean = np.array([np.mean(v) for k,v in sorted(k_to_accuracies.items())])
accuracies_std = np.array([np.std(v) for k,v in sorted(k_to_accuracies.items())])
plt.errorbar(k_choices, accuracies_mean, yerr=accuracies_std)
plt.title('Cross-validation on k')
plt.xlabel('k')
plt.ylabel('Cross-validation accuracy')
plt.show()

# Based on the cross-validation results above, choose the best value for k,   
# retrain the classifier using all the training data, and test it on the test
# data. You should be able to get above 28% accuracy on the test data.
best_k = 10

classifier = KNearestNeighbor()
classifier.train(X_train, y_train)
y_test_pred = classifier.predict(X_test, k=best_k)

# Compute and display the accuracy
num_correct = np.sum(y_test_pred == y_test)
accuracy = float(num_correct) / num_test
print('Got %d / %d correct => accuracy: %f' % (num_correct, num_test, accuracy))

得到最好的k=10，準確率是0.282

小結

cs231n的作業比DeepLearning.ai的難多了，不是一個檔次的，關鍵是提示比較少，所以自己做起來比較費勁
主要要學會向量化的運算，少用loop迴圈
knn已經被淘汰了，這個作業只是讓我們入門看看影象識別大概怎麼做

cs231n作業：assignment1 - knn

title: cs231n作業：assignment1 - knn id: cs231n-1h-1 tags: cs231n homework categories: AI Deep Learning date: 2018-09-26 12:41:15

cs231n作業：assignment1 - softmax

title: cs231n作業：assignment1 - softmax id: cs231n-1h-3 tags: cs231n homework categories: AI Deep Learning date: 2018-09-27 16:02:

cs231n作業：assignment1 - svm

title: ‘cs231n作業：assignment1 - svm’ id: cs231n-1h-2 tags: cs231n homework categories: AI Deep Learning date: 2018-09-27 14:17:45

cs231n作業：assignment1 - features

GitHub地址：https://github.com/ZJUFangzh/cs231n 個人部落格：fangzh.top 抽取影象的HOG和HSV特徵。對於每張圖，我們會計算梯度方向直方圖(HOG)特徵和用HSV（Hue色調，Saturation飽和度,Value明度）顏

cs231n作業：assignment1 - two_layer_net

github地址：https://github.com/ZJUFangzh/cs231n 個人部落格：fangzh.top 搭建一個兩層的神經網路。 Forward pass 先計算前向傳播過程，編輯cs231n/classifiers/neural_net.py的Two

cs231n作業：assignment1

抽取影象的HOG和HSV特徵。對於每張圖，我們會計算梯度方向直方圖(HOG)特徵和用HSV（Hue色調，Saturation飽和度,Value明度）顏色空間的色調特徵。把每張圖的梯度方向直方圖和顏色直方圖特徵合併形成我們最後的特徵向量。粗略的講呢，HO

斯坦福cs231n課程記錄——assignment1 KNN

目錄 KNN原理某些API解釋 KNN實現作業問題記錄行業運用演算法改進參考文獻一、KNN原理 KNN是一種投票機制，依賴少數服從多數的原則，根據最近樣本的標籤進行分類的方法，屬於區域性近似。優點： 1.簡單（原因在

CS231n作業（一）KNN分類

作業說明學習ML和DL很關鍵的兩點在於對最基本的演算法的理解，以及通過程式設計將演算法復現的能力。做好這兩點，才有實現更加複雜演算法與工作的可能。否則，只會調包，跑跑開原始碼，永遠是重複別人的工作，沒有自己的理解，也就無法將演算法應用到實際任務中來。還好有cs231n

CS231n作業筆記2.1：兩層全連線神經網路的分層實現

CS231n簡介作業筆記 1. 神經網路的分層實現全連線前向傳播：out = x.reshape([x.shape[0],-1]).dot(w)+b 全連線後向傳播： x, w, b = cache dx, dw, db = No

cs231n：assignment1——Q2: Training a Support Vector Machine

目錄 svm.ipnb內容: Multiclass Support Vector Machine exercise Complete and hand in this completed worksheet (including it

CS231n——Assignment1-KNN

一、KNN 1.讀取資料 import numpy as np import random from cs231n.data_utils import load_CIFAR10 import matplotlib.pyplot as plt import os pl

python-作業：員工信息表

輸入 .get lin del 打包 staf com 字典獲取程序可實現以下功能：1、查詢，輸入select name,age from staff_table where age > 22，查詢到符合要求的信息；輸入select * from staff

ufldl學習筆記與編程作業：Linear Regression（線性回歸）

cal bug war 環境 art link 行數 ear sad ufldl學習筆記與編程作業：Linear Regression（線性回歸） ufldl出了新教程，感覺比之前的好。從基礎講起。系統清晰，又有編程實踐。在deep learning高質量群裏

day1作業：編寫登錄窗口一個文件實現

insert size strong 文件類型增加機會如果 user_list ssa 思路： 1、參考模型，這個作業我參考了linux的登錄認證流程以及結合網上銀行支付寶等鎖定規則； 1）認證流程參考的是Linux的登錄：當你輸入完用戶名

第一次互評作業：MIPS匯編程序設計

lower mov small search 在屏幕上 orm sof con print 1 .data 2 3 string1: .asciiz "*\n" 4 5 6 bstring: .asciiz 7

python之路——作業：高級FTP（僅供參考）

ice 靜態 enc lose 自己的創建目錄返回 msg 組成一、作業需求 1. 用戶加密認證2. 多用戶同時登陸3. 每個用戶有自己的家目錄且只能訪問自己的家目錄4. 對用戶進行磁盤配額、不同用戶配額可不同5. 用戶可以登陸server後，可切換目錄6. 查看當前

python之路——作業：Select FTP（僅供參考）

view info warn phi socket split list 開始序號一、作業需要使用SELECT或SELECTORS模塊實現並發簡單版FTP允許多用戶並發上傳下載文件二、README 程序實現了以下功能： 1、用戶登錄註冊（測試用戶：japh

老男孩Day3作業：工資管理系統

當前 chan 輸入 github ref 函數 txt文件 nes 第三周作業需求： 1、從info.txt文件中讀取員工及其工資信息，最後將修改或增加的員工工資信息也寫入原info.txt文件。 2、能增查改員工工資 3、增、改員工工資用空格分隔 4、實現退出

作業：閱讀任務

建議 bsp 閱讀構建什麽 code 個人概論 sts 我讀了概論與個人技術和流程這兩章，有五個問題 1.什麽軟件工程 2.軟件工程是不是只有理論 3.為什麽要學軟件工程 4.軟件的構建過程 5.怎麽用VSTS寫單元測試 6.為什麽要進行單元測試 7.Coder和Ha

第一次作業：無題

影子一點他會做的職業我也在改變 book 多公司摘要：本文在閱讀【1】博主的論文後的感想。我本身是非常不喜歡看書的人，因此看了這篇將近1.3萬自己的博主經歷自述，居然用了2個多小時，也因此算是看得比較仔細，從中多少能看到自己的影子。文中提到的博主均為博文【1】

cs231n作業：assignment1 - knn

相關推薦