林軒田 Machine Learning Foundations – Homework 3: Python source code
阿新 · Published: 2019-01-10
Hi everyone, here is my own Python reference code for Homework 3 of 林軒田's Machine Learning Foundations. I don't have much engineering experience with Python, so if you spot a mistake or know a better way to write or optimize the code, please leave a comment so we can learn from each other. Thanks.
Questions 13–15 are about linear regression for classification and feature transforms. All sample points are generated by the target function
f(x1, x2) = sign(x1^2 + x2^2 − 0.6).
13.1: Fit the data with linear regression and no feature transform, i.e., use the feature vector (1, x1, x2) directly. Run the experiment 1000 times, plot a histogram of Ein over the 1000 runs, and compute the average Ein.
### For Questions 13-15, generate a training set of N = 1000 points on X = [-1, 1] x [-1, 1] with uniform
### probability of picking each x in X. Generate simulated noise by flipping the sign of the output in a
### random 10% subset of the generated training set.
import random
import numpy as np
import matplotlib.pyplot as plt
### target function f(x1, x2) = sign(x1^2 + x2^2 - 0.6)
def target_function(x1, x2):
return (1 if (x1*x1 + x2*x2 - 0.6) >= 0 else -1)
### plot dot picture, two dimension features
def plot_dot_picture(features, labels, w=np.zeros((3, 1))):
    x1 = features[:,1]
    x2 = features[:,2]
    y = labels[:,0]
plot_size = 20
size = np.ones((len(x1)))*plot_size
size_x1 = np.ma.masked_where(y<0 , size)
size_x2 = np.ma.masked_where(y>0, size)
### plot scatter
    plt.scatter(x1, x2, s=size_x1, marker='x', c='r', label='y = +1')
    plt.scatter(x1, x2, s=size_x2, marker='o', c='b', label='y = -1')
### plot w line
x1_tmp = np.arange(-1,1,0.01)
x2_tmp = np.arange(-1,1,0.01)
x1_tmp, x2_tmp = np.meshgrid(x1_tmp, x2_tmp)
f = x1_tmp*w[1, 0] + x2_tmp*w[2, 0] + w[0, 0]
try:
        plt.contour(x1_tmp, x2_tmp, f, levels=[0])
except ValueError:
pass
plt.xlabel('X1')
plt.ylabel('X2')
plt.title('Feature scatter plot')
plt.legend()
plt.show()
### generate training data with ~10% label noise; returns (features, labels) as numpy arrays
def training_data_with_random_error(num=1000):
features = np.zeros((num, 3))
labels = np.zeros((num, 1))
points_x1 = np.array([round(random.uniform(-1, 1) ,2) for _ in range(num)])
points_x2 = np.array([round(random.uniform(-1, 1) ,2) for _ in range(num)])
for i in range(num):
features[i, 0] = 1
features[i, 1] = points_x1[i]
features[i, 2] = points_x2[i]
labels[i] = target_function(points_x1[i], points_x2[i])
        ### flip the labels of the first 10% of points to simulate noise
        ### (the points are i.i.d. uniform, so this is equivalent to a random 10% subset)
        if i < num*0.1:
            labels[i] = (1 if labels[i] < 0 else -1)
return features, labels
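### (editor's sketch, not used below) the problem statement asks for a *random* 10% subset;
### since the points are i.i.d., flipping the first 10% is statistically equivalent, but a
### literal, vectorized version could look like this (the _v2 name is made up for illustration):
def training_data_with_random_error_v2(num=1000):
    features = np.ones((num, 3))
    features[:, 1:] = np.random.uniform(-1, 1, size=(num, 2))          ### x1, x2 ~ U[-1, 1]
    labels = np.where(features[:, 1]**2 + features[:, 2]**2 - 0.6 >= 0, 1, -1).reshape(num, 1)
    flip = np.random.choice(num, int(num*0.1), replace=False)          ### random 10% subset
    labels[flip] = -labels[flip]
    return features, labels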
def error_rate(features, labels, w):
wrong = 0
for i in range(len(labels)):
if np.dot(features[i], w)*labels[i,0] < 0:
wrong += 1
return wrong/(len(labels)*1.0)
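### (editor's sketch) an equivalent vectorized 0/1 error rate:
###     np.mean(np.dot(features, w) * labels < 0)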
(features,labels) = training_data_with_random_error(1000)
plot_dot_picture(features, labels)
Because the features are two-dimensional, they are easy to visualize. The scatter plot shows that the distribution of the points roughly matches the target function.
### 13.1 (*) Carry out Linear Regression without transformation, i.e., with feature vector
### (1, x1, x2),
### to find wlin, and use wlin directly for classification. Run the experiments for 1000 times and plot
### a histogram on the classification (0/1) in-sample error (Ein). What is the average Ein over 1000
### experiments?
"""
linear regression:
model : g(x) = w^T * x
strategy : squared error
algorithm : closed form (normal equation)
result : w = (X^T X)^-1 X^T Y
"""
def linear_regression_closed_form(X, Y):
return np.linalg.inv(np.dot(X.T, X)).dot(X.T).dot(Y)
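### (editor's note) np.linalg.inv(X^T X) can be numerically fragile when X^T X is ill-conditioned;
### an equivalent, more stable solve would be
###     np.linalg.lstsq(X, Y, rcond=None)[0]   or   np.linalg.pinv(X).dot(Y)
### the explicit inverse above is kept to match the formula in the docstring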
w = linear_regression_closed_form(features, labels)
"""
plot the result of one run (just for visualization)
"""
plot_dot_picture(features, labels, w)
"""
run 1000 times, and plot histogram
"""
error_rate_array = []
for i in range(1000):
(features,labels) = training_data_with_random_error(1000)
w = linear_regression_closed_form(features, labels)
error_rate_array.append(error_rate(features, labels, w))
bins = np.arange(0,1,0.05)
plt.hist(error_rate_array, bins, rwidth=0.8, histtype='bar')
plt.title("Error rate histogram(without feature transform)")
plt.show()
### error rate, approximately 0.5
avr_err = sum(error_rate_array)/(len(error_rate_array)*1.0)
print "13.1--Linear regression for classification without feature transform:Average error--",avr_err
The figure below shows the line learned in a single run; as you can see, the fit is quite poor.
The next figure is a histogram of Ein over 1000 runs. Ein is around 0.5, which means essentially nothing was learned: the first-order (linear) model cannot fit this data set.
13.1--Linear regression for classification without feature transform:Average error-- 0.50587
### Now, transform the training data into the following nonlinear feature vector:
### (1, x1, x2, x1*x2, x1^2, x2^2)
### Find the vector w~ that corresponds to the solution of Linear Regression, and take it for classification.
"""
feature transform φ(x) = z = (1, x1, x2, x1*x2, x1^2, x2^2)
"""
def feature_transform(features):
new = np.zeros((len(features), 6))
new[:, 0:3] = features[:,:]*1
new[:, 3] = features[:, 1] * features[:, 2]
new[:, 4] = features[:, 1] * features[:, 1]
new[:, 5] = features[:, 2] * features[:, 2]
return new
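### quick sanity check of the transform (editor's sketch): for x = (1, 0.5, -0.2)
### the result should be (1, 0.5, -0.2, -0.1, 0.25, 0.04)
###     print(feature_transform(np.array([[1, 0.5, -0.2]])))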
def plot_dot_pictures(features, labels, w=np.zeros((6, 1))):
    x1 = features[:,1]
    x2 = features[:,2]
    y = labels[:,0]
plot_size = 20
size = np.ones((len(x1)))*plot_size
size_x1 = np.ma.masked_where(y<0, size)
size_x2 = np.ma.masked_where(y>0, size)
### plot scatter
    plt.scatter(x1, x2, s=size_x1, marker='x', c='r', label='y = +1')
    plt.scatter(x1, x2, s=size_x2, marker='o', c='b', label='y = -1')
### plot w line
x1_tmp = np.arange(-1,1,0.01)
x2_tmp = np.arange(-1,1,0.01)
x1_tmp, x2_tmp = np.meshgrid(x1_tmp, x2_tmp)
f = w[0, 0] + x1_tmp*w[1, 0] + x2_tmp*w[2, 0] + x1_tmp*x2_tmp*w[3, 0] \
+ x1_tmp*x1_tmp*w[4, 0] + x2_tmp*x2_tmp*w[5, 0]
try:
        plt.contour(x1_tmp, x2_tmp, f, levels=[0])
except ValueError:
pass
plt.xlabel('X1')
plt.ylabel('X2')
plt.title('Feature scatter plot')
plt.legend()
plt.show()
"""
plot the result of one run (just for visualization)
"""
(features,labels) = training_data_with_random_error(1000)
new_features = feature_transform(features)
w = linear_regression_closed_form(new_features, labels)
plot_dot_pictures(features, labels, w)
"""
run 1000 times, and plot histogram
"""
error_rate_array = []
for i in range(1000):
(features,labels) = training_data_with_random_error(1000)
new_features = feature_transform(features)
w = linear_regression_closed_form(new_features, labels)
error_rate_array.append(error_rate(new_features, labels, w))
bins = np.arange(0,1,0.05)
plt.hist(error_rate_array, bins, rwidth=0.8, histtype='bar')
plt.title("Error rate histogram(with feature transform)")
plt.show()
### error rate, approximately 0.12
avr_err = sum(error_rate_array)/(len(error_rate_array)*1.0)
print "13.2--Linear regression for classification with feature transform:Average error--",avr_err
So for the rest of Question 13 we use a second-order hypothesis: the feature transform turns the nonlinear problem into a linear one, so that linear regression can still be applied. As the figure shows, the learned boundary is quite good, with an error rate around 12% (the data set itself contains 10% noise).
13.2--Linear regression for classification with feature transform:Average error-- 0.124849
So is linear regression always a good fit for classification problems? Here is a small experiment: six points are deliberately chosen, three near [1, 1] and three near [-1, -1].
### is linear regression always good for classification? see the following example
features = np.array([[1, 1.1, 1.2], [1, 1.2,1.0], [1, 1.0, 1.0], [1, -1.1, -1.2], [1, -1.2, -1.0], [1, -1.0, -1.0]])
labels = np.array([[1],[1],[1],[-1],[-1],[-1]])
w = linear_regression_closed_form(features, labels)
"""
plot the result of one run (just for visualization)
"""
plot_dot_picture(features, labels, w)
Linear regression produces the line shown in the figure (this actually surprised me; I expected something close to the line y = x).
### what happens if we add a new point with very large x values?
features = np.array([[1, 100, 100], [1, 1.1, 1.2], [1, 1.2,1.0], [1, 1.0, 1.0], [1, -1.1, -1.2], [1, -1.2, -1.0], [1, -1.0, -1.0]])
labels = np.array([[1], [1],[1],[1],[-1],[-1],[-1]])
w = linear_regression_closed_form(features, labels)
"""
plot the result of one run (just for visualization)
"""
print(w)
plot_dot_picture(features, labels, w)
print(np.dot(features, w))
### total 7 points, 2 points error!!!!!
Now add a sample point at [100, 100], which is a perfectly reasonable point to include. The fit becomes a line roughly like y = x, yet two of the seven points are now misclassified (there used to be a figure here). A binary classifier such as the perceptron, or other classification methods, would handle this data without trouble, so linear regression is not always suitable for classification.
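To back this up, here is a small sketch of the perceptron learning algorithm (PLA) run on the same seven points; the pla helper below is only an illustration I added, not part of the homework code. Because the seven points are linearly separable, PLA classifies all of them correctly, outlier included.
### (sketch) a plain perceptron on the 7 points defined above
def pla(X, Y, max_iter=1000):
    w = np.zeros((X.shape[1], 1))
    for _ in range(max_iter):
        mistakes = 0
        for i in range(len(X)):
            if np.dot(X[i], w)[0] * Y[i, 0] <= 0:        ### misclassified (or on the boundary)
                w += (Y[i, 0] * X[i]).reshape(-1, 1)     ### PLA update: w <- w + y_n * x_n
                mistakes += 1
        if mistakes == 0:                                ### a full pass with no mistakes: converged
            break
    return w

w_pla = pla(features, labels)
print(error_rate(features, labels, w_pla))   ### 0.0 -- every point classified correctly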
### 14. (*) Run the experiment for 1000 times, and plot a histogram on w~3, the weight associated with
### x1*x2. What is the average w~3?
"""
run 1000 times, and plot histogram
"""
w3_array = []
for i in range(1000):
(features,labels) = training_data_with_random_error(1000)
new_features = feature_transform(features)
w = linear_regression_closed_form(new_features, labels)
w3_array.append(w[3,0])
bins = np.arange(-2,2,0.05)
plt.hist(w3_array, bins, rwidth=0.8, histtype='bar')
plt.title("Parameters W3(with feature transform)")
plt.show()
print "Average of W3 is: ", sum(w3_array)/(len(w3_array)*1.0)
Average of W3 is: 0.00120328875641
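The average of W3 is essentially 0, which makes sense: the target function x1^2 + x2^2 − 0.6 has no x1*x2 cross term, so the learned weight on x1*x2 should hover around zero.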
### 15. (*) Continue from Question 14, and plot a histogram on the classification Eout instead. You can
### estimate it by generating a new set of 1000 points and adding noise as before. What is the average
### Eout?
error_out = []
for i in range(1000):
    ### train on a fresh training set (with the feature transform)
    (features, labels) = training_data_with_random_error(1000)
    w = linear_regression_closed_form(feature_transform(features), labels)
    ### estimate Eout on a newly generated noisy test set of 1000 points
    (test_features, test_labels) = training_data_with_random_error(1000)
    error_out.append(error_rate(feature_transform(test_features), test_labels, w))
bins = np.arange(0, 1, 0.05)
plt.hist(error_out, bins, rwidth=0.8, histtype='bar')
plt.title("Error out(with feature transform)")
plt.show()
print "Average of Eout is: ", sum(error_out)/(len(error_out)*1.0)
Average of Eout is: 0.133649
### 18. (*) Implement the fixed learning rate gradient descent algorithm below for logistic regression,
### initialized with 0. Run the algorithm with η = 0.001 and T = 2000 on the following set for training:
### http://www.csie.ntu.edu.tw/~htlin/course/ml15fall/hw3/hw3_train.dat
### and the following set for testing:
### http://www.csie.ntu.edu.tw/~htlin/course/ml15fall/hw3/hw3_test.dat
### What is the weight vector within your g? What is the Eout(g) from your algorithm, evaluated using
### the 0/1 error on the test set?
import math
import numpy as np
"""
Read data from data file
"""
def data_load(file_path):
### open file and read lines
f = open(file_path)
try:
lines = f.readlines()
finally:
f.close()
    ### create features and labels arrays
example_num = len(lines)
    feature_dimension = len(lines[0].strip().split())  ### d feature values + 1 label per line = d + 1 columns (bias + d features)
features = np.zeros((example_num, feature_dimension))
features[:,0] = 1
labels = np.zeros((example_num, 1))
for index,line in enumerate(lines):
### items[0:-1]--features items[-1]--label
items = line.strip().split(' ')
### get features
features[index,1:] = [float(str_num) for str_num in items[0:-1]]
### get label
labels[index] = float(items[-1])
return features,labels
### gradient descent
def gradient_descent(X, Y, w):
### -YnWtXn
tmp = -Y*(np.dot(X, w))
    ### θ(-YnWtXn) = exp(tmp)/(1+exp(tmp))
### weight_matrix = np.array([math.exp(_)/(1+math.exp(_)) for _ in tmp]).reshape(len(X), 1)
weight_matrix = np.exp(tmp)/((1+np.exp(tmp))*1.0)
gradient = 1/(len(X)*1.0)*(sum(weight_matrix*-Y*X).reshape(len(w), 1))
return gradient
### stochastic gradient descent (one example at a time)
def stochastic_gradient_descent(X, Y, w):
### -YnWtXn
tmp = -Y*(np.dot(X, w))
    ### θ(-YnWtXn) = exp(tmp)/(1+exp(tmp))
###weight = math.exp(tmp[0])/((1+math.exp(tmp[0]))*1.0)
weight = np.exp(tmp)/((1+np.exp(tmp))*1.0)
gradient = weight*-Y*X
return gradient.reshape(len(gradient), 1)
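### (editor's sketch) a finite-difference check of gradient_descent against the logistic
### (cross-entropy) error it is derived from; eps and the check_gradient name are arbitrary
### choices for this illustration only
def check_gradient(X, Y, w, eps=1e-6):
    ### cross-entropy error E(w) = mean(ln(1 + exp(-y * w^T x)))
    def E(w_):
        return np.mean(np.log(1 + np.exp(-Y * np.dot(X, w_))))
    numeric = np.zeros_like(w)
    for j in range(len(w)):
        dw = np.zeros_like(w)
        dw[j, 0] = eps
        numeric[j, 0] = (E(w + dw) - E(w - dw)) / (2 * eps)        ### central difference
    return np.max(np.abs(numeric - gradient_descent(X, Y, w)))     ### should be close to 0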
### logistic regression class (my first time writing a Python class, haha)
class LogisticRegression:
    """Logistic regression trained with fixed-learning-rate (stochastic) gradient descent."""
    def __init__(self):
        pass
    ### fit model
    def fit(self, X, Y, Eta=0.001, max_iter=2000, sgd=False):
### ∂E/∂w = 1/N * ∑θ(-YnWtXn)(-YnXn)
self.__w = np.zeros((len(X[0]),1))
        if not sgd:
            for i in range(max_iter):
self.__w = self.__w - Eta*gradient_descent(X, Y, self.__w)
else:
index = 0
            for i in range(max_iter):
if (index >= len(X)):
index = 0
self.__w = self.__w - Eta*stochastic_gradient_descent(np.array(X[index]), Y[index], self.__w)
index += 1
### predict
def predict(self, X):
binary_result = np.dot(X, self.__w) >= 0
return np.array([(1 if _ > 0 else -1) for _ in binary_result]).reshape(len(X), 1)
### get vector w
def get_w(self):
return self.__w
### score(error rate)
def score(self, X, Y):
predict_Y = self.predict(X)
return sum(predict_Y != Y)/(len(Y)*1.0)
### training model
(X, Y) = data_load("hw3_train.dat")
lr = LogisticRegression()
lr.fit(X, Y, max_iter=2000)
### get weight vector
print("weight vector: ", lr.get_w())
### get 0/1 error on the test data
test_X, test_Y = data_load("hw3_test.dat")
### print("Eout: ", lr.score(test_X, test_Y))
lr.score(test_X, test_Y)
weight vector: [[ 0.01878417]
[-0.01260595]
[ 0.04084862]
[-0.03266317]
[ 0.01502334]
[-0.03667437]
[ 0.01255934]
[ 0.04815065]
[-0.02206419]
[ 0.02479605]
[ 0.06899284]
[ 0.0193719 ]
[-0.01988549]
[-0.0087049 ]
[ 0.04605863]
[ 0.05793382]
[ 0.061218 ]
[-0.04720391]
[ 0.06070375]
[-0.01610907]
[-0.03484607]]
array([ 0.475])
### 19. (*) Implement the fixed learning rate gradient descent algorithm below for logistic regression,
### initialized with 0. Run the algorithm with η = 0.01 and T = 2000 on the following set for training:
### http://www.csie.ntu.edu.tw/~htlin/course/ml15fall/hw3/hw3_train.dat
### and the following set for testing:
### http://www.csie.ntu.edu.tw/~htlin/course/ml15fall/hw3/hw3_test.dat
### What is the weight vector within your g? What is the Eout(g) from your algorithm, evaluated using
### the 0/1 error on the test set?
### training model
(X, Y) = data_load("hw3_train.dat")
lr_eta = LogisticRegression()
lr_eta.fit(X, Y, 0.01, 2000)
### get weight vector
print("weight vector: ", lr_eta.get_w())
### get 0/1 error on the test data
test_X, test_Y = data_load("hw3_test.dat")
print("Eout: ", lr_eta.score(test_X, test_Y))
weight vector: [[-0.00385379]
[-0.18914564]
[ 0.26625908]
[-0.35356593]
[ 0.04088776]
[-0.3794296 ]
[ 0.01982783]
[ 0.33391527]
[-0.26386754]
[ 0.13489328]
[ 0.4914191 ]
[ 0.08726107]
[-0.25537728]
[-0.16291797]
[ 0.30073678]
[ 0.40014954]
[ 0.43218808]
[-0.46227968]
[ 0.43230193]
[-0.20786372]
[-0.36936337]]
Eout: [ 0.22]
### 20. (*) Implement the fixed learning rate stochastic gradient descent algorithm below for logistic
### regression, initialized with 0. Instead of randomly choosing n in each iteration, please simply pick
### the example with the cyclic order n = 1, 2, ..., N, 1, 2, .... Run the algorithm with η = 0.001 and
### T = 2000 on the following set for training:
### http://www.csie.ntu.edu.tw/~htlin/course/ml15fall/hw3/hw3_train.dat
### and the following set for testing:
### http://www.csie.ntu.edu.tw/~htlin/course/ml15fall/hw3/hw3_test.dat
### What is the weight vector within your g? What is the Eout(g) from your algorithm, evaluated using
### the 0/1 error on the test set?
### training model
(X, Y) = data_load("hw3_train.dat")
lr_sgd = LogisticRegression()
lr_sgd.fit(X, Y, sgd=True, max_iter=2000)
### get weight vector
print("weight vector: ", lr_sgd.get_w())
### get 0/1 error on the test data
test_X, test_Y = data_load("hw3_test.dat")
print("Eout: ", lr_sgd.score(test_X, test_Y))
weight vector: [[ 0.01826899]
[-0.01308051]
[ 0.04072894]
[-0.03295698]
[ 0.01498363]
[-0.03691042]
[ 0.01232819]
[ 0.04791334]
[-0.02244958]
[ 0.02470544]
[ 0.06878235]
[ 0.01897378]
[-0.02032107]
[-0.00901469]
[ 0.04589259]
[ 0.05776824]
[ 0.06102487]
[-0.04756147]
[ 0.06035018]
[-0.01660574]
[-0.03509342]]
Eout: [ 0.473]
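For reference, the three runs differ only in the step size and the update rule: with η = 0.001 (Questions 18 and 20) the weights barely move within T = 2000 updates, which is presumably why Eout stays near 0.47, while the larger η = 0.01 in Question 19 gets much closer to convergence and brings Eout down to about 0.22.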