Logistic Regression: Explanation and Code
阿新 · Published: 2018-12-17
The logistic regression model is a classification model defined by the following conditional probability distribution: P(y = 1 | x; θ) = e^(θ^T x) / (1 + e^(θ^T x)) = 1 / (1 + e^(-θ^T x)), and P(y = 0 | x; θ) = 1 - P(y = 1 | x; θ).
The logistic regression model comes from the logistic distribution, whose distribution function is an S-shaped (sigmoid) curve.
Logistic regression is used for classification problems, where the predicted value is discrete; a key property of the algorithm is that its output always lies between 0 and 1.
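For reference, the distribution function of the logistic distribution mentioned above can be written as

F(x) = P(X ≤ x) = 1 / (1 + e^(-(x - μ)/γ)),

with location parameter μ and scale parameter γ > 0; it is S-shaped and bounded between 0 and 1. The sigmoid function g(z) = 1 / (1 + e^(-z)) used below is simply the special case μ = 0, γ = 1.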
Model hypothesis of logistic regression: h_θ(x) = g(θ^T x) = 1 / (1 + e^(-θ^T x)), where g is the sigmoid function.
The role of h(x): for a given input x, it computes, under the chosen parameters θ, the probability that the output variable equals 1, i.e. h_θ(x) = P(y = 1 | x; θ).
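As a quick illustration (all numbers here are made up), the hypothesis can be evaluated for a single example like this; the result is read as the estimated probability that y = 1:

import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

# Made-up parameters and one sample; x[0] = 1 is the intercept feature
theta = np.array([-1.0, 0.5, 2.0])
x = np.array([1.0, 2.0, 0.3])

h = sigmoid(theta @ x)      # h_theta(x) = g(theta^T x)
print(h)                    # about 0.65, i.e. P(y = 1 | x; theta) is roughly 65%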
Cost function: J(θ) = -(1/m) Σ_{i=1..m} [ y^(i) log(h_θ(x^(i))) + (1 - y^(i)) log(1 - h_θ(x^(i))) ].
Gradient descent algorithm: repeat { θ_j := θ_j - α (1/m) Σ_{i=1..m} (h_θ(x^(i)) - y^(i)) x_j^(i) }, updating all θ_j simultaneously.
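A minimal sketch of what that update loop looks like in NumPy (the exercise code below hands the gradient to a library optimizer instead of looping by hand; the helper names here are illustrative):

import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def cost(theta, X, y):
    # Unregularized logistic-regression cost J(theta)
    h = sigmoid(X @ theta)
    return -np.mean(y * np.log(h) + (1 - y) * np.log(1 - h))

def gradient_descent(X, y, alpha=0.1, iterations=1000):
    # X: (m, n) design matrix whose first column is all ones; y: (m,) labels in {0, 1}
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(iterations):
        h = sigmoid(X @ theta)          # hypothesis for every example
        grad = X.T @ (h - y) / m        # partial derivatives of J(theta)
        theta = theta - alpha * grad    # simultaneous update of all theta_j
    return theta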
Advanced optimization algorithms: the conjugate gradient method, BFGS (variable metric), L-BFGS (limited-memory variable metric), and fminunc (the unconstrained minimization function in Octave/MATLAB).
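In Python these optimizers are available through scipy.optimize; the following is a small self-contained sketch (with a made-up toy dataset, so the numbers themselves mean nothing) of how conjugate gradient, BFGS, or L-BFGS can replace a hand-written loop, playing the role that fminunc plays in Octave/MATLAB:

import numpy as np
import scipy.optimize as opt

def cost_grad(theta, X, y):
    # Returns both the cost and its gradient so that jac=True can be used below
    h = 1 / (1 + np.exp(-(X @ theta)))
    cost = -np.mean(y * np.log(h) + (1 - y) * np.log(1 - h))
    grad = X.T @ (h - y) / y.size
    return cost, grad

# Tiny made-up dataset: an intercept column plus one feature
X = np.array([[1.0, 0.5], [1.0, 2.0], [1.0, 1.5], [1.0, 3.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])

# method can be 'CG' (conjugate gradient), 'BFGS', or 'L-BFGS-B' (limited-memory BFGS)
res = opt.minimize(cost_grad, x0=np.zeros(X.shape[1]), args=(X, y),
                   jac=True, method='L-BFGS-B', options={'maxiter': 400})
print(res.x)  # fitted theta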
Regularization: keep all the features, but reduce the magnitude of the parameters. The regularized cost adds a penalty term:

J(θ) = -(1/m) Σ_{i=1..m} [ y^(i) log(h_θ(x^(i))) + (1 - y^(i)) log(1 - h_θ(x^(i))) ] + (λ/(2m)) Σ_{j=1..n} θ_j²

Here lambda (λ) is the regularization parameter: the larger lambda is, the smaller the parameters become. Because we minimize the cost function and the penalty term is added at the end, a larger penalty makes the whole cost larger, so the θ_j must shrink for the cost to stay small.
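A tiny numerical illustration of that effect (the theta values and m are made up): the penalty term alone grows linearly with lambda, so for a large lambda the optimizer can only keep the cost low by driving the θ_j toward zero.

import numpy as np

theta = np.array([0.0, 2.0, -3.0, 1.5])  # made-up parameters; theta[0] is never regularized
m = 100                                   # made-up number of training examples

for lmd in [0, 1, 10, 100]:
    penalty = (lmd / (2 * m)) * np.sum(theta[1:] ** 2)
    print('lambda = {:>3}: penalty added to the cost = {:.4f}'.format(lmd, penalty))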
Now the code. The most important parts to implement are the cost function and the sigmoid function.
# ---- ex2_reg.py ----
import matplotlib.pyplot as plt
import numpy as np
import scipy.optimize as opt

from plotData import *
import costFunctionReg as cfr
import plotDecisionBoundary as pdb
import predict as predict
import mapFeature as mf

plt.ion()

# Load data
# The first two columns contain the exam scores and the third column contains the label.
data = np.loadtxt('ex2data2.txt', delimiter=',')
X = data[:, 0:2]
y = data[:, 2]

plot_data(X, y)

plt.xlabel('Microchip Test 1')
plt.ylabel('Microchip Test 2')
plt.legend(['y = 1', 'y = 0'])

input('Program paused. Press ENTER to continue')

# ===================== Part 1: Regularized Logistic Regression =====================
X = mf.map_feature(X[:, 0], X[:, 1])

# Initialize fitting parameters
initial_theta = np.zeros(X.shape[1])

# Set regularization parameter lambda to 1
lmd = 1

# Compute and display initial cost and gradient for regularized logistic regression
cost, grad = cfr.cost_function_reg(initial_theta, X, y, lmd)

np.set_printoptions(formatter={'float': '{: 0.4f}\n'.format})
print('Cost at initial theta (zeros): {}'.format(cost))
print('Expected cost (approx): 0.693')
print('Gradient at initial theta (zeros) - first five values only: \n{}'.format(grad[0:5]))
print('Expected gradients (approx) - first five values only: \n 0.0085\n 0.0188\n 0.0001\n 0.0503\n 0.0115')

input('Program paused. Press ENTER to continue')

# Compute and display cost and gradient with non-zero theta
test_theta = np.ones(X.shape[1])
cost, grad = cfr.cost_function_reg(test_theta, X, y, lmd)

print('Cost at test theta: {}'.format(cost))
print('Expected cost (approx): 2.13')
print('Gradient at test theta - first five values only: \n{}'.format(grad[0:5]))
print('Expected gradients (approx) - first five values only: \n 0.3460\n 0.0851\n 0.1185\n 0.1506\n 0.0159')

input('Program paused. Press ENTER to continue')

# ===================== Part 2: Regularization and Accuracies =====================
# Optional Exercise:
# In this part, you will get to try different values of lambda and
# see how regularization affects the decision boundary
#
# Try the following values of lambda (0, 1, 10, 100).
#
# How does the decision boundary change when you vary lambda? How does
# the training set accuracy vary?
#

# Initialize fitting parameters
initial_theta = np.zeros(X.shape[1])

# Set regularization parameter lambda to 1 (you should vary this)
lmd = 1

# Optimize
def cost_func(t):
    return cfr.cost_function_reg(t, X, y, lmd)[0]


def grad_func(t):
    return cfr.cost_function_reg(t, X, y, lmd)[1]


# Minimize the regularized cost with BFGS, a quasi-Newton method from the SciPy optimization library
theta, cost, *unused = opt.fmin_bfgs(f=cost_func, fprime=grad_func, x0=initial_theta,
                                     maxiter=400, full_output=True, disp=False)

# Plot boundary
print('Plotting decision boundary ...')
pdb.plot_decision_boundary(theta, X, y)
plt.title('lambda = {}'.format(lmd))
plt.xlabel('Microchip Test 1')
plt.ylabel('Microchip Test 2')

# Compute accuracy on our training set
p = predict.predict(theta, X)

print('Train Accuracy: {:0.4f}'.format(np.mean(y == p) * 100))
print('Expected accuracy (with lambda = 1): 83.1 (approx)')

input('ex2_reg Finished. Press ENTER to exit')
# ---- costFunctionReg.py ----
import numpy as np
from sigmoid import *


def cost_function_reg(theta, X, y, lmd):
    m = y.size

    hypothesis = sigmoid(np.dot(X, theta))
    reg_theta = theta[1:]  # theta_0 is not regularized

    cost = np.sum(-y * np.log(hypothesis) - (1 - y) * np.log(1 - hypothesis)) / m \
           + (lmd / (2 * m)) * np.sum(reg_theta * reg_theta)

    grad = np.zeros(theta.size)  # allocate the gradient (this line was missing in the original snippet)
    normal_grad = (np.dot(X.T, hypothesis - y) / m).flatten()
    grad[0] = normal_grad[0]
    grad[1:] = normal_grad[1:] + reg_theta * (lmd / m)

    # ===========================================================
    return cost, grad


# ---- plotDecisionBoundary.py ----
import matplotlib.pyplot as plt
import numpy as np
from plotData import *
from mapFeature import *


def plot_decision_boundary(theta, X, y):
    plot_data(X[:, 1:3], y)

    if X.shape[1] <= 3:
        # Only need two points to define a line, so choose two endpoints
        plot_x = np.array([np.min(X[:, 1]) - 2, np.max(X[:, 1]) + 2])

        # Calculate the decision boundary line
        plot_y = (-1 / theta[2]) * (theta[1] * plot_x + theta[0])

        plt.plot(plot_x, plot_y)
        plt.legend(['Decision Boundary', 'Admitted', 'Not admitted'], loc=1)
        plt.axis([30, 100, 30, 100])
    else:
        # Here is the grid range
        u = np.linspace(-1, 1.5, 50)
        v = np.linspace(-1, 1.5, 50)

        z = np.zeros((u.size, v.size))

        # Evaluate z = theta*x over the grid
        for i in range(0, u.size):
            for j in range(0, v.size):
                z[i, j] = np.dot(map_feature(u[i], v[j]), theta)

        z = z.T

        # Plot z = 0
        # Notice you need to specify the level [0]
        cs = plt.contour(u, v, z, levels=[0], colors='r')
        plt.legend([cs.collections[0]], ['Decision Boundary'])


# ---- mapFeature.py ----
import numpy as np


def map_feature(x1, x2):
    # Map the two input features to all polynomial terms of x1 and x2 up to the sixth power
    degree = 6

    x1 = x1.reshape((x1.size, 1))
    x2 = x2.reshape((x2.size, 1))
    result = np.ones(x1[:, 0].shape)

    for i in range(1, degree + 1):
        for j in range(0, i + 1):
            result = np.c_[result, (x1**(i - j)) * (x2**j)]

    return result


# ---- plotData.py ----
import matplotlib.pyplot as plt
import numpy as np


def plot_data(X, y):
    plt.figure()

    pos = np.where(y == 1)[0]  # indices of the positive (y = 1) examples
    neg = np.where(y == 0)[0]  # indices of the negative (y = 0) examples

    plt.scatter(X[pos, 0], X[pos, 1], marker="+", c='b')
    plt.scatter(X[neg, 0], X[neg, 1], marker="o", c='y')


# ---- predict.py ----
import numpy as np
from sigmoid import *


def predict(theta, X):
    m = X.shape[0]
    p = np.zeros(m)

    p = sigmoid(np.dot(X, theta))
    pos = np.where(p >= 0.5)
    neg = np.where(p < 0.5)
    p[pos] = 1
    p[neg] = 0

    # ===========================================================
    return p


# ---- sigmoid.py ----
import numpy as np


def sigmoid(z):
    g = np.zeros(z.size)

    g = 1 / (1 + np.exp(-z))

    return g
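If you want to sanity-check the helpers without the exercise data, a quick self-contained test is possible, assuming the modules above are saved under the file names used by the imports (sigmoid.py, costFunctionReg.py) and using made-up samples; at theta = 0 the regularized cost is always log(2) ≈ 0.693, which matches the expected value printed by ex2_reg.py:

import numpy as np
from sigmoid import sigmoid
from costFunctionReg import cost_function_reg

# Made-up data: four samples with an intercept column and two features
X = np.array([[1.0, 0.5, 1.2],
              [1.0, 2.0, 0.3],
              [1.0, 1.5, 2.1],
              [1.0, 3.0, 0.8]])
y = np.array([0.0, 0.0, 1.0, 1.0])

print(sigmoid(np.array([0.0])))                      # [0.5] -- sigmoid(0) is exactly 0.5

cost, grad = cost_function_reg(np.zeros(3), X, y, 1)
print(cost)                                          # log(2) ≈ 0.6931 at theta = 0
print(grad)                                          # gradient at theta = 0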