Generalized Linear Models 2

阿新 • Published: 2017-05-27


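1.1.2 Ridge Regression

An important difference between ridge regression and ordinary least squares regression is that the former penalizes the squared norm of the coefficients, as shown below: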

In [1]: from sklearn import linear_model

In [2]: clf = linear_model.R
linear_model.RandomizedLasso
linear_model.RandomizedLogisticRegression
linear_model.Ridge
linear_model.RidgeCV
linear_model.RidgeClassifier
linear_model.RidgeClassifierCV

In [2]: clf = linear_model.Ridge(alpha = .5)

In [3]: clf.fit([[0, 0], [0, 0], [1, 1]], [0, .1, 1])
Out[3]: 
Ridge(alpha=0.5, copy_X=True, fit_intercept=True, max_iter=None,
   normalize=False, solver='auto', tol=0.001)

In [4]: clf.coef_
Out[4]: array([ 0.34545455,  0.34545455])

In [5]: clf.intercept_
Out[5]: 0.13636363636363641
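The numbers above can be reproduced by hand. The following is a minimal sketch, assuming the usual ridge objective ||Xw - y||^2 + alpha * ||w||^2 with an unpenalized intercept; it uses only NumPy and is not how scikit-learn implements its solvers.

```python
import numpy as np

# Same toy data and alpha as in the session above.
X = np.array([[0.0, 0.0], [0.0, 0.0], [1.0, 1.0]])
y = np.array([0.0, 0.1, 1.0])
alpha = 0.5

# Assumption: the intercept is not penalized, so center X and y first
# and solve the penalized normal equations on the centered data.
X_mean = X.mean(axis=0)
y_mean = y.mean()
Xc = X - X_mean
yc = y - y_mean

# w = (Xc^T Xc + alpha * I)^(-1) Xc^T yc
coef = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ yc)
intercept = y_mean - X_mean @ coef

print(coef)
print(intercept)
```

Both coefficients come out equal because the two columns of X are identical, so the penalty splits the weight evenly between them.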

Analysis:

(1) Constructor of the sklearn.linear_model.Ridge class

class sklearn.linear_model.Ridge(alpha=1.0, fit_intercept=True, normalize=False, copy_X=True, max_iter=None, tol=0.001, solver='auto')

(2) Attributes and methods of sklearn.linear_model.Ridge instances

[Image: attributes and methods of Ridge instances]

(3) Ridge Regression

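Ridge regression is a biased-estimation regression method designed for analyzing collinear data. It is essentially a modified least squares estimator: by giving up the unbiasedness of least squares, at the cost of losing some information and some precision, it obtains regression coefficients that are more realistic and more reliable. It tolerates ill-conditioned data far better than ordinary least squares.

Ridge regression mainly addresses two kinds of problems: fewer data points than variables, and collinearity among the variables.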
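Both situations make the matrix X^T X singular, so the ordinary least squares normal equations have no unique solution. The sketch below uses a small hypothetical data set (2 samples, 3 variables, one duplicated column) to show that adding alpha * I restores a unique solution; the data and the value of alpha are assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical data: 2 samples, 3 variables; the third column
# duplicates the first, so the variables are perfectly collinear.
X = np.array([[1.0, 2.0, 1.0],
              [3.0, 1.0, 3.0]])
y = np.array([1.0, 2.0])

gram = X.T @ X                       # 3x3, but its rank is at most 2
rank = np.linalg.matrix_rank(gram)

alpha = 0.1                          # assumed penalty strength
ridge_gram = gram + alpha * np.eye(3)
ridge_rank = np.linalg.matrix_rank(ridge_gram)

# The penalized system is full rank, so the solution is unique.
w = np.linalg.solve(ridge_gram, X.T @ y)

print(rank, ridge_rank)
print(w)
```

The duplicated columns receive the same coefficient, which is the typical way ridge regression handles collinear variables.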

Examples: Plot Ridge coefficients as a function of the regularization

print(__doc__)

import numpy as np
import pylab as pl
from sklearn import linear_model

# X is the 10x10 Hilbert matrix
X = 1. / (np.arange(1, 11) + np.arange(0, 10)[:, np.newaxis])
y = np.ones(10)

###############################################################################
# Compute paths

n_alphas = 200
alphas = np.logspace(-10, -2, n_alphas)
clf = linear_model.Ridge(fit_intercept=False)

coefs = []
for a in alphas:
    clf.set_params(alpha=a)
    clf.fit(X, y)
    coefs.append(clf.coef_)

###############################################################################
# Display results

ax = pl.gca()
# set_color_cycle was removed from newer matplotlib releases;
# set_prop_cycle is its replacement
ax.set_prop_cycle(color=['b', 'r', 'g', 'c', 'k', 'y', 'm'])

ax.plot(alphas, coefs)
ax.set_xscale('log')
ax.set_xlim(ax.get_xlim()[::-1])  # reverse axis
pl.xlabel('alpha')
pl.ylabel('weights')
pl.title('Ridge coefficients as a function of the regularization')
pl.axis('tight')
pl.show()

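The resulting plot is shown below:

[Figure: Ridge coefficients as a function of the regularization; x-axis: alpha (log scale, reversed), y-axis: weights]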
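The trend in the plot can also be checked without plotting. The sketch below computes the ridge solution without intercept through the SVD of X and tracks the norm of the coefficient vector over the same alpha grid. The SVD formula is a standard closed form and is used here as an assumption for illustration; it is not necessarily the solver that Ridge picks.

```python
import numpy as np

# Same data and alpha grid as in the example above.
X = 1.0 / (np.arange(1, 11) + np.arange(0, 10)[:, np.newaxis])
y = np.ones(10)
alphas = np.logspace(-10, -2, 200)

# With X = U diag(s) V^T, the ridge solution without intercept is
# w(alpha) = V diag(s / (s^2 + alpha)) U^T y
U, s, Vt = np.linalg.svd(X)
Uty = U.T @ y
norms = np.array([
    np.linalg.norm(Vt.T @ (s / (s ** 2 + a) * Uty))
    for a in alphas
])

print(norms[0], norms[-1])
```

The norm for the smallest alpha is much larger than for the largest alpha: a weak penalty lets the coefficients blow up on this ill-conditioned matrix, while a stronger penalty shrinks them.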

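Analysis:

(1) Hilbert matrix

In linear algebra, a Hilbert matrix is a square matrix whose entries are all unit fractions. Specifically, the entry in row i and column j of a Hilbert matrix H is:

$H_{ij} = \frac{1}{i + j - 1}$

For example, the $5 \times 5$ Hilbert matrix is:

$H = \begin{bmatrix} 1 & \frac{1}{2} & \frac{1}{3} & \frac{1}{4} & \frac{1}{5} \\ \frac{1}{2} & \frac{1}{3} & \frac{1}{4} & \frac{1}{5} & \frac{1}{6} \\ \frac{1}{3} & \frac{1}{4} & \frac{1}{5} & \frac{1}{6} & \frac{1}{7} \\ \frac{1}{4} & \frac{1}{5} & \frac{1}{6} & \frac{1}{7} & \frac{1}{8} \\ \frac{1}{5} & \frac{1}{6} & \frac{1}{7} & \frac{1}{8} & \frac{1}{9} \end{bmatrix}$

The entries of a Hilbert matrix can also be written as the following integral:

$H_{ij} = \int_0^1 x^{i+j-2} \, dx$

That is, H is the Gram matrix of the powers of x with respect to the inner product $\langle f, g \rangle = \int_0^1 f(x) g(x) \, dx$.

Hilbert matrices are a canonical example of ill-conditioned matrices, and numerical computation with them is notoriously difficult. For example, in the $l_2$ matrix norm, the condition number of the $5 \times 5$ Hilbert matrix is about $4.8 \times 10^5$, far greater than 1.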
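The ill-conditioning can be checked numerically. A minimal sketch, assuming the standard definition H[i, j] = 1 / (i + j - 1) with 1-based indices and the 2-norm condition number:

```python
import numpy as np

n = 5
i = np.arange(1, n + 1)[:, np.newaxis]  # row indices 1..n as a column
j = np.arange(1, n + 1)                 # column indices 1..n as a row
H = 1.0 / (i + j - 1)                   # broadcasting gives an n x n matrix

# 2-norm condition number: ratio of largest to smallest singular value
cond = np.linalg.cond(H, 2)
print(cond)
```

A condition number of this size means that small relative errors in the right-hand side of a linear system can be amplified by several orders of magnitude in the solution.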

(2) The np.arange() function

In [31]: 1. / (np.arange(1, 11))
Out[31]: 
array([ 1.        ,  0.5       ,  0.33333333,  0.25      ,  0.2       ,
        0.16666667,  0.14285714,  0.125     ,  0.11111111,  0.1       ])

In [32]: (1. / (np.arange(1, 11))).shape
Out[32]: (10,)

(3) np.newaxis

In [5]: np.arange(0, 10)
Out[5]: array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

In [6]: type(np.arange(0, 10))
Out[6]: numpy.ndarray

In [7]: np.arange(0, 10).shape
Out[7]: (10,)

In [8]: np.arange(0, 10)[:, np.newaxis]
Out[8]: 
array([[0],
       [1],
       [2],
       [3],
       [4],
       [5],
       [6],
       [7],
       [8],
       [9]])

In [9]: np.arange(0, 10)[:, np.newaxis].shape
Out[9]: (10, 1)
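For reference, np.newaxis is simply an alias for None, and the same column shape can be obtained with reshape. A short sketch of the equivalent spellings:

```python
import numpy as np

a = np.arange(0, 10)

# np.newaxis is an alias for None, so these two index expressions are identical.
col_a = a[:, np.newaxis]
col_b = a[:, None]

# reshape(-1, 1) yields the same (10, 1) column.
col_c = a.reshape(-1, 1)

# Putting the new axis first gives a (1, 10) row instead.
row = a[np.newaxis, :]

print(col_a.shape, col_c.shape, row.shape)
```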

(4) Broadcasting

In [25]: x = np.arange(0, 5)

In [26]: x[:, np.newaxis]
Out[26]: 
array([[0],
       [1],
       [2],
       [3],
       [4]])

In [27]: x[np.newaxis, :]
Out[27]: array([[0, 1, 2, 3, 4]])

In [28]: x[:, np.newaxis] + x[np.newaxis, :]
Out[28]: 
array([[0, 1, 2, 3, 4],
       [1, 2, 3, 4, 5],
       [2, 3, 4, 5, 6],
       [3, 4, 5, 6, 7],
       [4, 5, 6, 7, 8]])
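What broadcasting computes here can be spelled out with explicit loops. The sketch below builds the same table entry by entry, out[i, j] = x[i] + x[j], and compares it with the broadcast sum; the loop version is only for illustration.

```python
import numpy as np

x = np.arange(0, 5)
col = x[:, np.newaxis]   # shape (5, 1)
row = x[np.newaxis, :]   # shape (1, 5)

# Explicit equivalent of the broadcast sum: expected[i, j] = x[i] + x[j]
expected = np.empty((5, 5), dtype=x.dtype)
for i in range(5):
    for j in range(5):
        expected[i, j] = x[i] + x[j]

# The size-1 axes are stretched to length 5 before the elementwise add.
result = col + row
print(result.shape)
```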

(5) The 10×10 Hilbert matrix X

In [33]: X = 1. / (np.arange(1, 11) + np.arange(0, 10)[:, np.newaxis])

In [34]: X
Out[34]: 
array([[ 1.        ,  0.5       ,  0.33333333,  0.25      ,  0.2       ,
         0.16666667,  0.14285714,  0.125     ,  0.11111111,  0.1       ],
       [ 0.5       ,  0.33333333,  0.25      ,  0.2       ,  0.16666667,
         0.14285714,  0.125     ,  0.11111111,  0.1       ,  0.09090909],
       [ 0.33333333,  0.25      ,  0.2       ,  0.16666667,  0.14285714,
         0.125     ,  0.11111111,  0.1       ,  0.09090909,  0.08333333],
       [ 0.25      ,  0.2       ,  0.16666667,  0.14285714,  0.125     ,
         0.11111111,  0.1       ,  0.09090909,  0.08333333,  0.07692308],
       [ 0.2       ,  0.16666667,  0.14285714,  0.125     ,  0.11111111,
         0.1       ,  0.09090909,  0.08333333,  0.07692308,  0.07142857],
       [ 0.16666667,  0.14285714,  0.125     ,  0.11111111,  0.1       ,
         0.09090909,  0.08333333,  0.07692308,  0.07142857,  0.06666667],
       [ 0.14285714,  0.125     ,  0.11111111,  0.1       ,  0.09090909,
         0.08333333,  0.07692308,  0.07142857,  0.06666667,  0.0625    ],
       [ 0.125     ,  0.11111111,  0.1       ,  0.09090909,  0.08333333,
         0.07692308,  0.07142857,  0.06666667,  0.0625    ,  0.05882353],
       [ 0.11111111,  0.1       ,  0.09090909,  0.08333333,  0.07692308,
         0.07142857,  0.06666667,  0.0625    ,  0.05882353,  0.05555556],
       [ 0.1       ,  0.09090909,  0.08333333,  0.07692308,  0.07142857,
         0.06666667,  0.0625    ,  0.05882353,  0.05555556,  0.05263158]])
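As a sanity check, the broadcast expression can be compared with an entry-by-entry construction. With 0-based indices the entry is 1 / (i + j + 1). The sketch also computes the 2-norm condition number of X, which shows why this matrix is a demanding test case for least squares.

```python
import numpy as np

# Broadcast construction, as in the session above.
X = 1. / (np.arange(1, 11) + np.arange(0, 10)[:, np.newaxis])

# Entry-by-entry construction with 0-based indices: H[i, j] = 1 / (i + j + 1)
H = np.array([[1.0 / (i + j + 1) for j in range(10)] for i in range(10)])

same = np.allclose(X, H)
symmetric = np.allclose(X, X.T)

# 2-norm condition number of the 10 x 10 Hilbert matrix
cond = np.linalg.cond(X, 2)
print(same, symmetric, cond)
```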

(6) The np.ones() function

In [35]: y = np.ones(10)

In [36]: y
Out[36]: array([ 1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  1.])

In [37]: y.shape
Out[37]: (10,)

(7) The numpy.logspace() function

numpy.logspace(start, stop, num=50, endpoint=True, base=10.0)

Description: Return numbers spaced evenly on a log scale. In linear space, the sequence starts at base ** start (base to the power of start) and ends with base ** stop (see endpoint below).

In [38]: n_alphas = 200

In [39]: alphas = np.logspace(-10, -2, n_alphas)

In [40]: alphas
Out[40]: 
array([  1.00000000e-10,   1.09698580e-10,   1.20337784e-10,
         1.32008840e-10,   1.44811823e-10,   1.58856513e-10,
         1.74263339e-10,   1.91164408e-10,   2.09704640e-10,
         ...,
         5.23109931e-03,   5.73844165e-03,   6.29498899e-03,
         6.90551352e-03,   7.57525026e-03,   8.30994195e-03,
         9.11588830e-03,   1.00000000e-02])

In [41]: alphas.shape
Out[41]: (200,)

In [42]: 1.00000000e-10
Out[42]: 1e-10
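Put differently, logspace spaces the exponents evenly and then raises the base to them, so consecutive values differ by a constant factor. A short sketch of that equivalence for the grid used here:

```python
import numpy as np

n_alphas = 200
alphas = np.logspace(-10, -2, n_alphas)

# Equivalent construction: evenly spaced exponents, then 10 ** exponent.
exponents = np.linspace(-10, -2, n_alphas)
manual = 10.0 ** exponents

# The exponent step is 8 / 199, so each value is the previous one
# multiplied by 10 ** (8 / 199).
ratio = alphas[1] / alphas[0]
print(ratio)
```

This constant ratio is what makes the grid look evenly spaced on the logarithmic x-axis of the plot.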

(8) The set_params(**params) method

[Image: set_params documentation]

(9) The matplotlib.pyplot.gca(**kwargs) function

Return the current axis instance. This can be used to control axis properties either using set or the Axes methods, for example, setting the x axis range.

