Applying the Support Vector Machine (SVM) Algorithm [Python Implementation]

阿新 • Published: 2019-02-03

from __future__ import print_function

from time import time
import logging
import matplotlib.pyplot as plt

# sklearn.cross_validation and sklearn.grid_search were merged into
# sklearn.model_selection
from sklearn.model_selection import train_test_split
from sklearn.datasets import fetch_lfw_people
from sklearn.model_selection import GridSearchCV
from sklearn.metrics import classification_report
from sklearn.metrics import confusion_matrix
from sklearn.decomposition import PCA
from sklearn.svm import SVC


print(__doc__)

# Display progress logs on stdout
logging.basicConfig(level=logging.INFO, format='%(asctime)s %(message)s')


###############################################################################
# Download the data, if not already on disk and load it as numpy arrays

lfw_people = fetch_lfw_people(min_faces_per_person=70, resize=0.4)

# introspect the images arrays to find the shapes (for plotting)
n_samples, h, w = lfw_people.images.shape

# for machine learning we use the flattened pixel data directly (relative
# pixel position info is ignored by this model)
X = lfw_people.data
n_features = X.shape[1]

# the label to predict is the id of the person
y = lfw_people.target
target_names = lfw_people.target_names
n_classes = target_names.shape[0]

print("Total dataset size:")
print("n_samples: %d" % n_samples)
print("n_features: %d" % n_features)
print("n_classes: %d" % n_classes)


###############################################################################
# Split into a training set and a test set (random 75% / 25% split)

# split into a training and testing set
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25)


###############################################################################
# Compute a PCA (eigenfaces) on the face dataset (treated as unlabeled
# dataset): unsupervised feature extraction / dimensionality reduction
n_components = 150

print("Extracting the top %d eigenfaces from %d faces"
      % (n_components, X_train.shape[0]))
t0 = time()
# RandomizedPCA was deprecated in scikit-learn 0.18; PCA with
# svd_solver='randomized' is its replacement
pca = PCA(n_components=n_components, svd_solver='randomized',
          whiten=True).fit(X_train)
print("done in %0.3fs" % (time() - t0))

eigenfaces = pca.components_.reshape((n_components, h, w))

print("Projecting the input data on the eigenfaces orthonormal basis")
t0 = time()
X_train_pca = pca.transform(X_train)
X_test_pca = pca.transform(X_test)
print("done in %0.3fs" % (time() - t0))


###############################################################################
# Train a SVM classification model

print("Fitting the classifier to the training set")
t0 = time()
param_grid = {'C': [1e3, 5e3, 1e4, 5e4, 1e5],
              'gamma': [0.0001, 0.0005, 0.001, 0.005, 0.01, 0.1], }
# class_weight='auto' was renamed to 'balanced' in newer scikit-learn releases
clf = GridSearchCV(SVC(kernel='rbf', class_weight='balanced'), param_grid)
clf = clf.fit(X_train_pca, y_train)
print("done in %0.3fs" % (time() - t0))
print("Best estimator found by grid search:")
print(clf.best_estimator_)


###############################################################################
# Quantitative evaluation of the model quality on the test set

print("Predicting people's names on the test set")
t0 = time()
y_pred = clf.predict(X_test_pca)
print("done in %0.3fs" % (time() - t0))

print(classification_report(y_test, y_pred, target_names=target_names))
print(confusion_matrix(y_test, y_pred, labels=range(n_classes)))


###############################################################################
# Qualitative evaluation of the predictions using matplotlib

def plot_gallery(images, titles, h, w, n_row=3, n_col=4):
    """Helper function to plot a gallery of portraits"""
    plt.figure(figsize=(1.8 * n_col, 2.4 * n_row))
    plt.subplots_adjust(bottom=0, left=.01, right=.99, top=.90, hspace=.35)
    for i in range(n_row * n_col):
        plt.subplot(n_row, n_col, i + 1)
        plt.imshow(images[i].reshape((h, w)), cmap=plt.cm.gray)
        plt.title(titles[i], size=12)
        plt.xticks(())
        plt.yticks(())


# plot the result of the prediction on a portion of the test set

def title(y_pred, y_test, target_names, i):
    pred_name = target_names[y_pred[i]].rsplit(' ', 1)[-1]
    true_name = target_names[y_test[i]].rsplit(' ', 1)[-1]
    return 'predicted: %s\ntrue:      %s' % (pred_name, true_name)

prediction_titles = [title(y_pred, y_test, target_names, i)
                     for i in range(y_pred.shape[0])]

plot_gallery(X_test, prediction_titles, h, w)

# plot the gallery of the most significant eigenfaces

eigenface_titles = ["eigenface %d" % i for i in range(eigenfaces.shape[0])]
plot_gallery(eigenfaces, eigenface_titles, h, w)

plt.show()
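
The grid search in the script above tries every combination of the `C` and `gamma` values in `param_grid`. As a rough sketch of how much work that implies, the snippet below enumerates the combinations using only the standard library. The fold count `n_folds = 5` is an assumption (it matches the default `cv` in recent scikit-learn releases; the script itself does not set it), and `count_fits` is a hypothetical helper, not part of scikit-learn.

```python
from itertools import product

# Same grid as in the script above
param_grid = {'C': [1e3, 5e3, 1e4, 5e4, 1e5],
              'gamma': [0.0001, 0.0005, 0.001, 0.005, 0.01, 0.1]}


def count_fits(grid, n_folds, refit=True):
    """Hypothetical helper: number of model fits an exhaustive grid search
    would perform (one per candidate per fold, plus one final refit)."""
    n_candidates = 1
    for values in grid.values():
        n_candidates *= len(values)
    return n_candidates * n_folds + (1 if refit else 0)


# Every (C, gamma) pair that would be evaluated
candidates = [dict(zip(param_grid, combo))
              for combo in product(*param_grid.values())]

n_folds = 5  # assumption: default cv in recent scikit-learn releases
print("candidates: %d" % len(candidates))
print("total fits: %d" % count_fits(param_grid, n_folds))
```

With 5 values of `C` and 6 of `gamma`, the search covers 30 candidates, which is why the fitting step tends to dominate the script's run time.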

Result of running the face-recognition script (figure):
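
The confusion matrix printed by the script follows scikit-learn's convention: row i is the true class and column j is the predicted class, so the diagonal counts the correct predictions. A minimal pure-Python sketch of that bookkeeping is given below. The label lists and the helper names `confusion_counts` and `per_class_recall` are illustrative assumptions, not output from the LFW run.

```python
def confusion_counts(y_true, y_hat, n_classes):
    """Build an n_classes x n_classes table: rows = true, cols = predicted."""
    table = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_hat):
        table[t][p] += 1
    return table


def per_class_recall(table):
    """Recall of class i = correct predictions for i / all true samples of i."""
    return [row[i] / sum(row) if sum(row) else 0.0
            for i, row in enumerate(table)]


# Made-up labels for three classes (illustration only)
y_true = [0, 0, 0, 1, 1, 2, 2, 2, 2]
y_hat = [0, 0, 1, 1, 1, 2, 2, 0, 2]

table = confusion_counts(y_true, y_hat, n_classes=3)
for row in table:
    print(row)
print(per_class_recall(table))
```

Reading along a row shows where the samples of one person ended up; reading down a column shows which people were mistaken for that person.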
