無監督學習——K-means演算法

阿新 • • 發佈：2019-01-07

筆記：

這裡寫圖片描述

核心步驟：

這裡寫圖片描述

那我們就實現這兩個函式就行啦：

findClosestCentroids.m（把每個點染色）：

function idx = findClosestCentroids(X, centroids)
%FINDCLOSESTCENTROIDS computes the centroid memberships for every example
%   idx = FINDCLOSESTCENTROIDS (X, centroids) returns the closest centroids
%   in idx for a dataset X where each row is a single example. idx = m x 1  

%   vector of centroid assignments (i.e. each entry in range [1..K])
%

% Set K
K = size(centroids, 1);

% You need to return the following variables correctly.
idx = zeros(size(X,1), 1);

% ====================== YOUR CODE HERE ======================
% Instructions: Go over every example, find its closest centroid, and store 

%               the index inside idx at the appropriate location.
%               Concretely, idx(i) should contain the index of the centroid
%               closest to example i. Hence, it should be a value in the 
%               range 1..K
%
% Note: You can use a for-loop over the examples to compute this. 

%

m = size(X,1);
dis = zeros(m,K);       %(m,k)位置表示第m個樣本和第K個聚類中心的距離的平方
for i=1:m
    for j=1:K
        dis(i,j) = X(i,:)*X(i,:)' + centroids(j,:)*centroids(j,:)' - ...
            X(i,:)*centroids(j,:)'*2;
    end
end
[~, idx] = min(dis,[],2);       %尋找每一行中最小的元素索引

% =============================================================

end

computeCentroids.m（更新聚類中心）：

function centroids = computeCentroids(X, idx, K)
%COMPUTECENTROIDS returns the new centroids by computing the means of the 
%data points assigned to each centroid.
%   centroids = COMPUTECENTROIDS(X, idx, K) returns the new centroids by 
%   computing the means of the data points assigned to each centroid. It is
%   given a dataset X where each row is a single data point, a vector
%   idx of centroid assignments (i.e. each entry in range [1..K]) for each
%   example, and K, the number of centroids. You should return a matrix
%   centroids, where each row of centroids is the mean of the data points
%   assigned to it.
%

% Useful variables
[m n] = size(X);

% You need to return the following variables correctly.
centroids = zeros(K, n);

% ====================== YOUR CODE HERE ======================
% Instructions: Go over every centroid and compute mean of all points that
%               belong to it. Concretely, the row vector centroids(i, :)
%               should contain the mean of the data points assigned to
%               centroid i.
%
% Note: You can use a for-loop over the centroids to compute this.
%

for i=1:K
    index = find(idx == i);
    centroids(i,:) = mean(X(index,:));
end

% =============================================================

end

看看聚類中心是怎麼變化的吧~

這裡寫圖片描述

剩下的基本不怎麼變啦~

還有一點需要注意：聚類中心的隨機初始化：

Code（kMeansInitCentroids.m）：

function centroids = kMeansInitCentroids(X, K)
%KMEANSINITCENTROIDS This function initializes K centroids that are to be 
%used in K-Means on the dataset X
%   centroids = KMEANSINITCENTROIDS(X, K) returns K initial centroids to be
%   used with the K-Means on the dataset X
%

% You should return this values correctly
centroids = zeros(K, size(X, 2));

% ====================== YOUR CODE HERE ======================
% Instructions: You should set centroids to randomly chosen examples from
%               the dataset X
%

% Randomly reorder the indices of examples
randidx = randperm(size(X, 1));
% Take the first K examples as centroids
centroids = X(randidx(1:K), :);

% =============================================================

end

另外最後還給了個例子，是關於影象顏色壓縮的，也是用的K-means演算法，並不是很難，自己看看了解一下就好~

吳恩達機器學習 - 無監督學習——K-means演算法吳恩達機器學習 - 無監督學習——K-means演算法

原吳恩達機器學習 - 無監督學習——K-means演算法 2018年06月25日 12:02:37 離殤灬孤狼閱讀數：181

機器學習實踐（十七）—sklearn之無監督學習-K-means演算法

一、無監督學習概述什麼是無監督學習之所以稱為無監督，是因為模型學習是從無標籤的資料開始學習的。無監督學習包含演算法聚類 K-means(K均值聚類) 降維

無監督學習——K-means演算法

筆記：核心步驟：那我們就實現這兩個函式就行啦： findClosestCentroids.m（把每個點染色）： function idx = fi

【ML演算法】無監督學習——K-means聚類

前言這一系列文章將介紹各種機器學習演算法，部分演算法涉及公示推導，我的部落格中有另一個板塊介紹基於python和R實現各種機器學習演算法，詳情見置頂的目錄。 K-means演算法聚類演算法是一種無監督的機器學習演算法，通過距離測度實現樣本點的歸類，

無監督學習k-means簡單實現

%隨機獲取150個點 %X = [randn(50,2)+ones(50,2);randn(50,2)-ones(50,2);randn(50,2)+[ones(50,1),-ones(50,1)]]; X = load('test.txt') %二維高斯擬合函式 o

非監督學習—K-means演算法聚類學習筆記

非監督學習：無類別標記的一、 K-means 演算法： 1. Clustering 中的經典演算法，資料探勘十大經典演算法之一 2. 引數k 已知引數 k ；然後將事先輸入的n個數據物件劃分為 k個聚類以便使得所獲得的聚類滿足：同一聚類中的物件相似度較高；而不同聚

機器學習--K-means演算法（聚類，無監督學習）

一、基本思想聚類屬於無監督學習，以往的迴歸、樸素貝葉斯、SVM等都是有類別標籤y的，也就是說樣例中已經給出了樣例的分類。而聚類的樣本中卻沒有給定y，只有特徵x，比如假設宇宙中的星星可以表示成三維空間中的點集。聚類的目的是找到每個樣本x潛在的類別y，並將同類別y的樣本x

無監督學習——K-均值聚類算法對未標註數據分組

機器學習算法可能變化分類結果 sts lis mat 得到無監督學習和監督學習不同的是，在無監督學習中數據並沒有標簽（分類）。無監督學習需要通過算法找到這些數據內在的規律，將他們分類。（如下圖中的數據，並沒有標簽，大概可以看出數據集可以分為三類，

機器學習——K-means演算法（聚類演算法）

聚類在說K-means聚類演算法之前必須要先理解聚類和分類的區別。分類其實是從特定的資料中挖掘模式，作出判斷的過程。比如Gmail郵箱裡有垃圾郵件分類器，一開始的時候可能什麼都不過濾，在日常使用過程中，我人工對於每一封郵件點選“垃圾”或“不是垃圾”，過一段時間，Gmail就體現出

機器學習--K-means演算法

概述聚類（K-mean）是一種典型的無監督學習。採用距離作為相似性的評價指標，即認為兩個物件的距離越近，其相似度就越大。該演算法認為類簇是由距離靠近的物件組成的，因此把得到緊湊且獨立的簇作為最終目標。核心思想通過迭代尋找k個類簇的一種劃分方案，使得用這k個類簇的均值來代

機器學習——K-Means演算法

Unsupervised Learning task learning a distribution from sample(GMM/VAE) clustering(PAC) feature learning 按照演算法目的，無監督演算法大體可分為上述三類，

機器學習【三】無監督學習-聚類演算法-Kmeans

1.K-meansK-means，屬於無監督學習。即輸入資料沒有標籤y，經過一些演算法後，找到標籤y。聚類的目的就是找到每個樣本潛在的標籤y，並將同類別的樣本放到一起。k-means聚類：就是把n個點（可以是樣本的一次觀察或一個例項）劃分到k個聚類中，使得每個點都屬於離他最近

機器學習非監督學習—k-means及案例分析

一、非監督學習無監督學習，顧名思義，就是不受監督的學習，一種自由的學習方式。該學習方式不需要先驗知識進行指導，而是不斷地自我認知，自我鞏固，最後進行自我歸納，在機器學習中，無監督學習可以被簡單理解為不為訓練

Andrew Ng機器學習課程筆記（十三）之無監督學習之EM演算法

Preface Jensen’s Inequality（Jensen不等式） Expectation-Maximization Algorithm（EM演算法） Jensen’s Inequality 對於凸函式令f(x)f(x)為

機器學習-K-Means演算法（附原始碼）

定義俗話說“物以類聚”，其實從廣義上說，聚類就是將資料集中在某些方面相似的資料成員放在一起。一個聚類就是一些資料例項的集合，其中處於相同聚類中的資料元素彼此相似，但是處於不同聚類中的元素彼此不同。由於在聚類中那些表示資料類別的分類或分組資訊是沒有的，即這些資料是沒

python 機器學習K-means演算法實現

\編譯器:pycharm 1.匯入K-means相關包這個包匯入有點坑,有許多依賴包需要匯入,推薦下載Anaconda後,在pycharm匯入Anaconda中的python,在下載sklearn包,就可以開心的敲程式碼了~! 2正式開始: from

吳恩達機器學習（十一）K-means（無監督學習、聚類演算法）

目錄 0. 前言學習完吳恩達老師機器學習課程的無監督學習，簡單的做個筆記。文中部分描述屬於個人消化後的理解，僅供參考。如果這篇文章對你有一點小小的幫助，請給個關注喔~我會非常開心

【無監督學習】1：K-means聚類演算法原理

前言：粗略研究完神經網路基礎——BP、CNN、RNN、LSTM網路後自己算是鬆懈了很多，好長的時間都沒有堅持再更新部落格了。“腐敗”生活了這麼久，還是要找到自己一點樂趣吧，於是想了一想，決定把《機器學習》的演算法研究過得都重新梳理一遍，於是就從無監督學習——聚類

無監督學習——聚類（k-means演算法）

無監督學習是一種對不含標記的資料建立模型的機器學習正規化。無監督學習應用領域： - 資料探勘 - 醫學影像 - 股票市場分析 - 計算機視覺

無監督學習-聚類 K-means聚類演算法

#無監督學習-聚類 K-means聚類演算法 #以k為引數，把n個物件分為k個簇，使簇內具有較高相似度，簇間相似度較低 #1.隨機選擇k個點作為初始聚類中心；2.根據剩下點與聚類中心的距離(預設就是歐氏距離)，歸為最近的簇； #3.對每個簇，計算所有點的均值作為新聚類中心；4.重複2、3直至

無監督學習——K-means演算法

筆記：

核心步驟：

那我們就實現這兩個函式就行啦：

findClosestCentroids.m（把每個點染色）：

computeCentroids.m（更新聚類中心）：

看看聚類中心是怎麼變化的吧~

還有一點需要注意：聚類中心的隨機初始化：

Code（kMeansInitCentroids.m）：

相關推薦