yolov3 kmeans 計算anchor boxes
阿新 • 發佈:2019-05-27
yolov3 kmeans
yolov3在做boundingbox預測的時候,用到了anchor boxes.
.cfg檔案內的配置如下:
[yolo]
mask = 3,4,5
anchors = 10,14, 23,27, 37,58, 81,82, 135,169, 344,319
在用我們自己的資料做訓練的時候,要先修改anchors,匹配我們自己的資料.anchors大小通過聚類得到.
通俗地說,聚類就是把捱得近的資料點劃分到一起.
kmeans演算法的思想很簡單
- 隨便指定k個cluster
- 把點劃分到與之最近的一個cluster
- 上面得到的cluster肯定是不好的,因為一開始的cluster是亂選的嘛
- 更新每個cluster為當前cluster的點的均值.(這時候cluster肯定變準了,為什麼呢?比如當前這個cluster裡有3個點,2個點靠的很近,還有1個點離得稍微遠點,那取均值的話,那相當於靠的很近的2個點有更多投票權,新算出來的cluster的中心會更加靠近這兩個點.你要是非要擡槓:那萬一一開始我隨機指定的cluster中心點就特別準呢,重新取均值反而把中心點弄的不準了?事實上這是kmeans的一個缺陷:比較依賴初始的k個cluster的位置.選擇不恰當的k值可能會導致糟糕的聚類結果。這也是為什麼要進行特徵檢查來決定資料集的聚類數目了。)
- 重新執行上述過程
- 把點劃分到與之最近的一個cluster
- 更新每個cluster為當前cluster的點的均值
- 不斷重複上述過程,直至cluster中心變化很小
'''
Compute YOLOv3 anchor boxes by k-means clustering over the (w, h)
dimensions found in darknet-format label files.

Distance metric is d = 1 - IoU (boxes compared as if sharing a corner),
which is the metric recommended by the YOLO papers instead of Euclidean
distance.

Created on Feb 20, 2017 @author: jumabek
'''
from os import listdir
from os.path import isfile, join
import argparse
#import cv2
import numpy as np
import sys
import os
import shutil
import random
import math

# Network input resolution from the .cfg; anchors are written in units of
# feature-map cells (input resolution / 32).
width_in_cfg_file = 416.
height_in_cfg_file = 416.


def IOU(x, centroids):
    """Return a (k,)-shaped array of IoU between box ``x`` and each centroid.

    Boxes are compared as (w, h) pairs anchored at the same corner, so the
    intersection/union depends only on the dimensions.

    x         -- (w, h) of one box (normalized label coordinates)
    centroids -- iterable of k (w, h) centroid boxes
    """
    similarities = []
    w, h = x
    for centroid in centroids:
        c_w, c_h = centroid
        if c_w >= w and c_h >= h:
            # Centroid fully contains the box.
            similarity = w * h / (c_w * c_h)
        elif c_w >= w and c_h <= h:
            similarity = w * c_h / (w * h + (c_w - w) * c_h)
        elif c_w <= w and c_h >= h:
            similarity = c_w * h / (w * h + c_w * (c_h - h))
        else:
            # Box fully contains the centroid.
            similarity = (c_w * c_h) / (w * h)
        similarities.append(similarity)
    return np.array(similarities)


def avg_IOU(X, centroids):
    """Mean best-IoU of every sample in X against its closest centroid."""
    n = X.shape[0]
    total = 0.  # BUGFIX: was named `sum`, shadowing the builtin
    for i in range(n):
        # IOU() returns the IoU against every centroid; keep the best one.
        total += max(IOU(X[i], centroids))
    return total / n


def write_anchors_to_file(centroids, X, anchor_file):
    """Write anchors (scaled to feature-map cells, sorted by width) plus
    the average IoU to ``anchor_file``."""
    anchors = centroids.copy()
    print(anchors.shape)

    # Labels are normalized to [0, 1]; scale to grid cells
    # (input resolution / 32) as expected by the darknet .cfg.
    anchors[:, 0] *= width_in_cfg_file / 32.
    anchors[:, 1] *= height_in_cfg_file / 32.

    sorted_indices = np.argsort(anchors[:, 0])
    print('Anchors = ', anchors[sorted_indices])

    # BUGFIX: file handle was never closed; use a context manager.
    with open(anchor_file, 'w') as f:
        for i in sorted_indices[:-1]:
            f.write('%0.2f,%0.2f, ' % (anchors[i, 0], anchors[i, 1]))
        # No trailing comma after the last anchor.
        # BUGFIX: original formatted a 1-element slice (sorted_indices[-1:])
        # with '%0.2f', which modern NumPy rejects; index the scalar instead.
        last = sorted_indices[-1]
        f.write('%0.2f,%0.2f\n' % (anchors[last, 0], anchors[last, 1]))
        f.write('%f\n' % (avg_IOU(X, centroids)))
    print()


def kmeans(X, centroids, eps, anchor_file):
    """Run k-means on X (rows of (w, h)) with 1-IoU as the distance.

    Iterates until the sample-to-centroid assignment stops changing, then
    writes the final anchors to ``anchor_file``.  ``centroids`` is updated
    in place.  ``eps`` is kept for interface compatibility but unused
    (convergence is detected exactly).
    """
    N = X.shape[0]
    k, dim = centroids.shape
    prev_assignments = np.ones(N) * (-1)
    iteration = 0
    old_D = np.zeros((N, k))

    while True:
        iteration += 1
        # Distance of every sample to every centroid: d = 1 - IoU, shape (N, k).
        D = np.array([1 - IOU(X[i], centroids) for i in range(N)])
        print("iter {}: dists = {}".format(iteration, np.sum(np.abs(old_D - D))))

        # Assign each sample to its nearest centroid.
        assignments = np.argmin(D, axis=1)

        if (assignments == prev_assignments).all():
            print("Centroids = ", centroids)
            write_anchors_to_file(centroids, X, anchor_file)
            return

        # Recompute each centroid as the mean of its assigned samples.
        # BUGFIX: np.float was removed in NumPy 1.24; use the builtin float.
        centroid_sums = np.zeros((k, dim), dtype=float)
        for i in range(N):
            centroid_sums[assignments[i]] += X[i]
        for j in range(k):
            count = np.sum(assignments == j)
            # BUGFIX: guard against an empty cluster (division by zero);
            # keep the previous centroid in that case.
            if count > 0:
                centroids[j] = centroid_sums[j] / count

        prev_assignments = assignments.copy()
        old_D = D.copy()


def main(argv):
    """Parse arguments, collect (w, h) pairs from label files, run k-means.

    With -num_clusters 0 (default) sweeps k = 1..10 so the avg-IoU / k
    tradeoff can be inspected; otherwise clusters with the requested k.
    """
    parser = argparse.ArgumentParser()
    parser.add_argument('-filelist', default='\\path\\to\\voc\\filelist\\train.txt',
                        help='path to filelist\n')
    parser.add_argument('-output_dir', default='generated_anchors/anchors', type=str,
                        help='Output anchor directory\n')
    parser.add_argument('-num_clusters', default=0, type=int,
                        help='number of clusters\n')
    args = parser.parse_args()

    # BUGFIX: os.mkdir fails for the nested default path; makedirs handles it
    # and exist_ok avoids a race with a pre-existing directory.
    os.makedirs(args.output_dir, exist_ok=True)

    # BUGFIX: file handles were never closed; use context managers.
    with open(args.filelist) as f:
        lines = [line.rstrip('\n') for line in f.readlines()]

    annotation_dims = []
    for line in lines:
        # Map an image path to its darknet-style label file.
        #line = line.replace('images','labels')
        #line = line.replace('img1','labels')
        line = line.replace('JPEGImages', 'labels')
        line = line.replace('.jpg', '.txt')
        line = line.replace('.png', '.txt')
        print(line)
        with open(line) as f2:
            # BUGFIX: inner loop reused the outer variable `line`.
            for label_line in f2.readlines():
                label_line = label_line.rstrip('\n')
                # darknet label format: class cx cy w h -- keep only (w, h).
                w, h = label_line.split(' ')[3:]
                annotation_dims.append(tuple(map(float, (w, h))))
    annotation_dims = np.array(annotation_dims)

    eps = 0.005

    if args.num_clusters == 0:
        # Sweep k = 1..10 and write one anchors file per k.
        for num_clusters in range(1, 11):
            anchor_file = join(args.output_dir, 'anchors%d.txt' % (num_clusters))
            indices = [random.randrange(annotation_dims.shape[0])
                       for i in range(num_clusters)]
            centroids = annotation_dims[indices]
            kmeans(annotation_dims, centroids, eps, anchor_file)
            print('centroids.shape', centroids.shape)
    else:
        anchor_file = join(args.output_dir, 'anchors%d.txt' % (args.num_clusters))
        indices = [random.randrange(annotation_dims.shape[0])
                   for i in range(args.num_clusters)]
        centroids = annotation_dims[indices]
        kmeans(annotation_dims, centroids, eps, anchor_file)
        print('centroids.shape', centroids.shape)


if __name__ == "__main__":
    main(sys.argv)
用法:python3 gen_anchors.py -filelist ./park_train.txt park_train.txt描述了訓練圖片路