python3 Dense SIFT演算法的應用與實現

阿新 • • 發佈：2018-12-15

python3 Dense SIFT演算法的應用與實現

Dense SIFT 因為課題的需求，傳統的SIFT演算法即Sparse SIFT，不能很好地表徵不同類之間的特徵差異，達不到所需的分類要求。而Dense SIFT演算法，是一種對輸入影象進行分塊處理，再進行SIFT運算的特徵提取過程。Dense SIFT根據可調的引數大小，來適當滿足不同分類任務下對影象的特徵表徵能力。而Sparse SIFT則是對整幅影象的處理，得到一系列特徵點(keypoints)。如下圖所示：在此背景下，通常來講Dense SIFT更適用於影象分類識別的任務，而Sparse SIFT更適用於影象檢索分割的任務。
Dense SIFT的程式碼實現

原始碼為python2，現用python3重新編譯：

import numpy as np
from scipy import signal
from matplotlib import pyplot
import matplotlib

# sift features
Nangles = 8   
Nbins = 4    
Nsamples = Nbins**2   
alpha = 9.0
angles = np.array(range(Nangles))*2.0*np.pi/Nangles

def gen_dgauss(sigma):
    '''
    generating a derivative of Gauss filter on both the X and Y
    direction.//在X和Y方向上生成高斯濾波器的導數。
    '''
    fwid = np.int(2*np.ceil(sigma))
    G = np.array(range(-fwid,fwid+1))**2
    G = G.reshape((G.size,1)) + G
    G = np.exp(- G / 2.0 / sigma / sigma)
    G /= np.sum(G)
    GH,GW = np.gradient(G)
    GH *= 2.0/np.sum(np.abs(GH))
    GW *= 2.0/np.sum(np.abs(GW))
    return GH,GW

class DsiftExtractor:
    '''
    The class that does dense sift feature extractor.//進行密集篩選的類提取器
    Sample Usage:
        extractor = DsiftExtractor(gridSpacing,patchSize,[optional params])
        feaArr,positions = extractor.process_image(Image)
    '''
    def __init__(self, gridSpacing, patchSize,
                 nrml_thres = 1.0,\
                 sigma_edge = 0.8,\
                 sift_thres = 0.2):
        '''
        gridSpacing: the spacing for sampling dense descriptors//密集描述符的取樣間隔
        patchSize: the size for each sift patch//每個sift patch的尺寸
        nrml_thres: low contrast normalization threshold//低對比度歸一化閾值
        sigma_edge: the standard deviation for the gaussian smoothing//高斯平滑的標準差
            before computing the gradient
        sift_thres: sift thresholding (0.2 works well based on
            Lowe's SIFT paper)//sift閾值化(0.2基於Lowe's sift paper效果很好)
        '''
        self.gS = gridSpacing
        self.pS = patchSize
        self.nrml_thres = nrml_thres
        self.sigma = sigma_edge
        self.sift_thres = sift_thres
        # compute the weight contribution map
        sample_res = self.pS / np.double(Nbins)
        sample_p = np.array(range(self.pS))
        sample_ph, sample_pw = np.meshgrid(sample_p,sample_p)
        sample_ph.resize(sample_ph.size)
        sample_pw.resize(sample_pw.size)
        bincenter = np.array(range(1,Nbins*2,2)) / 2.0 / Nbins * self.pS - 0.5 
        bincenter_h, bincenter_w = np.meshgrid(bincenter,bincenter)
        bincenter_h.resize((bincenter_h.size,1))
        bincenter_w.resize((bincenter_w.size,1))
        dist_ph = abs(sample_ph - bincenter_h)
        dist_pw = abs(sample_pw - bincenter_w)
        weights_h = dist_ph / sample_res
        weights_w = dist_pw / sample_res
        weights_h = (1-weights_h) * (weights_h <= 1)
        weights_w = (1-weights_w) * (weights_w <= 1)
        # weights is the contribution of each pixel to the corresponding bin center
        self.weights = weights_h * weights_w
        #pyplot.imshow(self.weights)
        #pyplot.show()
        
    def process_image(self, image, positionNormalize = True,\
                       verbose = True):
        '''
        processes a single image, return the locations
        and the values of detected SIFT features.//處理單個影象，返回檢測到的SIFT特徵的位置和值。
        image: a M*N image which is a numpy 2D array. If you 
            pass a color image, it will automatically be converted
            to a grayscale image.//一個M*N的影象，它是一個numpy二維陣列。如果您傳遞一個彩色影象，它將自動轉換為灰度影象。
        positionNormalize: whether to normalize the positions
            to [0,1]. If False, the pixel-based positions of the
            top-right position of the patches is returned.//是否將位置規範化為[0,1]。如果為False，則返回補丁右上角的基於畫素的位置。
        
        Return values:
        feaArr: the feature array, each row is a feature//特徵陣列，每一行都是一個特徵
        positions: the positions of the features//特徵的位置
        '''

        image = image.astype(np.double)
        if image.ndim == 3:
            # we do not deal with color images.
            image = np.mean(image,axis=2)
        # compute the grids
        H,W = image.shape
        gS = self.gS
        pS = self.pS
        remH = np.mod(H-pS, gS)
        remW = np.mod(W-pS, gS)
        offsetH = remH//2
        offsetW = remW//2
        gridH,gridW = np.meshgrid(range(offsetH,H-pS+1,gS), range(offsetW,W-pS+1,gS))
        
        
        gridH = gridH.flatten()
        gridW = gridW.flatten()
        if verbose:
            print('Image: w {}, h {}, gs {}, ps {}, nFea {}'.\
                    format(W,H,gS,pS,gridH.size))
        feaArr = self.calculate_sift_grid(image,gridH,gridW)
        feaArr = self.normalize_sift(feaArr)
        if positionNormalize:
            positions = np.vstack((gridH / np.double(H), gridW / np.double(W)))
        else:
            positions = np.vstack((gridH, gridW))
        return feaArr, positions

    def calculate_sift_grid(self,image,gridH,gridW):
        '''
        This function calculates the unnormalized sift features
        It is called by process_image().//此函式計算未規範化的sift特性。
                                        //它被process_image()呼叫。
        '''
        H,W = image.shape
        Npatches = gridH.size
        feaArr = np.zeros((Npatches,Nsamples*Nangles))

        # calculate gradient
        GH,GW = gen_dgauss(self.sigma)
        IH = signal.convolve2d(image,GH,mode='same')
        IW = signal.convolve2d(image,GW,mode='same')
        Imag = np.sqrt(IH**2+IW**2)
        Itheta = np.arctan2(IH,IW)
        Iorient = np.zeros((Nangles,H,W))
        for i in range(Nangles):
            Iorient[i] = Imag * np.maximum(np.cos(Itheta - angles[i])**alpha,0)
            #pyplot.imshow(Iorient[i])
            #pyplot.show()
        for i in range(Npatches):
            currFeature = np.zeros((Nangles,Nsamples))
            for j in range(Nangles):
                currFeature[j] = np.dot(self.weights,\
                        Iorient[j,gridH[i]:gridH[i]+self.pS, gridW[i]:gridW[i]+self.pS].flatten())
            feaArr[i] = currFeature.flatten()
        return feaArr

    def normalize_sift(self,feaArr):
        '''
        This function does sift feature normalization
        following David Lowe's definition (normalize length ->
        thresholding at 0.2 -> renormalize length)
        '''
        siftlen = np.sqrt(np.sum(feaArr**2,axis=1))
        hcontrast = (siftlen >= self.nrml_thres)
        siftlen[siftlen < self.nrml_thres] = self.nrml_thres
        # normalize with contrast thresholding
        feaArr /= siftlen.reshape((siftlen.size,1))
        # suppress large gradients
        feaArr[feaArr>self.sift_thres] = self.sift_thres
        # renormalize high-contrast ones
        feaArr[hcontrast] /= np.sqrt(np.sum(feaArr[hcontrast]**2,axis=1)).\
                reshape((feaArr[hcontrast].shape[0],1))
        return feaArr

class SingleSiftExtractor(DsiftExtractor):
    '''
    The simple wrapper class that does feature extraction, treating
    the whole image as a local image patch.//一個簡單的封裝類，它能把整個影象當作一個區域性影象補丁
    '''
    def __init__(self, patchSize,
                 nrml_thres = 1.0,\
                 sigma_edge = 0.8,\
                 sift_thres = 0.2):
        # simply call the super class __init__ with a large gridSpace
        DsiftExtractor.__init__(self, patchSize, patchSize, nrml_thres, sigma_edge, sift_thres)   
    
    def process_image(self, image):
        return DsiftExtractor.process_image(self, image, False, False)[0]
    
if __name__ == '__main__':
    # ignore this. I only use this for testing purpose...
    from scipy import misc
    extractor = DsiftExtractor(8,16,1)
    image = misc.imread('C:/Users/qgl/Desktop/articles/test1.png')
    image = np.mean(np.double(image),axis=2)
    feaArr,positions = extractor.process_image(image)
    #pyplot.hist(feaArr.flatten(),bins=100)
    #pyplot.imshow(feaArr[:256])
    #pyplot.plot(np.sum(feaArr,axis=0))
    pyplot.imshow(feaArr[np.random.permutation(feaArr.shape[0])[:256]])
    
    # test single sift extractor
    extractor = SingleSiftExtractor(16)
    feaArrSingle = extractor.process_image(image[:16,:16])
    pyplot.figure()
    pyplot.plot(feaArr[0],'r')
    pyplot.plot(feaArrSingle,'b')
    pyplot.show()

程式碼e.g. 如果你的輸入影象畫素大小是200 * 200，設定步長引數為8，patch塊大小為16 * 16，輸入影象轉換為24 * 24=576個patch塊，每個patch塊進行SIFT提取關鍵點，最後得到576個特徵點。這樣輸入影象就轉換為了576 * 128維的矩陣向量。繼續新增一個批量處理函式，處理好資料集，就可以進行後續的分類識別工作了。

python3 Dense SIFT演算法的應用與實現

python3 Dense SIFT演算法的應用與實現 Dense SIFT 因為課題的需求，傳統的SIFT演算法即Sparse SIFT，不能很好地表徵不同類之間的特徵差異，達不到所需的分類要求。而Dense SIFT演算法，是一種對輸入

【C++ STL應用與實現】72: 標準庫裡的堆--如何使用標準庫的heap演算法

本系列文章的目錄在這裡：目錄. 通過目錄裡可以對STL總體有個大概瞭解前言本文介紹如何使用STL裡的heap（堆）演算法。第一次接觸heap這種資料結構是在大學的資料結構教材上，它是一棵完全二叉樹。在STL中，heap是演算法的形式提供給我們使用的。

ssm redis 數據字典在J2EE中的多種應用與實現

stat ide ddk ucc gif ndt ida creat img 數據字典在項目中是不可缺少的“基礎設施”，關於數據字典如何設計如何實現，今天抽空講一下吧先看一下表設計：通過自定義標簽來實現頁面的渲染： public class DataDictVal

文字分類——NLV演算法研究與實現

內容提要 1 引言 2 NLV演算法理論 2.1 訓練模型 2.2 分類模型 3 NLV演算法實現 3.1 演算法描述 4 實驗及效能評估 4.1 實驗設計 4

特徵選擇——Matrix Projection演算法研究與實現

內容提要引言 MP特徵選擇思想 MP特徵選擇演算法 MP特徵選擇分析實驗結果分析總結引言一般選擇文字的片語作為分類器輸入向量的特徵語義單元，而作為單詞或詞語的片語，在任何一種語言中都有數萬或數十萬個。另外

機器學習（七）決策樹演算法研究與實現

前言從決策樹這三個字中我們既可以看出來它的主要用途幫助決策某一類問題，樹是輔助我們來決策用的，如下圖一個簡單的判斷不同階段人年齡的圖： &

zookeeper之應用與實現

Leader Elections(leader選舉) 指派一個程序作為組織者，將任務分發給各節點。在任務開始前，哪個節點都不知道誰是leader(領導者)或者coordinator(協調者)。當選舉演算法開始執行後，每個節點最終會得到一個唯一的節點作為任務l

希爾排序演算法原理與實現

1.問題描述輸入：n個數的序列<a1,a2,a3,...,an>。輸出：原序列的一個重排<a1*,a2*,a3*,...,an*>；，使得a1*<=a2*<=a3*<=...<=an*。 2. 問題分析例如，假設有

磁碟排程演算法設計與實現——C語言

一、設計分析共享裝置的典型代表為磁碟，磁碟物理塊的地址由柱面號、磁頭號、扇區號來指定，完成磁碟某一個物理塊的訪問要經過三個階段：尋道時間Ts、旋轉延遲時間Tw和讀寫時間Trw。尋道時間Ts是磁頭從當前磁軌移動到目標磁軌所需要的時間；旋轉延遲時間Tw是當磁頭停留在目標磁軌後，目

二十五、併發程式設計之join應用與實現原理剖析

1、join有什麼用呢？當一個執行緒正在進行中的時候，如果我們想呼叫另外一個執行緒的話，這時我們可以使用join。 2、join方法的底層原理，簡單來說就是，join方法能把所呼叫join方法的執行緒進入休眠狀態(wait())，等執行完joinThread執行緒之後，會自動

演算法導論－最大子陣列問題－線性時間複雜度演算法分析與實現

之前寫了最大子陣列問題的分治法，今天把這個問題的線性時間複雜度的演算法寫出來。這個方法在演算法導論最大子陣列問題的課後思考題裡面提出來了，只是說的不夠詳細。思考題如下：使用如下思想為最大子陣列問題設計一個非遞迴的，線性時間複雜度的演算法。從陣列左邊界開始，由左至右處理，

【C++ STL應用與實現】5: 如何使用std::array (since C++11)

本系列文章的目錄在這裡：目錄. 通過目錄裡可以對STL總體有個大概瞭解前言本文總結了STL中的序列式容器array的用法及注意事項。array的出現代表著C++的程式碼更進一步“現代化”，就像std::string的出現代替了c風格字串並且能和STL

Java排序演算法分析與實現：快排、氣泡排序、選擇排序、插入排序、歸併排序（一）

轉載 https://www.cnblogs.com/bjh1117/p/8335628.html 一、概述：　　本文給出常見的幾種排序演算法的原理以及java實現，包括常見的簡單排序和高階排序演算法，以及其他常用的演算法知識。　　簡單排序：氣泡排序、選擇排序、

演算法--中級演算法題目與實現

1、區間求值我們會傳遞給你一個包含兩個數字的陣列。返回這兩個數字和它們之間所有數字的和。最小的數字並非總在最前面。 2、找出陣列間的差別比較兩個陣列，然後返回一個新陣列，該陣列的元素為兩個給定陣列中所有獨有的陣列元素。換言之，返回兩個陣列的差異 3、數

常見限流演算法研究與實現

一、限流場景很多做服務介面的人或多或少的遇到這樣的場景，由於業務應用系統的負載能力有限，為了防止非預期的請求對系統壓力過大而拖垮業務應用系統。也就是面對大流量時，如何進行流量控制？服務介面的流量控制策略：分流、降級、限流等。本文討論下限流策略，雖然降低了服務

SM2演算法第二十五篇：ECDSA數字簽名演算法原理與實現

---------------------------------------------轉載原因------------------------------------------------- 這邊部落格中有關 EC_KEY_set_private_key和EC_KEY_set_public_key

knn演算法原理與實現（1）

一、演算法原理與模型 knn演算法即最近鄰演算法，其原理非常簡單即根據給定的資料集，計算資料集中點的特徵到待分類資料的歐氏距離，然後選擇距離最近的k個作為判斷依據，這k個數據中出現類別最多的作為新輸入資料的label。模型用公式表示如下：二、python程式碼實現

24點演算法講解與實現

題目描述：在52張撲克牌中（去掉大小王），隨機抽取4張牌，找到所有可能的情況和解。前言博主曾在網上看到有很多關於24點的演算法，但很多都是沒有找全所有表示式，要麼就是沒有去重，而且搜尋的時間過長，有些慢的要半個小時才能得到結果。所以經過我的不懈努力，經過幾

求子集問題演算法分析與實現（遞迴、非遞迴）

問題描述：若有數字集合{1，2，3}，則其子集為NULL、{1}、{2}、{3}、{1，2}、{1，3}、{2，3}、{1，2，3}。現給定陣列，求其的全部子集。實現如下： //非

【C++ STL應用與實現】56: 使用std::unique刪除重複元素

本系列文章的目錄在這裡：(目錄). 通過目錄裡可以對STL總體有個大概瞭解前言本文介紹了STL中的unique演算法的使用，結合一個具體例子講解如何使用它刪除自定義型別結合裡面的重複元素(不僅僅是連續的)。原型 <algorithm>中的unique函

python3 Dense SIFT演算法的應用與實現

python3 Dense SIFT演算法的應用與實現

相關推薦