Parsing the retina-unet Code: UNet-Based Retinal Vessel Segmentation

UNet for Retinal Vessel Segmentation

UNet is one of the most commonly used architectures for image segmentation. Here it is applied to segmenting blood vessels in fundus (retinal) images. The code follows the retina-unet project, and the resulting network achieves quite good accuracy.

Retinal Vessels

An example fundus image is shown below. [Figure: a fundus image alongside its segmented vessel map.] As the example shows, the entire vascular tree is extracted from the image.

Parsing the UNet Code

I will not walk through the UNet architecture itself in detail here; if you are reading this, you presumably already have the basics. The network is written in Keras on a TensorFlow backend.

from keras.models import Model
from keras.layers import Input, Conv2D, MaxPooling2D, UpSampling2D, Dropout, concatenate
from keras.layers import core

def get_unet(n_ch, patch_height, patch_width):
    inputs = Input(shape=(n_ch, patch_height, patch_width))
    # encoder block 1
    conv1 = Conv2D(32, (3, 3), activation='relu', padding='same', data_format='channels_first')(inputs)
    conv1 = Dropout(0.2)(conv1)
    conv1 = Conv2D(32, (3, 3), activation='relu', padding='same', data_format='channels_first')(conv1)
    # note: pooling/upsampling also need channels_first, unless keras.json is configured that way globally
    pool1 = MaxPooling2D((2, 2), data_format='channels_first')(conv1)
    # encoder block 2
    conv2 = Conv2D(64, (3, 3), activation='relu', padding='same', data_format='channels_first')(pool1)
    conv2 = Dropout(0.2)(conv2)
    conv2 = Conv2D(64, (3, 3), activation='relu', padding='same', data_format='channels_first')(conv2)
    pool2 = MaxPooling2D((2, 2), data_format='channels_first')(conv2)
    # bottleneck
    conv3 = Conv2D(128, (3, 3), activation='relu', padding='same', data_format='channels_first')(pool2)
    conv3 = Dropout(0.2)(conv3)
    conv3 = Conv2D(128, (3, 3), activation='relu', padding='same', data_format='channels_first')(conv3)
    # decoder block 1: upsample, then concatenate the skip connection from conv2 along the channel axis
    up1 = UpSampling2D(size=(2, 2), data_format='channels_first')(conv3)
    up1 = concatenate([conv2, up1], axis=1)
    conv4 = Conv2D(64, (3, 3), activation='relu', padding='same', data_format='channels_first')(up1)
    conv4 = Dropout(0.2)(conv4)
    conv4 = Conv2D(64, (3, 3), activation='relu', padding='same', data_format='channels_first')(conv4)
    # decoder block 2: upsample, then concatenate the skip connection from conv1
    up2 = UpSampling2D(size=(2, 2), data_format='channels_first')(conv4)
    up2 = concatenate([conv1, up2], axis=1)
    conv5 = Conv2D(32, (3, 3), activation='relu', padding='same', data_format='channels_first')(up2)
    conv5 = Dropout(0.2)(conv5)
    conv5 = Conv2D(32, (3, 3), activation='relu', padding='same', data_format='channels_first')(conv5)
    # per-pixel 2-class head: (2, H, W) -> (2, H*W) -> (H*W, 2), then softmax over the 2 classes
    conv6 = Conv2D(2, (1, 1), activation='relu', padding='same', data_format='channels_first')(conv5)
    conv6 = core.Reshape((2, patch_height * patch_width))(conv6)
    conv6 = core.Permute((2, 1))(conv6)
    conv7 = core.Activation('softmax')(conv6)

    model = Model(inputs=inputs, outputs=conv7)
    model.compile(optimizer='sgd', loss='categorical_crossentropy', metrics=['accuracy'])
    return model

This is the full UNet used here. Note that it only downsamples twice. Why only two downsampling steps, and is that acceptable for images this large? I will come back to these questions below.
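As a quick sanity check (a minimal sketch, assuming the get_unet definition above), the model can be built on the 48×48 patches used later and its output shape inspected:

model = get_unet(n_ch=1, patch_height=48, patch_width=48)
model.summary()
# one (background, vessel) softmax per pixel of the patch:
print(model.output_shape)  # (None, 2304, 2), i.e. (None, 48*48, 2)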

Data Processing

With the network designed, the next step is the data. Opening the DRIVE training set, we find that the number of images is far too small, so the data must be augmented. The original images are 565×584, and augmentation is done by randomly cropping 48×48 patches from within each image and training on those. This is also why the UNet above downsamples only twice: for 48×48 patches, two levels are entirely sufficient (48 halves cleanly to 24 and then 12, and a third halving would leave very little spatial context). In addition, four preprocessing steps are applied to the images: 1. gray-scale conversion, 2. standardization, 3. contrast-limited adaptive histogram equalization (CLAHE), 4. gamma adjustment. The code is as follows.

import numpy as np
# load_hdf5, my_PreProc, data_consistency_check and extract_random are helper
# functions from the retina-unet repo (lib/help_functions.py,
# lib/pre_processing.py and lib/extract_patches.py)

def get_data_training(DRIVE_train_imgs_original,
                      DRIVE_train_groudTruth,  # (sic) parameter name kept from the repo
                      patch_height,
                      patch_width,
                      N_subimgs,
                      inside_FOV):
    train_imgs_original = load_hdf5(DRIVE_train_imgs_original)
    train_masks = load_hdf5(DRIVE_train_groudTruth)  # masks always the same
    # visualize(group_images(train_imgs_original[0:20,:,:,:],5),'imgs_train')  # check original train imgs

    train_imgs = my_PreProc(train_imgs_original)  # the four preprocessing steps
    train_masks = train_masks/255.                # scale masks to {0, 1}

    train_imgs = train_imgs[:,:,9:574,:]   # cut bottom and top so the images are 565x565
    train_masks = train_masks[:,:,9:574,:]
    data_consistency_check(train_imgs, train_masks)

    # check masks are within 0-1
    assert(np.min(train_masks)==0 and np.max(train_masks)==1)

    print("\ntrain images/masks shape:")
    print(train_imgs.shape)
    print("train images range (min-max): " + str(np.min(train_imgs)) + ' - ' + str(np.max(train_imgs)))
    print("train masks are within 0-1\n")

    # extract the TRAINING patches from the full images
    patches_imgs_train, patches_masks_train = extract_random(train_imgs, train_masks, patch_height, patch_width, N_subimgs, inside_FOV)
    data_consistency_check(patches_imgs_train, patches_masks_train)

    print("\ntrain PATCHES images/masks shape:")
    print(patches_imgs_train.shape)
    print("train PATCHES images range (min-max): " + str(np.min(patches_imgs_train)) + ' - ' + str(np.max(patches_imgs_train)))

    return patches_imgs_train, patches_masks_train

This is the code that loads the data and extracts the random patches.
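extract_random itself is not shown above. As a rough illustration of what it does, here is a minimal, hypothetical sketch of uniform random patch extraction (simplified: the real helper can also restrict patch centers to the field of view via inside_FOV):

import numpy as np

def extract_random_sketch(imgs, masks, patch_h, patch_w, n_patches):
    # imgs, masks: (N, C, H, W); n_patches assumed divisible by N
    n_imgs, _, img_h, img_w = imgs.shape
    patches = np.empty((n_patches, imgs.shape[1], patch_h, patch_w))
    patch_masks = np.empty((n_patches, masks.shape[1], patch_h, patch_w))
    per_img = n_patches // n_imgs  # spread the patches evenly over the images
    k = 0
    for i in range(n_imgs):
        for _ in range(per_img):
            # pick a random patch center far enough from the border
            x = np.random.randint(patch_w // 2, img_w - patch_w // 2)
            y = np.random.randint(patch_h // 2, img_h - patch_h // 2)
            patches[k] = imgs[i, :, y - patch_h//2:y + patch_h//2,
                                     x - patch_w//2:x + patch_w//2]
            patch_masks[k] = masks[i, :, y - patch_h//2:y + patch_h//2,
                                          x - patch_w//2:x + patch_w//2]
            k += 1
    return patches, patch_masks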

import numpy as np
import cv2

# plain (global) histogram equalization over each whole image
def histo_equalized(imgs):
    assert (len(imgs.shape)==4)  # 4D arrays
    assert (imgs.shape[1]==1)    # check the channel is 1
    imgs_equalized = np.empty(imgs.shape)
    for i in range(imgs.shape[0]):
        imgs_equalized[i,0] = cv2.equalizeHist(np.array(imgs[i,0], dtype=np.uint8))
    return imgs_equalized


# CLAHE (Contrast Limited Adaptive Histogram Equalization)
# The image is divided into small blocks called "tiles" (8x8 by default in
# OpenCV) and each block is histogram-equalized separately, so each histogram
# is confined to a small region (unless there is noise, which would be
# amplified). To avoid this, contrast limiting is applied: any histogram bin
# above the specified contrast limit (40 by default in OpenCV) is clipped and
# its pixels distributed uniformly to the other bins before equalization.
# Finally, bilinear interpolation removes artifacts at the tile borders.
def clahe_equalized(imgs):
    assert (len(imgs.shape)==4)  # 4D arrays
    assert (imgs.shape[1]==1)    # check the channel is 1
    # create a CLAHE object (arguments are optional)
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8,8))
    imgs_equalized = np.empty(imgs.shape)
    for i in range(imgs.shape[0]):
        imgs_equalized[i,0] = clahe.apply(np.array(imgs[i,0], dtype=np.uint8))
    return imgs_equalized


# ===== normalize over the whole dataset, then rescale each image to [0, 255]
def dataset_normalized(imgs):
    assert (len(imgs.shape)==4)  # 4D arrays
    assert (imgs.shape[1]==1)    # check the channel is 1
    imgs_std = np.std(imgs)
    imgs_mean = np.mean(imgs)
    imgs_normalized = (imgs - imgs_mean) / imgs_std
    for i in range(imgs.shape[0]):
        imgs_normalized[i] = ((imgs_normalized[i] - np.min(imgs_normalized[i])) /
                              (np.max(imgs_normalized[i]) - np.min(imgs_normalized[i]))) * 255
    return imgs_normalized


def adjust_gamma(imgs, gamma=1.0):
    assert (len(imgs.shape)==4)  # 4D arrays
    assert (imgs.shape[1]==1)    # check the channel is 1
    # build a lookup table mapping pixel values [0, 255] to their
    # gamma-adjusted values
    invGamma = 1.0 / gamma
    table = np.array([((i / 255.0) ** invGamma) * 255 for i in np.arange(0, 256)]).astype("uint8")
    # apply gamma correction using the lookup table
    new_imgs = np.empty(imgs.shape)
    for i in range(imgs.shape[0]):
        new_imgs[i,0] = cv2.LUT(np.array(imgs[i,0], dtype=np.uint8), table)
    return new_imgs

These are the preprocessing functions for the four steps listed above (histo_equalized is a plain global alternative that the repo defines alongside CLAHE).
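The first step, gray-scale conversion, is not shown above. Here is a minimal sketch of it, together with one way the four steps could be chained into a single routine using the functions defined above (the ordering and the gamma value of 1.2 follow retina-unet's my_PreProc, but treat this as an illustrative sketch, not the repo's exact implementation):

import numpy as np

def rgb2gray_batch(imgs):
    # (N, 3, H, W) -> (N, 1, H, W) using standard luminance weights
    assert len(imgs.shape) == 4 and imgs.shape[1] == 3
    gray = imgs[:,0]*0.299 + imgs[:,1]*0.587 + imgs[:,2]*0.114
    return gray[:, np.newaxis]

def preprocess(imgs):
    # 1. gray-scale conversion   2. standardization
    # 3. CLAHE                   4. gamma adjustment
    imgs = rgb2gray_batch(imgs)
    imgs = dataset_normalized(imgs)
    imgs = clahe_equalized(imgs)
    imgs = adjust_gamma(imgs, gamma=1.2)
    return imgs / 255.  # scale back to [0, 1] for training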

Running the Program

With the network and data pipeline in place, we can run training. After 200 epochs, the accuracy reaches about 95%. The next step is evaluation on the remaining test data, where the final area under the ROC curve reaches 97.6%.

[Figures: training results and the ROC curve.]
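To make the pipeline concrete, here is a minimal training sketch assuming the functions above. The HDF5 paths, patch count, and batch size are hypothetical placeholders, and masks_to_targets is a simplified stand-in for the repo's masks_Unet helper, which flattens the patch masks into the (H*W, 2) one-hot layout matching the model's softmax output:

import numpy as np

def masks_to_targets(masks):
    # (N, 1, H, W) binary masks -> (N, H*W, 2) one-hot targets
    n, _, h, w = masks.shape
    flat = masks.reshape(n, h * w)
    targets = np.zeros((n, h * w, 2))
    targets[:, :, 0] = 1 - flat  # background class
    targets[:, :, 1] = flat      # vessel class
    return targets

# hypothetical HDF5 paths; adjust to wherever the DRIVE data was packed
patches_imgs_train, patches_masks_train = get_data_training(
    'DRIVE_dataset_imgs_train.hdf5',
    'DRIVE_dataset_groundTruth_train.hdf5',
    patch_height=48, patch_width=48,
    N_subimgs=190000, inside_FOV=False)

model = get_unet(n_ch=1, patch_height=48, patch_width=48)
model.fit(patches_imgs_train, masks_to_targets(patches_masks_train),
          epochs=200, batch_size=32, validation_split=0.1,
          shuffle=True, verbose=2)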