語義分割後處理

阿新 • • 發佈：2020-12-27

對於語義分割網路，其輸出為（b, h, w, classes）,對索引求最大值，得到維度為（b, h, w, 1）

相對於得到一個灰度圖，其亮度值為類別index。因為類別值為[1, num_classes]，如果對輸出

結果直接顯示，會的到一副純黑的圖。

所以需要進行預測結果視覺化

將預測結果轉化為RGB影象

首先建立預測類別和相應rgb顏色的對映

Label = namedtuple('Label', [
    'name', 
    'trainId',  
    'category', 
    'categoryId', 
    'hasInstances',  
    'ignoreInEval', 
    'color',  
])

labels = [
    #     name                     id trainId  category     catId hasInstances ignoreInEval color
    Label('unlabeled',              0,  255,    'void',         0, False,       True,       (0,  0,  0)),
    Label('ego vehicle',            1,  255,    'void',         0, False,       True,       (0,  0,  0)),
    Label('rectification border',   2,  255,    'void',         0, False,       True,       (0,  0,  0)),
    Label('out of roi',             3,  255,    'void',         0, False,       True,       (0,  0,  0)),
    Label('static',                 4,  255,    'void',         0, False,       True,       (0,  0,  0)),
    Label('dynamic',                5,  255,    'void',         0, False,       True,       (111, 74,  0)),
    Label('ground',                 6,  255,    'void',         0, False,       True,       (81,  0, 81)),
    Label('road',                   7,  0,      'flat',         1, False,       False,      (128, 64, 128)),
    Label('sidewalk',               8,  1,      'flat',         1, False,       False,      (244, 35, 232)),
    Label('parking',                9,  255,    'flat',         1, False,       True,       (250, 170, 160)),
    Label('rail track',             10, 255,    'flat',         1, False,       True,       (230, 150, 140)),
    Label('building',               11, 2,      'construction', 2, False,       False,      (70, 70, 70)),
    Label('wall',                   12, 3,      'construction', 2, False,       False,      (102, 102, 156)),
    Label('fence',                  13, 4,      'construction', 2, False,       False,      (190, 153, 153)),
    Label('guard rail',             14, 255,    'construction', 2, False,       True,       (180, 165, 180)),
    Label('bridge',                 15, 255,    'construction', 2, False,       True,       (150, 100, 100)),
    Label('tunnel',                 16, 255,    'construction', 2, False,       True,       (150, 120, 90)),
    Label('pole',                   17, 5,      'object',       3, False,       False,      (153, 153, 153)),
    Label('polegroup',              18, 255,    'object',       3, False,       True,       (153, 153, 153)),
    Label('traffic light',          19, 6,      'object',       3, False,       False,      (250, 170, 30)),
    Label('traffic sign',           20, 7,      'object',       3, False,       False,      (220, 220,  0)),
    Label('vegetation',             21, 8,      'nature',       4, False,       False,      (107, 142, 35)),
    Label('terrain',                22, 9,      'nature',       4, False,       False,      (152, 251, 152)),
    Label('sky',                    23, 10,     'sky',          5, False,       False,      (70, 130, 180)),
    Label('person',                 24, 11,     'human',        6, True,        False,      (220, 20, 60)),
    Label('rider',                  25, 12,     'human',        6, True,        False,      (255,  0,  0)),
    Label('car',                    26, 13,     'vehicle',      7, True,        False,      (0,  0, 142)),
    Label('truck',                  27, 14,     'vehicle',      7, True,        False,      (0,  0, 70)),
    Label('bus',                    28, 15,     'vehicle',      7, True,        False,      (0, 60, 100)),
    Label('caravan',                29, 255,    'vehicle',      7, True,        True,       (0,  0, 90)),
    Label('trailer',                30, 255,    'vehicle',      7, True,        True,       (0,  0, 110)),
    Label('train',                  31, 16,     'vehicle',      7, True,        False,      (0, 80, 100)),
    Label('motorcycle',             32, 17,     'vehicle',      7, True,        False,      (0,  0, 230)),
    Label('bicycle',                33, 18,     'vehicle',      7, True,        False,      (119, 11, 32)),
    Label('license plate',          -1, -1,     'vehicle',      7, False,       True,       (0,  0, 142)),
]

trainId2label = {label.trainId: label for label in reversed(labels)}
// {-1： Label（）， 18：Label（），。。。。}

生成一個與原圖大小一樣的三維矩陣

colored_image = np.zeros(
    (class_id_image.shape[0], class_id_image.shape[1], 3), np.uint8)

將對應位置填補為類別對應的RGB

for row in range(class_id_image.shape[0]):
    for col in range(class_id_image.shape[1]):
        try:
            colored_image[row, col, :] = class_id_to_rgb_map[
                int(class_id_image[row, col])].color



所以全過程為

probs = pspnet.predict(img)
cm = np.argmax(probs, axis=2)

colored_class_image = color_class_image(cm)

alpha_blended = 0.5 * colored_class_image + 0.5 * img
與原圖混合

補充：還可以用PIL內建調色盤方法，

new_mask=PIL.Image.fromarray(mask.astype(np.uint8)).convert('P') new_mask.putpalette(palette) fromPILimportImage Image.open('PennFudanPed/PNGImages/FudanPed00001.png') mask=Image.open('PennFudanPed/PedMasks/FudanPed00001_mask.png') mask.putpalette([ 0,0,0,#blackbackground 255,0,0,#index1isred 255,255,0,#index2isyellow 255,153,0,#index3isorange ])

語義分割後處理

對於語義分割網路，其輸出為（b, h, w, classes）,對索引求最大值，得到維度為（b, h, w, 1）

DeepLearning-語義分割資料處理例項

資料集：Pascal VOC2012，參考材料：動手學深度學習以下示例實現了對資料的預讀取，處理等操作

基於Android studio3.6的JNI教程之ncnn之語義分割ENet

程式碼連結： https://github.com/watersink/enet-as-linux 本程式碼可以在模擬器下進行跑。

Keras:Unet網路實現多類語義分割方式

1 介紹 U-Net最初是用來對醫學影象的語義分割，後來也有人將其應用於其他領域。但大多還是用來進行二分類，即將原始影象分成兩個灰度級或者色度，依次找到影象中感興趣的目標部分。

PyTorch中的MIT ADE20K資料集的語義分割

PyTorch中的MIT ADE20K資料集的語義分割程式碼地址：https://github.com/CSAILVision/semantic-segmentation-pytorch

Unity3D+Post Processing Stack V2自定義後處理效果研究

背景眾所周知，Unity3D支援自定義後處理效果，實現過程有三步：新增著色器，在著色器裡書寫後處理程式碼；

SENet&語義分割相關知識學習

SENet&語義分割相關知識學習對上一次學習的 HybridSN 高光譜分類網路進行優化改進；SENet網路學習和實現；學習視訊北京大學李夏的《語義分割中的自注意力機制和低秩重重建》，南開大學程明明教授的《影象語義

後處理邏輯整理

1. 文書處理(WordPro) -|編碼轉換 -|對映引數儲存 -|文字轉音素序列 -|轉換中間計算

關於unity 後處理檔案（volumn）沒有效果的問題

首先，需要保證volumn檔案在預設層，或者新建一個專屬的層接下來是最重要的，攝像機中的這兩個設定要設定好，postprocessing勾上，下方的volumnmask要選擇volumn檔案所在的層

語義分割丨DeepLab系列總結「v1、v2、v3、v3+」

花了點時間梳理了一下DeepLab系列的工作，主要關注每篇工作的背景和貢獻，理清它們之間的聯絡，而實驗和部分細節並沒有過多介紹，請見諒。

【Unity遊戲開發】升級Unity2019後，資源管線後處理採坑記錄

一、引子　　最近我們的專案由Unity2018升級到了Unity2019.4，但是突然間發現FBX資源匯入時的後處理不生效了。經過一系列的實驗，發現了升級到Unity2019以後，資源管線後處理中的一些坑，今天馬三來和大家分享一下這

【論文彙總】 ECCV 2020 語義分割paper彙總

語義分割 segmentation [email protected] 2020 ECCV 2020語義分割文章總結，文章下載連結。

關於pytorch語義分割二分類問題的兩種做法

形式1：輸出為單通道分析即網路的輸出 output 為 [batch_size, 1, height, width] 形狀。其中 batch_szie 為批量大小，1 表示輸出一個通道，height 和 width 與輸入影象的高和寬保持一致。

影象分類，目標檢測，語義分割，例項分割，全景分割聯絡與區別

一、影象分類識別影象中存在的內容，如下圖，有人（person）、樹（tree）、草地（grass）、天空（sky），只知道有沒有這一類東西就行。

語義分割單通道和多通道輸出交叉熵損失函式的計算問題

摘要本文驗證了語義分割任務下，單通道輸出和多通道輸出時，使用交叉熵計算損失值的細節問題。對比驗證了使用簡單的函式和自帶損失函式的結果，通過驗證，進一步加強了對交叉熵的理解。

ue 新增後處理

1. shader 類 2.pass 類 class FPostProcessmgTestVS_ES2 : public FGlobalShader { DECLARE_SHADER_TYPE(FPostProcessmgTestVS_ES2, Global);

url中特殊字元被轉義成編碼後處理

技術標籤：JAVA 開發時有時服務端返回的json中包含url，url中可能含有一些特殊字元，這些特殊字元在傳輸的過程中可能會被轉義成編碼。這時候我們拿到手裡要如何轉換回去呢，先看下那些字元可能會被編碼

Mysql分割字串並對分割後的資料進行查詢翻譯

技術標籤：資料同步Mysqlmysqlelasticsearch 最近在處理ElasticSearch的資料同步。有一個需求要在sql裡對字串進行分割並對其進行翻譯。需要同步的表裡的資料結構是這樣子的，而mysql的函式是沒有split的，只有S

小菊的語義分割1——資料集的製作(一): ISPRS_Potsdam遙感影象資料集

技術標籤：小菊的語義分割語義分割ISPRS資料集製作遙感影象小菊的語義分割——對ISPRS資料集中的遙感影象進行切分，製作自己的訓練資料集

語義分割損失函式

1. 交叉熵損失語義分割時相當於對每個畫素進行分類，所以實際是一個分類任務

語義分割後處理

相關推薦