Organizing and Annotating Data Samples for Semantic Segmentation, with Palette Code

阿新 • Published: 2018-11-01

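In semantic segmentation, converting colour annotation images into grayscale images that contain only the corresponding label values, by way of a palette, involves a few fiddly details, so I am sharing the code I wrote for it. As the figure below shows, the code converts each colour annotation into a label-only ground-truth image according to a predefined palette.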
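As a rough illustration of what a predefined palette can look like, the sketch below generates a colour for each label index by spreading the bits of the index across the R, G and B channels, which is the same bit pattern the getpallete function in the script uses. The helper name `label_to_color` is hypothetical and exists only for this example.

```python
def label_to_color(label):
    """Return the (r, g, b) colour assigned to a label index."""
    r = g = b = 0
    i = 0
    while label > 0:
        # take three bits of the label at a time and place them,
        # one per channel, starting at each channel's highest bit
        r |= ((label >> 0) & 1) << (7 - i)
        g |= ((label >> 1) & 1) << (7 - i)
        b |= ((label >> 2) & 1) << (7 - i)
        i += 1
        label >>= 3
    return r, g, b


# colours for the first four label indices
colors = [label_to_color(k) for k in range(4)]
print(colors)
```

Because the mapping is deterministic, the same table can be rebuilt at conversion time and used to turn each annotated colour back into its index.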

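The key part of the code is the myquantize function that I define. If you use the quantizetopalette function that comes with the Python PIL library, the converted label image ends up with small holes. This is probably a bug in quantizetopalette; I will analyse it in detail when I have time. The code includes both functions for comparison.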
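For reference, here is a minimal sketch of palette mapping done directly in NumPy, without PIL: each pixel simply receives the index of the nearest palette colour, with no dithering or error diffusion. Small speckles are the kind of artefact dithering tends to produce, but whether that explains the holes described above is an assumption, not something verified here. The function name `nearest_palette_index` and the three-colour palette are made up for the example.

```python
import numpy as np


def nearest_palette_index(rgb, palette):
    """Map an (H, W, 3) uint8 image to an (H, W) uint8 map of palette indices."""
    pixels = rgb.reshape(-1, 1, 3).astype(np.int32)
    colors = np.asarray(palette, dtype=np.int32).reshape(1, -1, 3)
    # squared Euclidean distance from every pixel to every palette colour
    dist = ((pixels - colors) ** 2).sum(axis=2)
    return dist.argmin(axis=1).astype(np.uint8).reshape(rgb.shape[:2])


# toy palette: index 0 = black, 1 = dark red, 2 = dark green
palette = np.array([[0, 0, 0], [128, 0, 0], [0, 128, 0]], dtype=np.uint8)
rgb = np.array([[[0, 0, 0], [128, 0, 0]],
                [[0, 128, 0], [120, 10, 5]]], dtype=np.uint8)
labels = nearest_palette_index(rgb, palette)
print(labels)
```

The distance matrix has one row per pixel and one column per palette colour, so for large images and a 256-colour palette it is worth processing the image in chunks.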

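In the figure, the left image is the original image to be segmented, the middle one is the colour label annotation, and the right one is the grayscale label image converted from it for training. The stray colours along the edges come from Photoshop's interpolation during annotation; in practice they can be set to 0, or set to 255 and treated as an ignore label. The full code follows: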

# -*- coding: utf-8 -*-

import sys
import os
import cv2
import shutil
import numpy as np
from PIL import Image

def getpallete(num_cls):
    """Build the colormap used to visualize the segmentation mask.

    Returns (mypallete, otherpallete) as flat [r, g, b, r, g, b, ...] lists:
    the 20 colors selected below, and the remaining 236 colors.
    The slices assume num_cls == 256.
    """
    n = num_cls
    pallete = [0] * (n * 3)
    for j in range(n):
        lab = j
        i = 0
        # spread the bits of the label over the R, G and B channels,
        # starting at the highest bit of each channel
        while lab > 0:
            pallete[j * 3 + 0] |= (((lab >> 0) & 1) << (7 - i))
            pallete[j * 3 + 1] |= (((lab >> 1) & 1) << (7 - i))
            pallete[j * 3 + 2] |= (((lab >> 2) & 1) << (7 - i))
            i += 1
            lab >>= 3
    # colors 0, 249-255, 106-108, 77-79 and 1-6 are placed first,
    # so they end up at palette indices 0-19
    mypallete = (pallete[0*3:1*3] + pallete[249*3:256*3] + pallete[106*3:109*3]
                 + pallete[77*3:80*3] + pallete[1*3:7*3])
    otherpallete = pallete[7*3:77*3] + pallete[80*3:106*3] + pallete[109*3:249*3]

    return mypallete, otherpallete

def quantizetopalette(silf, palette, dither=False):
    """Convert an RGB or L mode image to use a given P image's palette."""

    silf.load()

    # use palette from reference image
    palette.load()
    if palette.mode != "P":
        raise ValueError("bad mode for palette image")
    if silf.mode != "RGB" and silf.mode != "L":
        raise ValueError(
            "only RGB or L mode images can be quantized to a palette"
            )
    im = silf.im.convert("P", 1 if dither else 0, palette.im)
    # the 0 above means turn OFF dithering
    return silf._new(im)
    # return im

def myquantize(self, colors=256, method=None, kmeans=0, palette=None, dither=False):
    """
    Convert the image to 'P' mode with the specified number
    of colors.

    :param colors: The desired number of colors, <= 256
    :param method: 0 = median cut
                   1 = maximum coverage
                   2 = fast octree
                   3 = libimagequant
    :param kmeans: Integer
    :param palette: Quantize to the :py:class:`PIL.ImagingPalette` palette.
    :param dither: Whether to dither when quantizing to a palette
                   (off by default; this name was undefined in the original)
    :returns: A new image

    """

    self.load()

    if method is None:
        # defaults:
        method = 0
        if self.mode == 'RGBA':
            method = 2

    if self.mode == 'RGBA' and method not in (2, 3):
        # Caller specified an invalid mode.
        raise ValueError(
            'Fast Octree (method == 2) and libimagequant (method == 3) ' +
            'are the only valid methods for quantizing RGBA images')

    if palette:
        # use palette from reference image
        palette.load()
        if palette.mode != "P":
            raise ValueError("bad mode for palette image")
        if self.mode != "RGB" and self.mode != "L":
            raise ValueError(
                "only RGB or L mode images can be quantized to a palette"
                )
        # the earlier variant always passed 1 here, which turns dithering ON;
        # 0 turns it OFF
        im = self.im.convert("P", 1 if dither else 0, palette.im)
        return self._new(im)

    return self._new(self.im.quantize(colors, method, kmeans))

def run_automatting_file(inputfile, outputfile):
    mypallete, otherpallete = getpallete(256)
    # the label colors come first, so they get palette indices 0-19
    allpallete = mypallete + otherpallete
    if not inputfile.lower().endswith('.png'):
        return
    print('process image : ' + inputfile)
    img = cv2.imread(inputfile, cv2.IMREAD_UNCHANGED)
    if img is None or img.ndim != 3 or img.shape[2] != 4:
        # the alpha masking below needs a 4-channel (BGRA) PNG
        print(inputfile + ' is not a readable BGRA image, skipped')
        return

    # treat mostly transparent pixels as background
    alpha = img[:, :, 3]
    img[alpha < 128] = 0
    # OpenCV loads BGR(A); drop alpha and reverse the channels to get RGB
    img_rgb = np.ascontiguousarray(img[:, :, 2::-1])

    pil_im = Image.fromarray(img_rgb)

    # a dummy 'P' image whose only job is to carry the palette
    palimage = Image.new('P', (16, 16))
    palimage.putpalette(allpallete)

    # myquantize(pil_im, palette=palimage) can be swapped in here to compare
    pil_im_p = quantizetopalette(pil_im, palimage, dither=False)

    cv_im = np.array(pil_im_p)
    cv_im = resizeImage(cv_im, 1000, Image.NEAREST)
    # erode with the default 3x3 kernel (per-pixel minimum over the neighbourhood)
    cv_im_new = cv2.erode(cv_im, None)
    # indices above 19 are not label colors; set them to 0
    cv_im_new[cv_im_new > 19] = 0
    cv2.imwrite(outputfile, cv_im_new)

def resizeImage(image, resize_dim, resize_flag):
    """Shrink image so that its longer side is at most resize_dim pixels.

    resize_flag is a PIL resampling filter; Image.NEAREST keeps label
    maps free of interpolated values.
    """
    # numpy shape is (rows, cols), i.e. (height, width)
    rows, cols = image.shape[0], image.shape[1]
    max_resize_dim = float(resize_dim)
    if max(rows, cols) > max_resize_dim:
        ratio = max_resize_dim / max(rows, cols)
        image = Image.fromarray(np.uint8(image))
        # PIL expects the size as (width, height)
        image = image.resize((int(cols * ratio), int(rows * ratio)), resample=resize_flag)
        image = np.array(image)
    return image


def main(argv):
    if 1 == len(argv):
        # single-file mode: convert one annotation PNG
        inputfile = argv[0]
        outputfile = "testout.png"
        print('Input file is "{}"'.format(inputfile))
        run_automatting_file(inputfile, outputfile)
    elif 2 == len(argv):
        # directory mode: convert every PNG and copy its matching JPG
        inputfiledir = argv[0]
        outputfiledir = argv[1]
        print('Input dir is "{}"'.format(inputfiledir))
        print('Output dir is "{}"'.format(outputfiledir))
        for name in sorted(os.listdir(inputfiledir)):
            if not name.lower().endswith('.png'):
                continue
            pngpath = os.path.join(inputfiledir, name)
            # each annotation PNG needs a source JPG with the same base name
            jpgname = os.path.splitext(name)[0] + '.jpg'
            jpgpath = os.path.join(inputfiledir, jpgname)
            if not os.path.isfile(jpgpath):
                print(jpgpath + ' does not exist!')
                continue
            shutil.copy(jpgpath, os.path.join(outputfiledir, jpgname))
            run_automatting_file(pngpath, os.path.join(outputfiledir, name))
    else:
        print('usage: <inputfile> | <inputfiledir> <outputfiledir>')
        return

if __name__ == "__main__":
	main(sys.argv[1:])
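
The script sets every palette index above 19 to 0. The alternative mentioned earlier, marking those pixels as 255 so that training can skip them, might look like the sketch below. The function name `mark_ignore` is hypothetical, and the default of 20 classes is an assumption taken from the `> 19` threshold in the script.

```python
import numpy as np


def mark_ignore(label_map, num_classes=20, ignore_label=255):
    """Return a copy of label_map with all out-of-range indices set to ignore_label."""
    out = label_map.copy()
    out[out >= num_classes] = ignore_label
    return out


# indices 0-19 are real labels; anything above comes from stray edge colours
labels = np.array([[0, 5, 19], [20, 137, 255]], dtype=np.uint8)
cleaned = mark_ignore(labels)
print(cleaned)
```

Whether 0 or 255 is the better choice depends on the training framework: 255 only helps if the loss is configured to skip that value.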
