Annotation Dataset Processing: Creating VOC-Format XML Annotation Files with Python

阿新 • Published: 2022-03-11

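Introduction:

I have been working on computer vision algorithms recently, so here are a few simple scripts for processing video and images.

Each script includes print statements for debugging, which makes it easy to adapt.

A Python function that generates VOC-format XML: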

import os
import xml.dom.minidom


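def write_xml(folder: str, img_name: str, path: str, img_width: int, img_height: int, tag_num: int, tag_name: str, box_list: list):
    '''
    Generate a VOC annotation XML file.
    :param folder: folder name
    :param img_name: image file name, including its extension (written to <filename>)
    :param path: image path (written to <path>)
    :param img_width: image width in pixels
    :param img_height: image height in pixels
    :param tag_num: number of annotation boxes in the image
    :param tag_name: annotation label (the same label is used for every box)
    :param box_list: box coordinates, in the form [[xmin1, ymin1, xmax1, ymax1], [xmin2, ymin2, xmax2, ymax2], ...]
    :return: None; writes a standard VOC-format .xml file named after img_name, with the extension replaced by ".xml"
    '''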
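    # Create the DOM document object
    doc = xml.dom.minidom.Document()

    # Create the root node "annotation" and attach it to the document
    root_node = doc.createElement("annotation")
    doc.appendChild(root_node)

    # Create the child nodes and attach each one to the root node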
    folder_node = doc.createElement("folder")
    folder_value = doc.createTextNode(folder)
    folder_node.appendChild(folder_value)
    root_node.appendChild(folder_node)

    filename_node = doc.createElement("filename")
    filename_value = doc.createTextNode(img_name)
    filename_node.appendChild(filename_value)
    root_node.appendChild(filename_node)

    path_node = doc.createElement("path")
    path_value = doc.createTextNode(path)
    path_node.appendChild(path_value)
    root_node.appendChild(path_node)

    source_node = doc.createElement("source")
    database_node = doc.createElement("database")
    database_node.appendChild(doc.createTextNode("Unknown"))
    source_node.appendChild(database_node)
    root_node.appendChild(source_node)

    size_node = doc.createElement("size")
    for item, value in zip(["width", "height", "depth"], [img_width, img_height, 3]):
        elem = doc.createElement(item)
        elem.appendChild(doc.createTextNode(str(value)))
        size_node.appendChild(elem)
    root_node.appendChild(size_node)

    seg_node = doc.createElement("segmented")
    seg_node.appendChild(doc.createTextNode(str(0)))
    root_node.appendChild(seg_node)

    for i in range(tag_num):
        obj_node = doc.createElement("object")
        name_node = doc.createElement("name")
        name_node.appendChild(doc.createTextNode(tag_name))
        obj_node.appendChild(name_node)

        pose_node = doc.createElement("pose")
        pose_node.appendChild(doc.createTextNode("Unspecified"))
        obj_node.appendChild(pose_node)

        trun_node = doc.createElement("truncated")
        trun_node.appendChild(doc.createTextNode(str(0)))
        obj_node.appendChild(trun_node)

        diff_node = doc.createElement("difficult")
        diff_node.appendChild(doc.createTextNode(str(0)))
        obj_node.appendChild(diff_node)

        bndbox_node = doc.createElement("bndbox")
        for item, value in zip(["xmin", "ymin", "xmax", "ymax"], box_list[i]):
            elem = doc.createElement(item)
            elem.appendChild(doc.createTextNode(str(value)))
            bndbox_node.appendChild(elem)
        obj_node.appendChild(bndbox_node)
        root_node.appendChild(obj_node)

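    # Name the output after the image, replacing its extension with ".xml".
    # os.path.splitext also copes with names that contain no dot or several dots.
    xml_name = os.path.splitext(img_name)[0] + ".xml"
    with open(xml_name, "w", encoding="utf-8") as f:
        # writexml() arguments: the target file object, the indent for the root node,
        # the additional indent for each child level, the newline string, and the
        # encoding declared in the XML header.
        doc.writexml(f, indent='', addindent='\t', newl='\n', encoding="utf-8")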
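
To check what writexml() produces with these indentation arguments, the output can be written to an in-memory buffer and parsed back. The following is only a minimal sketch: build_bndbox_xml and read_bndbox are hypothetical helpers that are not part of the original script, the sample coordinates are made up, and the encoding argument is left out to keep the example short.

```python
import io
import xml.dom.minidom
import xml.etree.ElementTree as ET


def build_bndbox_xml(box):
    """Serialize one [xmin, ymin, xmax, ymax] box with the same minidom calls as write_xml()."""
    doc = xml.dom.minidom.Document()
    bndbox_node = doc.createElement("bndbox")
    doc.appendChild(bndbox_node)
    for item, value in zip(["xmin", "ymin", "xmax", "ymax"], box):
        elem = doc.createElement(item)
        elem.appendChild(doc.createTextNode(str(value)))
        bndbox_node.appendChild(elem)
    buffer = io.StringIO()  # in-memory stand-in for the output file
    doc.writexml(buffer, indent='', addindent='\t', newl='\n')
    return buffer.getvalue()


def read_bndbox(xml_text):
    """Parse the serialized box back into a list of ints."""
    root = ET.fromstring(xml_text)
    return [int(root.findtext(tag)) for tag in ("xmin", "ymin", "xmax", "ymax")]


xml_text = build_bndbox_xml([10, 20, 110, 220])
print(xml_text)               # the indented XML text
print(read_bndbox(xml_text))  # the coordinates recovered from it
```

Reading the file back this way is a quick sanity check that the coordinates survive the round trip before the annotations are handed to a training pipeline.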

Usage example (with lp-annot.idl as the input):

if __name__ == '__main__':
    # Use a context manager so the file is closed after reading
    with open("./lp-annot.idl", "r") as f:
        lines = f.readlines()  # read the whole file and return it as a list of lines
    for line in lines:
        try:
            # Split on ":-1", which follows each box; the last piece is whatever trails the final box
            line_list = line.split(":-1")
            # print(f'line_list: {line_list}\nlen(line_list):{len(line_list)}\n')
            # The first piece holds the quoted image path and the first box
            temp_line0_file_name = line_list[0].split(':')[0].split('"')[1].split('/')[-1]
            temp_line0_tag = line_list[0].split(':')[1]
            print(f'temp_line0_file_name: {temp_line0_file_name}       temp_line0_tag: {temp_line0_tag}')
            # Collect the box strings, dropping the trailing piece after the last ":-1"
            new_line_list = [temp_line0_tag] + line_list[1:-1]
            print(f'new_line_list:  {new_line_list}')
            box_list = []
            for box_str in new_line_list:
                print(f'box_str: {box_str}')
                # "(x1, y1, x2, y2)" -> ["x1", "y1", "x2", "y2"], with surrounding whitespace removed
                box = [v.strip() for v in box_str.split("(")[1].split(")")[0].split(',')]
                print(f"box: {box}")
                x1, y1, x2, y2 = box[0], box[1], box[2], box[3]
                # if (int(x2) - int(x1)) >= (50*(float(img_width)/640.0)) and (int(y2) - int(y1)) >= (50*(float(img_height)/640.0)):
                # Keep only boxes that are at least 40 px wide and 70 px tall
                if (int(x2) - int(x1)) >= 40 and (int(y2) - int(y1)) >= 70:
                    box_list.append(box)
            print(f"box_list: {box_list}")
            # Skip images with no box left after filtering
            if len(box_list) == 0:
                continue
            write_xml(folder='VOC2014_instance/person', img_name='lp-annot_' + temp_line0_file_name,
                      path=temp_line0_file_name, img_width=640, img_height=480, tag_num=len(box_list),
                      tag_name='person', box_list=box_list)
        except Exception as e:
            print("ERROR, e----------------------------------------------\n:", e, "\n--------------------------------")
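
The chained split() calls above are tied closely to the line layout. As an alternative, the same extraction could be sketched with a regular expression. This is an assumption-laden sketch, not the author's method: the line layout is inferred from the parsing code above, parse_idl_line is a hypothetical helper, the sample line is made up, and only integer coordinates are handled. Unlike the original loop, it matches every parenthesized four-number group rather than only those followed by ":-1".

```python
import re

# Assumed line layout, inferred from the string splitting above (values are made up):
#   "dir/name.png": (x1, y1, x2, y2):-1, (x1, y1, x2, y2):-1;
BOX_PATTERN = re.compile(r"\(\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*\)")


def parse_idl_line(line, min_width=40, min_height=70):
    """Return (file_name, boxes), keeping only boxes of at least min_width x min_height pixels."""
    file_match = re.search(r'"([^"]+)"', line)
    if file_match is None:
        return None, []
    # Keep only the last path component, as the original script does
    file_name = file_match.group(1).split("/")[-1]
    boxes = []
    for groups in BOX_PATTERN.findall(line):
        x1, y1, x2, y2 = map(int, groups)
        if x2 - x1 >= min_width and y2 - y1 >= min_height:
            boxes.append([x1, y1, x2, y2])
    return file_name, boxes


sample = '"left/image_00000001_0.png": (212, 204, 232, 261):-1, (300, 100, 360, 250):-1;'
print(parse_idl_line(sample))
```

The size thresholds are exposed as parameters so they can be adjusted per dataset instead of being edited inside the loop.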
