CV(一)之自定義資料集

阿新 • • 發佈：2021-10-24

本文以 PASCAL VOC2012 資料集為例子進行說明。(下載地址：PASCAL VOC2012)

Pytorch 自定義資料集見文件：TorchVision Object Detection Finetuning Tutorial

本文將以PASCAL VOC為基礎自定義一個數據集VOCDataset，並隨機選取五張圖片給將其對應的標註轉化為矩形框畫在圖片上。

生成自定義資料集

一些需要匯入的基本庫

import os
import torch
import json
from torch.utils.data import Dataset
from PIL import Image
from os import path
from lxml import etree

# 類別資料
class_dict = {
    "aeroplane": 1,
    "bicycle": 2,
    "bird": 3,
    "boat": 4,
    "bottle": 5,
    "bus": 6,
    "car": 7,
    "cat": 8,
    "chair": 9,
    "cow": 10,
    "diningtable": 11,
    "dog": 12,
    "horse": 13,
    "motorbike": 14,
    "person": 15,
    "pottedplant": 16,
    "sheep": 17,
    "sofa": 18,
    "train": 19,
    "tvmonitor": 20
}

按照文件要求，在VOCDataset中實現三個方法__len__、__getitem__、以及get_height_and_width。

初始化 VOCDataset 類

建構函式定義如下

'''
voc_root: voc 資料集的根目錄
year: 哪一個年份的資料集
transforms: 資料預處理
text_name: train.txt or val.txt 該txt檔案在資料集的 VOCdevkit\VOC2012\ImageSets\Main 資料夾下
'''
def __init__(self, voc_root, year='2012', transforms=None, text_name='train.txt'):

在建構函式中，我們主要完成以下三個功能

設定圖片路徑image_root和標註路徑anno_root

設定此次要訓練的樣本所有標註檔案路徑列表xml_list

設定要檢測的目標類別資訊class_dict

設定圖片路徑`image_root`和標註路徑`anno_root`

        # 設定資料集、圖片、標註的根目錄
        self.root = path.join(voc_root, 'VOCdevkit', f'VOC{year}')
        self.image_root = path.join(self.root, 'JPEGImages')
        self.anno_root = path.join(self.root, 'Annotations')

設定此次要訓練的樣本所有標註檔案路徑列表`xml_list`

        # 根據 text_name 拿到對應的標註xml檔案路徑
        text_path = path.join(self.root, 'ImageSets','Main', text_name)
        # 讀取txt檔案的每一行並生成xml標註檔案路徑存放在xml_list中
        with open(text_path) as file_reader:
            self.xml_list = [
                path.join(self.anno_root, f'{line.strip()}.xml')
                for line in file_reader.readlines() if len(line.strip()) > 0
            ]

設定要檢測的目標類別資訊`class_dict`

        self.class_dict = class_dict

一般使用 0 來表示當前類別是背景

獲取所有樣例條數

    def __len__(self):
        return len(self.xml_list)

樣本的條數即標註檔案列表長度

根據索引獲取指定樣本

函式定義如下

    def __getitem__(self, idx):

傳入的即為樣本的索引值，其取值範圍為 0 ~ len(xml_list)

獲取指定樣本需要分為如下兩大步

獲取圖片

獲取圖片資訊（標註資訊、索引、區域面積等）

獲取圖片

首先我們需要根據索引拿到對應標註資訊，並將其轉化為json格式
定義一個獲取json格式的annotation的方法

    def get_annotation(self, idx):
        xml_path = self.xml_list[idx]
        assert path.exists(xml_path), f'file {xml_path} not found'

        xml_reader = open(xml_path)
        xml_text = xml_reader.read()
        xml = etree.fromstring(xml_text)
        annotation = parse_xml_to_dict(xml)['annotation']

xml格式轉化為json格式函式如下

def parse_xml_to_dict(xml):
    if len(xml) == 0:
        return {xml.tag: xml.text}
    
    result = {}
    for child in xml:
        child_result = parse_xml_to_dict(child)
        if child.tag != 'object':
            result[child.tag] = child_result[child.tag]
        else: # 一張圖片中可能標註有多個 object
            if child.tag not in result:
                result[child.tag] = []
            result[child.tag].append(child_result[child.tag])
    
    return {xml.tag: result}

獲取annotation

        annotation = self.get_annotation(idx)

然後我們就可以從annotation中拿到檔名稱並獲取到檔案

        image_path = path.join(self.image_root, annotation['filename'])
        image = Image.open(image_path)

獲取圖片資訊

宣告需要獲取的所有資訊

        # 生成 target
        target = {
            'boxes': [], # 標註的左上、右下座標(xmin, ymin, xmax, ymax)
            'labels': [],# 標註類別
            'image_id': [], # 圖片索引
            'area': [], # 含有目標區域的面積 (xmax-xmin) * (ymax-ymin)
            'iscrowd': [], # 是不是一堆密集的東西在一起
        }

便利所有的object


        for obj in annotation['object']:
            bndbox = obj['bndbox']
            xmin = float(bndbox['xmin'])
            ymin = float(bndbox['ymin'])
            xmax = float(bndbox['xmax'])
            ymax = float(bndbox['ymax'])
            target['boxes'].append([xmin, ymin, xmax, ymax]) # 設定有目標的座標資訊
            target['labels'].append(self.class_dict[obj['name']]) # 獲取對應的標籤
            target['area'].append((xmax - xmin) * (ymax - ymin)) # 計算面積

            # 使用 difficult(當前目標是否難以識別) 欄位來設定 iscrowd
            if 'difficult' in obj:
                target['iscrowd'].append(int(obj['difficult']))
            else:
                target['iscrowd'].append(0)

將所有資訊轉化為Tensor

        # Convert to tensor
        target['boxes'] = torch.as_tensor(target['boxes'])
        target['labels'] = torch.as_tensor(target['labels'])
        target['iscrowd'] = torch.as_tensor(target['iscrowd'])
        target['area'] = torch.as_tensor(target['area'])
        target['image_id'] = torch.tensor([idx])

如果有設定資料前處理器，則在返回資料前呼叫

        if self.transforms is not None:
            image = self.transforms(image)

返回圖片以及對應的資訊

        return image, target

根據索引獲取當前圖片的寬高

在標註資訊裡面含有圖片寬高資訊，所以可以很容易獲取到

    def get_height_and_width(self, idx):
        annotation = annotation = self.get_annotation(idx)
        # 從 annotation 中取出寬高並返回
        width = int(annotation['size']['width'])
        height = int(annotation['size']['height'])

        return height, width

以上我們就完成了資料集的定義，下面我們將使用例項程式碼來使用這個資料集

使用自定義資料集並畫上標註框

匯入一些基本庫

import random
import matplotlib.pyplot as plt
import torchvision.transforms as ts
from draw_box_utils import draw_box

生成類別資料，將 k、v 互換，便於查詢

category_index = {}

category_index = {
    v: k
    for k, v in class_dict.items()
}

定義transformer，將資料轉化為Tensor

data_transform = ts.Compose([ts.ToTensor()])

由於ToTensor會將資料標準化，為了程式碼簡潔，這裡不使用

拿到資料集並將目標框以及類別畫出來

train_data_set = VOCDataset(os.getcwd(), '2012', None, 'train.txt')

for index in random.sample(range(0, len(train_data_set)), k=5):
    image, target = train_data_set[index]
    image = draw_bounding_boxes(
        np.array(image),
        target['boxes'],
        target['labels'],
    )
    plt.imshow(image)
    plt.show()

畫目標框draw_bounding_boxes程式碼如下(參考程式碼： vision/utils.py at main · pytorch/vision (github.com))


def draw_bounding_boxes(
    image,
    boxes: torch.Tensor,
    labels: Optional[List[str]] = None
) -> torch.Tensor:
    img_to_draw = Image.fromarray(image)
    img_boxes = boxes.to(torch.int64).tolist()
    draw = ImageDraw.Draw(img_to_draw)

    for i, bbox in enumerate(img_boxes):
        draw.rectangle(bbox, width=2, outline='red')
        margin = 2
        draw.text((bbox[0] + margin, bbox[1] + margin),  category_index[labels[i] - 1], fill='red')


    return np.array(img_to_draw)

這樣就完成了整個流程了！

執行與測試

可見執行結果正確！

CV(一)之自定義資料集

本文以 PASCAL VOC2012 資料集為例子進行說明。(下載地址：PASCAL VOC2012) Pytorch 自定義資料集見文件：TorchVision Object Detection Finetuning Tutorial

pytorch學習教程之自定義資料集

自定義資料集在訓練深度學習模型之前，樣本集的製作非常重要。在pytorch中，提供了一些介面和類，方便我們定義自己的資料集合，下面完整的試驗自定義樣本集的整個流程。

Pytorch 神經網路—自定義資料集上實現教程

第一步、匯入需要的包 import os import scipy.io as sio import numpy as np import torch import torch.nn as nn

在PyTorch中構建高效的自定義資料集

學習Dataset類的來龍去脈，使用乾淨的程式碼結構，同時最大限度地減少在訓練期間管理大量資料的麻煩。

如何在自定義資料集上訓練YOLOv5

作者|Jacob Solawetz, Joseph Nelson 編譯|Flin 來源|blog YOLO系列的目標檢測模型隨著YOLOv5的引入變得越來越強大。在這篇文章中，我們將介紹如何訓練YOLOv5為你的定製用例識別定製物件。

pytorch載入語音類自定義資料集的方法教程

前言　　pytorch對一下常用的公開資料集有很方便的API介面，但是當我們需要使用自己的資料集訓練神經網路時，就需要自定義資料集，在pytorch中，提供了一些類，方便我們定義自己的資料集合

Scaled-YOLOv4 快速開始，訓練自定義資料集

程式碼： https://github.com/ikuokuo/start-scaled-yolov4 Scaled-YOLOv4 程式碼: https://github.com/WongKinYiu/ScaledYOLOv4

mmdetection2.6自定義資料集pipeline

技術標籤：pythonjavaspring bootsparkvue 1. data pipeline的基本使用 mmdetection的資料讀取方式分為兩部分，第一部分為資料集，第二部分為data pipeline，通常資料集定義如何處理標註資訊，而pipeline定義處

PaddlePaddle飛槳《高層API助你快速上手深度學習》『深度學習7日打卡營』--自定義資料集OCEMOTION–中文情感分類

技術標籤：筆記paddlepaddlepython 賽題背景自從2017年具有劃時代意義的Transformer模型問世以來，短短兩年多的時間內，如雨後春筍般的出現了大量的預訓練模型，比如：Bert，Albert，ELECTRA，RoBERta，T5，GP

scikit基礎與機器學習入門（4） sklearn模組資料集的使用——自帶資料集和自定義資料集

API通用方法型別獲取方式自帶的小資料集 sklearn.datasets.load_ 線上下載的資料集 sklearn.datasets.fetch_

vuex-persistence資料持久化之自定義資料獲取【白名單】

vuex-persistence資料持久化之自定義資料獲取【白名單】前景導讀：在我們進行對vue腳手架建立的頁面進行重新整理時，會出現資料消失的情況，所以我們嘗試引入了資料持久化Pluigns外掛，但此時問題出現了，如果

ROS之自定義服務資料

在我們編譯原始碼時，會經常遇到以下問題： /home/zztsj/catkin_ws/src/AS_6Dof_Arm/as_arm_control/src/check_collision.cpp:11:10: fatal error: as_arm_description/CheckCollisionValid.h: 沒有那個檔案或目錄

一、自定義XML結構資料-轉換為List集合-操作節點

1.建立SkillInfo.txt 右擊檔案以xml文字編輯器開啟 2.簡單的xml結構文件 1 <skills>

大資料開發技術之自定義MySQLSource過程

自定義Source說明 Source是負責接收資料到Flume Agent的元件。Source元件可以處理各種型別、各種格式的日誌資料，包括avro、thrift、exec、jms、spooling directory、netcat、sequence generator、syslog、http、le

STL容器之set容器API（二）無重複元素原理、set容器排序、自定義資料

1.set如何實現無重複元素 void printSet(set<int>& s) { for (set<int>::iterator it = s.begin(); it != s.end(); it++)

node之自定義解析表單資料的中介軟體

// 匯入 express 模組 const express = require(\'express\') // 建立 express 的伺服器例項 const app = express()

SpringBoot之自定義註解掃描器@XXXScan

前言: 最近在學習Netty的過程中，跟著前輩們的思路用Netty作為底層通訊開發了一個非常牛逼，宇宙第一(實際超級垃圾)的Netty Rpc Demo。為啥不叫框架叫Demo呢，一個好的框架是需要非常長時間的開發和優化的，離不開大

SpringBoot系列教程web篇之自定義異常處理HandlerExceptionResolver

關於Web應用的全域性異常處理，上一篇介紹了ControllerAdvice結合@ExceptionHandler的方式來實現web應用的全域性異常管理；

Springboot之自定義全域性異常處理

本文首發至java技術部落格[碼上]：jdkcb.com/ 前言：在實際的應用開發中，很多時候往往因為一些不可控的因素導致程式出現一些錯誤，這個時候就要及時把異常資訊反饋給客戶端，便於客戶端能夠及時地進行處理，而針對

Android之自定義實現BaseAdapter（通用介面卡三）

在上一篇中，我們說過，在setData中如果有很多控制元件的話，我們還是要在該方法中寫入很多程式碼，為了降低開發的方便性，本次就在此基礎上再一次優化。實現原理是這樣的，每次在setData中都要查詢控制元件，然後se

CV(一)之自定義資料集

生成自定義資料集

初始化 VOCDataset 類

設定圖片路徑image_root和標註路徑anno_root

設定此次要訓練的樣本所有標註檔案路徑列表xml_list

設定要檢測的目標類別資訊class_dict

獲取所有樣例條數

根據索引獲取指定樣本

獲取圖片

獲取圖片資訊

根據索引獲取當前圖片的寬高

使用自定義資料集並畫上標註框

執行與測試

相關推薦

設定圖片路徑`image_root`和標註路徑`anno_root`

設定此次要訓練的樣本所有標註檔案路徑列表`xml_list`

設定要檢測的目標類別資訊`class_dict`