Image Data Augmentation (Rotation)

阿新 • Published: 2020-12-26

First, the XML annotation:

<annotation>
    <folder>well</folder>
    <filename>15278480618780.jpg</filename>
    <path>15278480618780.jpg</path>
    <size>
        <width>828</width>
        <height>1104</height>
        <depth>3</depth>
    </size>
    <segmented>0</segmented>
    <object>
        <name>3</name>
        <pose>Unspecified</pose>
        <truncated>1</truncated>
        <difficult>0</difficult>
        <bndbox>
            <xmin>250</xmin>
            <ymin>672</ymin>
            <xmax>531</xmax>
            <ymax>1104</ymax>
        </bndbox>
    </object>
</annotation>
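
Before any rotation, the box coordinates have to be read out of this annotation. A minimal sketch of that step using only the standard library follows; the helper name `read_boxes` and the trimmed inline copy of the annotation are illustrative assumptions, not part of the original script.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the annotation above, inlined so the sketch runs on its own.
ANNOTATION = """<annotation>
    <filename>15278480618780.jpg</filename>
    <size><width>828</width><height>1104</height><depth>3</depth></size>
    <object>
        <name>3</name>
        <bndbox>
            <xmin>250</xmin><ymin>672</ymin><xmax>531</xmax><ymax>1104</ymax>
        </bndbox>
    </object>
</annotation>"""


def read_boxes(xml_text):
    """Return (image_width, image_height, [(label, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text)
    # .strip() guards against stray whitespace or newlines inside a tag.
    width = int(root.findtext('size/width').strip())
    height = int(root.findtext('size/height').strip())
    boxes = []
    for obj in root.iter('object'):
        label = obj.findtext('name').strip()
        bb = obj.find('bndbox')
        coords = tuple(int(bb.findtext(k).strip())
                       for k in ('xmin', 'ymin', 'xmax', 'ymax'))
        boxes.append((label,) + coords)
    return width, height, boxes


print(read_boxes(ANNOTATION))  # (828, 1104, [('3', 250, 672, 531, 1104)])
```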

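Approach:

Read each image, parse its XML annotation, transform the annotated bounding-box coordinates according to the rotation angle, and save the rotated image together with the updated annotation.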

#!/usr/bin/env python

import cv2
import math
import numpy as np
import os
import pdb
import xml.etree.ElementTree as ET


class ImgAugemention():
    def __init__(self):
        self.angle = 90

    # rotate_img
    def rotate_image(self, src, angle, scale=1.):
        w = src.shape[1]
        h = src.shape[0]
        # convert the angle to radians
        rangle = np.deg2rad(angle)  # angle in radians
        # calculate new image width and height
        nw = (abs(np.sin(rangle)*h) + abs(np.cos(rangle)*w))*scale
        nh = (abs(np.cos(rangle)*h) + abs(np.sin(rangle)*w))*scale
        # ask OpenCV for the rotation matrix
        rot_mat = cv2.getRotationMatrix2D((nw*0.5, nh*0.5), angle, scale)
        # calculate the move from the old center to the new center combined
        # with the rotation
        rot_move = np.dot(rot_mat, np.array([(nw-w)*0.5, (nh-h)*0.5, 0]))
        # the move only affects the translation, so update the translation
        # part of the transform
        rot_mat[0, 2] += rot_move[0]
        rot_mat[1, 2] += rot_move[1]
        # map
        return cv2.warpAffine(
            src, rot_mat, (int(math.ceil(nw)), int(math.ceil(nh))),
            flags=cv2.INTER_LANCZOS4)

    def rotate_xml(self, src, xmin, ymin, xmax, ymax, angle, scale=1.):
        w = src.shape[1]
        h = src.shape[0]
        rangle = np.deg2rad(angle)  # angle in radians
        # calculate the width and height of the rotated image
        nw = (abs(np.sin(rangle)*h) + abs(np.cos(rangle)*w))*scale
        nh = (abs(np.cos(rangle)*h) + abs(np.sin(rangle)*w))*scale
        # ask OpenCV for the rotation matrix
        rot_mat = cv2.getRotationMatrix2D((nw*0.5, nh*0.5), angle, scale)
        # calculate the move from the old center to the new center combined
        # with the rotation
        rot_move = np.dot(rot_mat, np.array([(nw-w)*0.5, (nh-h)*0.5, 0]))
        # the move only affects the translation, so update the translation
        # part of the transform
        rot_mat[0, 2] += rot_move[0]
        rot_mat[1, 2] += rot_move[1]
        # rot_mat: the final rot matrix
        # map the midpoints of the box's four edges through the final matrix
        point1 = np.dot(rot_mat, np.array([(xmin+xmax)/2, ymin, 1]))
        point2 = np.dot(rot_mat, np.array([xmax, (ymin+ymax)/2, 1]))
        point3 = np.dot(rot_mat, np.array([(xmin+xmax)/2, ymax, 1]))
        point4 = np.dot(rot_mat, np.array([xmin, (ymin+ymax)/2, 1]))
        # concat np.array
        concat = np.vstack((point1, point2, point3, point4))
        # change type
        concat = concat.astype(np.int32)
        print(concat)
        rx, ry, rw, rh = cv2.boundingRect(concat)
        return rx, ry, rw, rh

    def process_img(self, imgs_path, xmls_path, img_save_path, xml_save_path, angle_list):
        # assign the rot angles
        for angle in angle_list:
            for img_name in os.listdir(imgs_path):
                # split filename and suffix
                n, s = os.path.splitext(img_name)
                # only '.jpg' files are processed, for use with a YOLO model
                if s == ".jpg":
                    img_path = os.path.join(imgs_path, img_name)
                    img = cv2.imread(img_path)
                    rotated_img = self.rotate_image(img, angle)
                    save_name = n + "_" + str(angle) + "d.jpg"
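                    # write the rotated image
                    cv2.imwrite(os.path.join(img_save_path, save_name), rotated_img)
                    # log the file name, not the image array
                    print("log: [%sd] %s is processed." % (angle, img_name))
                    # reuse the stem from splitext so names containing dots still work
                    xml_url = n + '.xml'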
                    xml_path = os.path.join(xmls_path, xml_url)
                    tree = ET.parse(xml_path)
                    file_name = tree.find('filename').text  # it is origin name
                    path = tree.find('path').text  # it is origin path
                    # change name and path
                    tree.find('filename').text = save_name  # change file name to rot degree name
                    tree.find('path').text = save_name  #  change file path to rot degree name
                    root = tree.getroot()
                    for box in root.iter('bndbox'):
                        xmin = float(box.find('xmin').text)
                        ymin = float(box.find('ymin').text)
                        xmax = float(box.find('xmax').text)
                        ymax = float(box.find('ymax').text)
                        x, y, w, h = self.rotate_xml(img, xmin, ymin, xmax, ymax, angle)
                        # change the coord
                        box.find('xmin').text = str(x)
                        box.find('ymin').text = str(y)
                        box.find('xmax').text = str(x+w)
                        box.find('ymax').text = str(y+h)
                        box.set('updated', 'yes')
                    # write into new xml
                    tree.write(os.path.join(xml_save_path, n + "_" + str(angle) + "d.xml"))
                    print("[%s] %s is processed." % (angle, img_name))


if __name__ == '__main__':
    img_aug = ImgAugemention()
    imgs_path = './image/'
    xmls_path = './xml/'
    img_save_path = './rotate/'
    xml_save_path = './xml_rot/'
    angle_list = [60, 90, 120, 150, 210, 240, 300]
    img_aug.process_img(imgs_path, xmls_path, img_save_path, xml_save_path, angle_list)
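
The box transform in `rotate_xml` can be checked without OpenCV. The sketch below rebuilds the same 2x3 matrix in plain Python, using the formula documented for `cv2.getRotationMatrix2D` plus the script's translation fix-up, and then maps the box. The helper names (`rotation_matrix`, `apply`, `bounding_box`, `rotate_box`) and the `use_corners` switch are assumptions added for illustration. Mapping the edge midpoints, as the script does, gives a tighter box that may cut off the corners of the rotated rectangle; mapping the four corners gives a looser box that encloses the whole rectangle.

```python
import math


def rotation_matrix(w, h, angle, scale=1.0):
    """2x3 affine matrix equivalent to the one built in rotate_xml."""
    rad = math.radians(angle)
    nw = (abs(math.sin(rad) * h) + abs(math.cos(rad) * w)) * scale
    nh = (abs(math.cos(rad) * h) + abs(math.sin(rad) * w)) * scale
    a = scale * math.cos(rad)
    b = scale * math.sin(rad)
    cx, cy = nw * 0.5, nh * 0.5
    # rotation about the centre of the enlarged canvas
    m = [[a, b, (1 - a) * cx - b * cy],
         [-b, a, b * cx + (1 - a) * cy]]
    # shift from the old centre to the new one (translation part only)
    dx, dy = (nw - w) * 0.5, (nh - h) * 0.5
    m[0][2] += m[0][0] * dx + m[0][1] * dy
    m[1][2] += m[1][0] * dx + m[1][1] * dy
    return m


def apply(m, x, y):
    """Map one point through the 2x3 affine matrix."""
    return (m[0][0] * x + m[0][1] * y + m[0][2],
            m[1][0] * x + m[1][1] * y + m[1][2])


def bounding_box(points):
    """Truncate to ints and return (xmin, ymin, xmax, ymax).

    Mirrors astype(np.int32) followed by cv2.boundingRect, whose width and
    height count both end pixels, hence the +1 on the maxima.
    """
    xs = [int(px) for px, _ in points]
    ys = [int(py) for _, py in points]
    return min(xs), min(ys), max(xs) + 1, max(ys) + 1


def rotate_box(w, h, box, angle, use_corners=False):
    xmin, ymin, xmax, ymax = box
    m = rotation_matrix(w, h, angle)
    if use_corners:
        src = [(xmin, ymin), (xmax, ymin), (xmax, ymax), (xmin, ymax)]
    else:
        mx, my = (xmin + xmax) / 2, (ymin + ymax) / 2
        src = [(mx, ymin), (xmax, my), (mx, ymax), (xmin, my)]
    return bounding_box([apply(m, x, y) for x, y in src])


box = (250, 672, 531, 1104)
mid_box = rotate_box(828, 1104, box, 60)
corner_box = rotate_box(828, 1104, box, 60, use_corners=True)
print(mid_box)     # (777, 701, 1152, 945)
print(corner_box)  # larger box that encloses mid_box
```

For the sample annotation at 60 degrees, the midpoint variant gives the same four coordinates as the script's output for that file. Which variant suits a detector better depends on how tightly the original boxes hug the objects.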

Result:

<annotation>
    <folder>well</folder>
    <filename>15278480618780_60d.jpg</filename>
    <path>15278480618780_60d.jpg</path>
    <size>
        <width>828</width>
        <height>1104</height>
        <depth>3</depth>
    </size>
    <segmented>0</segmented>
    <object>
        <name>3</name>
        <pose>Unspecified</pose>
        <truncated>1</truncated>
        <difficult>0</difficult>
        <bndbox updated="yes">
            <xmin>777</xmin>
            <ymin>701</ymin>
            <xmax>1152</xmax>
            <ymax>945</ymax>
        </bndbox>
    </object>
</annotation>
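
Note that `<size>` in this output still reads 828 x 1104 even though `xmax` is 1152, which suggests the script leaves the size fields untouched while the rotated canvas grows. A minimal sketch of one way to keep them in sync is shown below. It reuses the same width/height formula the script passes to `cv2.warpAffine`; the helpers `rotated_size` and `update_size` are hypothetical additions, not part of the original code.

```python
import math
import xml.etree.ElementTree as ET


def rotated_size(w, h, angle, scale=1.0):
    """Canvas size computed as in rotate_image: (ceil(nw), ceil(nh))."""
    rad = math.radians(angle)
    nw = (abs(math.sin(rad) * h) + abs(math.cos(rad) * w)) * scale
    nh = (abs(math.cos(rad) * h) + abs(math.sin(rad) * w)) * scale
    return int(math.ceil(nw)), int(math.ceil(nh))


def update_size(root, angle):
    """Overwrite <size>/<width> and <size>/<height> in place."""
    size = root.find('size')
    w = int(size.findtext('width'))
    h = int(size.findtext('height'))
    new_w, new_h = rotated_size(w, h, angle)
    size.find('width').text = str(new_w)
    size.find('height').text = str(new_h)
    return new_w, new_h


root = ET.fromstring(
    "<annotation><size><width>828</width><height>1104</height>"
    "<depth>3</depth></size></annotation>")
print(update_size(root, 60))  # (1371, 1270)
```

In `process_img`, a call such as `update_size(root, angle)` would go just before `tree.write(...)`, once per file, so that the original dimensions are read before they are overwritten.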
