MS COCO Annotation Format Explained

阿新 • Published: 2018-12-30

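The JSON file

The JSON file mainly contains the following fields.
For a detailed description, see the article "COCO 標註詳解" (COCO annotations explained).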

{
    "info": info,                 # dict
    "licenses": [license],        # list of dicts
    "images": [image],            # list of dicts
    "annotations": [annotation],  # list of dicts
    "categories": [category]      # list of dicts
}
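
To make the layout above concrete, here is a minimal sketch that summarizes the top-level fields of a COCO-style dictionary. The tiny `sample` dictionary and the helper name `summarize_coco` are made up for illustration; with a real file you would pass the result of `json.load` instead.

```python
def summarize_coco(data):
    """Return {field: (type name, number of entries)} for each top-level field."""
    summary = {}
    for key, value in data.items():
        summary[key] = (type(value).__name__, len(value))
    return summary


# Hypothetical miniature dataset with the same top-level layout as COCO.
sample = {
    "info": {"description": "toy dataset", "year": 2017},
    "licenses": [{"id": 1, "name": "toy license", "url": ""}],
    "images": [{"id": 1, "file_name": "a.jpg", "height": 4, "width": 6}],
    "annotations": [{"id": 10, "image_id": 1, "category_id": 1}],
    "categories": [{"id": 1, "name": "cat", "supercategory": "animal"}],
}

for field, (kind, size) in summarize_coco(sample).items():
    print(f"{field:12s} {kind:5s} {size}")
```

On the real validation file, the `images` and `annotations` lists are where almost all of the size comes from, which is why the next step extracts a single image.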

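Opening the JSON file to inspect the data

The JSON file is very large and most of its entries repeat the same structure, so we extract a single image and save it to a new JSON file that is easier to inspect.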

# -*- coding:utf-8 -*-

import json

json_file = './annotations/instances_val2017.json'  # Object Instance annotations
# person_keypoints_val2017.json  # Object Keypoint annotations
# captions_val2017.json          # Image Caption annotations

with open(json_file, 'r') as f:
    data = json.load(f)

data_2 = {}
data_2['info'] = data['info']
data_2['licenses'] = data['licenses']
data_2['images'] = [data['images'][0]]  # keep only the first image
if 'categories' in data:  # the caption files have no categories field
    data_2['categories'] = data['categories']

# Use the image ID to find all objects that belong to this image
img_id = data_2['images'][0]['id']
data_2['annotations'] = [ann for ann in data['annotations']
                         if ann['image_id'] == img_id]

# Save to a new JSON file that is easier to inspect
with open('./new_instances_val2017.json', 'w') as f:
    json.dump(data_2, f, indent=4)  # indent=4 gives more readable output
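
The script above scans the whole annotation list for one image. If you want several images, a sketch like the following builds the lookup once and reuses it. The names `group_by_image` and `extract_subset` and the toy records are hypothetical, chosen only for this example.

```python
from collections import defaultdict


def group_by_image(annotations):
    """Map each image_id to the list of annotations that reference it."""
    index = defaultdict(list)
    for ann in annotations:
        index[ann["image_id"]].append(ann)
    return index


def extract_subset(data, image_ids):
    """Return a new COCO-style dict restricted to the given image ids."""
    wanted = set(image_ids)
    by_image = group_by_image(data["annotations"])
    subset = {
        "info": data.get("info", {}),
        "licenses": data.get("licenses", []),
        "images": [img for img in data["images"] if img["id"] in wanted],
        "annotations": [ann for i in sorted(wanted) for ann in by_image.get(i, [])],
    }
    if "categories" in data:  # caption files have no categories field
        subset["categories"] = data["categories"]
    return subset


# Toy data standing in for a loaded annotation file.
toy = {
    "images": [{"id": 1}, {"id": 2}],
    "annotations": [
        {"id": 10, "image_id": 1},
        {"id": 11, "image_id": 2},
        {"id": 12, "image_id": 1},
    ],
}
subset = extract_subset(toy, [1])
print([ann["id"] for ann in subset["annotations"]])
```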

Object Instance annotation format

The main fields are:

info

"info": { # description of the dataset
        "description": "COCO 2017 Dataset", # dataset description
        "url": "http://cocodataset.org", # download URL
        "version": "1.0", # version
        "year": 2017, # year
        "contributor": "COCO Consortium", # contributor
        "date_created": "2017/09/01" # creation date
    },

licenses

"licenses": [
        {
            "url": "http://creativecommons.org/licenses/by-nc-sa/2.0/",
            "id": 1,
            "name": "Attribution-NonCommercial-ShareAlike License"
        },
        ……
        ……
    ],

images

"images": [
        {
            "license": 4,
            "file_name": "000000397133.jpg", # image file name
            "coco_url": "http://images.cocodataset.org/val2017/000000397133.jpg", # URL of the image on the COCO site
            "height": 427, # height
            "width": 640, # width
            "date_captured": "2013-11-14 17:02:52", # capture date
            "flickr_url": "http://farm7.staticflickr.com/6116/6255196340_da26cf2c9e_z.jpg", # Flickr URL
            "id": 397133 # image ID (unique per image)
        },
        ……
        ……
    ],

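annotations

"annotations": [
        {
            "segmentation": [ # boundary points of the object (a polygon)
                [
                    224.24,297.18, # first point: x, y
                    228.29,297.18, # second point: x, y
                    234.91,298.29,
                    ……
                    ……
                    225.34,297.55
                ]
            ],
            "area": 1481.3806499999994, # area of the region
            "iscrowd": 0, # 0: a single object (polygons); 1: a group of objects (RLE)
            "image_id": 397133, # ID of the corresponding image (matches an id in images)
            "bbox": [217.62,240.54,38.99,57.75], # bounding box [x,y,w,h]
            "category_id": 44, # category ID (matches an id in categories)
            "id": 82445 # object ID; an image can contain several objects, so each one is numbered (unique per object)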
        },
        ……
        ……
        ]
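
As a rough illustration of how these fields relate, the sketch below converts a `[x,y,w,h]` box to corner form and computes the geometric area and bounding box of a flat polygon with the shoelace formula. The function names are hypothetical, and the `area` stored in a COCO file may not match the polygon area exactly, so treat this as a sanity check rather than a reimplementation.

```python
def bbox_xywh_to_xyxy(bbox):
    """Convert [x, y, w, h] (top-left corner plus size) to [x1, y1, x2, y2]."""
    x, y, w, h = bbox
    return [x, y, x + w, y + h]


def polygon_area(flat_polygon):
    """Shoelace formula for a flat [x1, y1, x2, y2, ...] polygon."""
    xs = flat_polygon[0::2]
    ys = flat_polygon[1::2]
    n = len(xs)
    total = 0.0
    for i in range(n):
        j = (i + 1) % n  # next vertex, wrapping around to close the polygon
        total += xs[i] * ys[j] - xs[j] * ys[i]
    return abs(total) / 2.0


def polygon_bbox(flat_polygon):
    """Tightest [x, y, w, h] box around a flat polygon."""
    xs = flat_polygon[0::2]
    ys = flat_polygon[1::2]
    return [min(xs), min(ys), max(xs) - min(xs), max(ys) - min(ys)]


# Toy rectangle, 4 wide and 3 high.
rectangle = [0, 0, 4, 0, 4, 3, 0, 3]
print(polygon_area(rectangle), polygon_bbox(rectangle))
print(bbox_xywh_to_xyxy([2, 1, 4, 3]))
```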

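Note that a single object (iscrowd=0) may need several polygons, for example when it is partly occluded in the image. When iscrowd=1 (the annotation covers a group of objects, such as a crowd of people), the segmentation is stored in RLE format.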
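
For reference, here is a minimal sketch of decoding an uncompressed RLE segmentation with NumPy. It assumes the usual COCO layout, a dict with `size` as `[height, width]` and `counts` as alternating run lengths that start with a run of zeros and walk the mask in column-major order. The function name and the toy input are hypothetical; in practice `coco.annToMask(ann)` from pycocotools does this for you.

```python
import numpy as np


def decode_uncompressed_rle(rle):
    """Turn {"size": [h, w], "counts": [...]} into a binary h x w mask."""
    height, width = rle["size"]
    flat = np.zeros(height * width, dtype=np.uint8)
    position = 0
    value = 0  # runs alternate 0, 1, 0, 1, ... starting with zeros
    for run in rle["counts"]:
        if value:
            flat[position:position + run] = 1
        position += run
        value = 1 - value
    # COCO masks are stored column by column, hence Fortran order.
    return flat.reshape((height, width), order="F")


# Toy 2 x 3 mask: one zero, two ones, three zeros (column-major).
toy_rle = {"size": [2, 3], "counts": [1, 2, 3]}
mask = decode_uncompressed_rle(toy_rle)
print(mask)
```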

Visualization

Now use the COCO API to display the JSON file we just generated and check it for problems.

# -*- coding:utf-8 -*-

from pycocotools.coco import COCO
import skimage.io as io
import matplotlib.pyplot as plt
import pylab
pylab.rcParams['figure.figsize'] = (8.0, 10.0)

annFile='./new_instances_val2017.json'
coco=COCO(annFile)

# display COCO categories and supercategories
cats = coco.loadCats(coco.getCatIds())
nms=[cat['name'] for cat in cats]
print('COCO categories: \n{}\n'.format(' '.join(nms)))

nms = set([cat['supercategory'] for cat in cats])
print('COCO supercategories: \n{}'.format(' '.join(nms)))

# imgIds = coco.getImgIds(imgIds = [324158])
imgIds = coco.getImgIds()
img = coco.loadImgs(imgIds[0])[0]
dataDir = '.'
dataType = 'val2017'
I = io.imread('%s/%s/%s'%(dataDir,dataType,img['file_name']))

plt.axis('off')
plt.imshow(I)
plt.show()


# load and display instance annotations
# Load the instance masks
# catIds = coco.getCatIds(catNms=['person','dog','skateboard']);
# catIds=coco.getCatIds()
catIds=[]
for ann in coco.dataset['annotations']:
    if ann['image_id']==imgIds[0]:
        catIds.append(ann['category_id'])

plt.imshow(I); plt.axis('off')
annIds = coco.getAnnIds(imgIds=img['id'], catIds=catIds, iscrowd=None)
anns = coco.loadAnns(annIds)
coco.showAnns(anns)

# initialize COCO api for person keypoints annotations
annFile = '{}/annotations/person_keypoints_{}.json'.format(dataDir,dataType)
coco_kps=COCO(annFile)

# load and display keypoints annotations
# Load the body keypoints
plt.imshow(I); plt.axis('off')
ax = plt.gca()
annIds = coco_kps.getAnnIds(imgIds=img['id'], catIds=catIds, iscrowd=None)
anns = coco_kps.loadAnns(annIds)
coco_kps.showAnns(anns)

# initialize COCO api for caption annotations
annFile = '{}/annotations/captions_{}.json'.format(dataDir,dataType)
coco_caps=COCO(annFile)

# load and display caption annotations
# Load the captions
annIds = coco_caps.getAnnIds(imgIds=img['id'])
anns = coco_caps.loadAnns(annIds)
coco_caps.showAnns(anns)
plt.imshow(I); plt.axis('off'); plt.show()

Captions printed for this image:

A man is in a kitchen making pizzas.
Man in apron standing on front of oven with pans and bakeware
A baker is working in the kitchen rolling dough.
A person standing by a stove in a kitchen.
A table with pies being made and a person standing near a wall with pots and pans hanging on the wall.

Creating a COCO-style JSON file

Following the COCO data format, convert the JSON produced by labelme into COCO JSON.

First, annotate the images with `labelme`

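Note: (the class names are not necessarily correct; they only serve to illustrate the point)
bobcat - American short-eared cat
plushcat - Ragdoll cat
deerhound - Miniature Pinscher
mainecat - Maine Coon cat
golden - Golden Retriever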

Converting the labelme JSON to COCO-format JSON

A class implements the conversion. Part of the code in labelme2COCO.py is shown below:

    # Excerpt: methods of the converter class. The module is expected to
    # import numpy as np and a utils module (presumably labelme's utils).

    def image(self, data, num):
        image = {}
        img = utils.img_b64_to_array(data['imageData'])  # decode the embedded image data
        # img = io.imread(data['imagePath'])  # or open the image from its path
        # img = cv2.imread(data['imagePath'], 0)
        height, width = img.shape[:2]
        img = None  # release the decoded array
        image['height'] = height
        image['width'] = width
        image['id'] = num + 1
        image['file_name'] = data['imagePath'].split('/')[-1]

        self.height = height
        self.width = width

        return image

    def categorie(self, label):
        categorie = {}
        categorie['supercategory'] = label[0]
        categorie['id'] = len(self.label) + 1  # 0 is reserved for the background
        categorie['name'] = label[1]
        return categorie

    def annotation(self, points, label, num):
        annotation = {}
        # tolist() yields plain Python floats, which json can serialize
        annotation['segmentation'] = [np.asarray(points, dtype=float).flatten().tolist()]
        annotation['iscrowd'] = 0
        annotation['image_id'] = num + 1
        # annotation['bbox'] = str(self.getbbox(points))  # storing the list directly raised an error when saving the JSON (cause unknown to the author; NumPy scalar types are a likely culprit)
        # list(map(int, a[1:-1].split(',')))  # with a = annotation['bbox'], turns the string back into a list
        annotation['bbox'] = list(map(float, self.getbbox(points)))

        annotation['category_id'] = self.getcatid(label)
        annotation['id'] = self.annID
        return annotation

Note: only the images, categories, and annotations fields are implemented here, because they are the only ones used.
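
The excerpt calls `self.getbbox`, whose body is not shown. The sketch below is not the author's implementation; it is one simple way to derive a box and an annotation record from a list of `[x, y]` polygon points, which is how labelme typically stores a shape. The names `polygon_to_bbox` and `shape_to_annotation` are hypothetical.

```python
import json


def polygon_to_bbox(points):
    """Axis-aligned [x, y, w, h] box around a list of [x, y] points."""
    xs = [float(p[0]) for p in points]
    ys = [float(p[1]) for p in points]
    return [min(xs), min(ys), max(xs) - min(xs), max(ys) - min(ys)]


def shape_to_annotation(points, category_id, image_id, ann_id):
    """Build a COCO-style annotation dict from one polygon."""
    # Flatten [[x1, y1], [x2, y2], ...] into [x1, y1, x2, y2, ...]
    flat = [float(value) for point in points for value in point]
    return {
        "segmentation": [flat],
        "iscrowd": 0,
        "image_id": image_id,
        "bbox": polygon_to_bbox(points),
        "category_id": category_id,
        "id": ann_id,
    }


# Toy triangle; every value is a plain Python float, so json accepts it.
triangle = [[1, 2], [5, 2], [5, 6]]
ann = shape_to_annotation(triangle, category_id=1, image_id=1, ann_id=1)
print(json.dumps(ann))
```

Casting every coordinate to a built-in `float` up front avoids the serialization error mentioned in the comments of the excerpt, whatever its original cause was.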

Visualizing the data

This part uses the COCO API to open the JSON file we just generated, to verify that it has no problems.

visualization.py

# -*- coding:utf-8 -*-

from pycocotools.coco import COCO
import numpy as np
import skimage.io as io
import matplotlib.pyplot as plt
import pylab
pylab.rcParams['figure.figsize'] = (8.0, 10.0)

annFile='./new.json'
coco=COCO(annFile)

# display COCO categories and supercategories
cats = coco.loadCats(coco.getCatIds())
nms=[cat['name'] for cat in cats]
print('COCO categories: \n{}\n'.format(' '.join(nms)))

nms = set([cat['supercategory'] for cat in cats])
print('COCO supercategories: \n{}'.format(' '.join(nms)))

# imgIds = coco.getImgIds(imgIds = [324158])
imgIds = coco.getImgIds()
imgId=np.random.randint(0,len(imgIds))
img = coco.loadImgs(imgIds[imgId])[0]
dataDir = '.'
# dataType = 'val2017'
# I = io.imread('%s/%s/%s'%(dataDir,dataType,img['file_name']))
I = io.imread('%s/%s'%(dataDir,img['file_name']))

plt.axis('off')
plt.imshow(I)
plt.show()


# load and display instance annotations
# Load the instance masks
# catIds = coco.getCatIds(catNms=['person','dog','skateboard']);
# catIds=coco.getCatIds()
catIds=[]
for ann in coco.dataset['annotations']:
    if ann['image_id']==imgIds[imgId]:
        catIds.append(ann['category_id'])

plt.imshow(I); plt.axis('off')
annIds = coco.getAnnIds(imgIds=img['id'], catIds=catIds, iscrowd=None)
anns = coco.loadAnns(annIds)
coco.showAnns(anns)
plt.show()

Result:

[Figure: the displayed result]
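
Looking at the picture catches drawing problems, but some structural mistakes are easier to find in code. The following sketch checks a few things a COCO-style file should satisfy: unique ids, annotations that point at existing images and categories, and non-empty boxes. `check_coco` and the toy dictionaries are hypothetical, and the list of checks is far from complete.

```python
def check_coco(data):
    """Return a list of human-readable problems found in a COCO-style dict."""
    problems = []
    images = data.get("images", [])
    categories = data.get("categories", [])
    annotations = data.get("annotations", [])

    # ids must be unique within each list
    for name, records in (("image", images),
                          ("category", categories),
                          ("annotation", annotations)):
        ids = [record["id"] for record in records]
        if len(ids) != len(set(ids)):
            problems.append(f"duplicate {name} id")

    image_ids = {img["id"] for img in images}
    category_ids = {cat["id"] for cat in categories}
    for ann in annotations:
        if ann["image_id"] not in image_ids:
            problems.append(f"annotation {ann['id']}: unknown image_id {ann['image_id']}")
        if ann["category_id"] not in category_ids:
            problems.append(f"annotation {ann['id']}: unknown category_id {ann['category_id']}")
        _, _, w, h = ann["bbox"]
        if w <= 0 or h <= 0:
            problems.append(f"annotation {ann['id']}: empty bbox")
    return problems


good = {
    "images": [{"id": 1}],
    "categories": [{"id": 1}],
    "annotations": [{"id": 1, "image_id": 1, "category_id": 1, "bbox": [0, 0, 2, 2]}],
}
bad = {
    "images": [{"id": 1}],
    "categories": [{"id": 1}],
    "annotations": [{"id": 1, "image_id": 2, "category_id": 9, "bbox": [0, 0, 0, 2]}],
}
print(check_coco(good))
print(check_coco(bad))
```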

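Object Keypoint annotation format

Run the script one_image_json.py to get the JSON for a single image.

Most of the content is the same as in the Object Instance format; the differences are in the categories and annotations fields.

The main content is: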

{
    "info": { 
        "description": "COCO 2017 Dataset",
        "url": "http://cocodataset.org",
        "version": "1.0",
        "year": 2017,
        "contributor": "COCO Consortium",
        "date_created": "2017/09/01"
    },
    "licenses": [
        {
            "url": "http://creativecommons.org/licenses/by-nc-sa/2.0/",
            "id": 1,
            "name": "Attribution-NonCommercial-ShareAlike License"
        },
        ……
        ……
    ],
    "images": [
        {
            "license": 4,
            "file_name": "000000397133.jpg", # image file name
            "coco_url": "http://images.cocodataset.org/val2017/000000397133.jpg", # COCO URL
            "height": 427, # height
            "width": 640, # width
            "date_captured": "2013-11-14 17:02:52", # capture date
            "flickr_url": "http://farm7.staticflickr.com/6116/6255196340_da26cf2c9e_z.jpg", # Flickr URL
            "id": 397133 # image ID (unique per image)
        }
    ],
    "categories": [
        {
            "supercategory": "person", # supercategory
            "id": 1,  # class id
            "name": "person", # category name (the specific class)
            "keypoints": [ # extra field compared with Object Instance
                "nose",
                "left_eye",
                "right_eye",
                "left_ear",
                "right_ear",
                "left_shoulder",
                "right_shoulder",
                "left_elbow",
                "right_elbow",
                "left_wrist",
                "right_wrist",
                "left_hip",
                "right_hip",
                "left_knee",
                "right_knee",
                "left_ankle",
                "right_ankle"
            ],
            "skeleton": [ # skeleton: pairs of 1-based keypoint indices that are connected
                [
                    16,14
                ],
                [
                    14,12
                ],
               ……
               ……
                [
                    5,7
                ]
            ]
        }
    ],
    "annotations": [
        {
            "segmentation": [
                [
                    446.71,70.66, # first point of the polygon (object mask): x, y
                    466.07,72.89,
                    471.28,78.85,
                    473.51,88.52,
                    473.51,98.2,
                   ……
                   ……
                    443.74,69.92
                ]
            ],
            "num_keypoints": 13, # number of labeled keypoints
            "area": 17376.91885,
            "iscrowd": 0,
            "keypoints": [
                # v=0: the keypoint is not labeled (in this case x=y=v=0)
                # v=1: the keypoint is labeled but not visible (occluded)
                # v=2: the keypoint is labeled and visible
                433,94,2, # x,y,v
                434,90,2,
                0,0,0,
                443,98,2,
                0,0,0,
                ……
                ……
            ],
            "image_id": 397133, # ID of the corresponding image
            "bbox": [
                388.66,69.92,109.41,277.62 # [x,y,w,h] bounding box of the object
            ],
            "category_id": 1, # category id
            "id": 200887 # object id (unique for every object, no duplicates allowed)
        },
        ……
        ……
    ]
}
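
The flat `keypoints` list is easier to work with once it is split into named triples. Here is a minimal sketch that does so, counts the labeled points, and resolves skeleton pairs to names. The helper names are hypothetical; the name list is copied from the categories entry above, and only the five triples and three skeleton pairs visible in the excerpt are used as input.

```python
KEYPOINT_NAMES = [
    "nose", "left_eye", "right_eye", "left_ear", "right_ear",
    "left_shoulder", "right_shoulder", "left_elbow", "right_elbow",
    "left_wrist", "right_wrist", "left_hip", "right_hip",
    "left_knee", "right_knee", "left_ankle", "right_ankle",
]


def parse_keypoints(flat, names=KEYPOINT_NAMES):
    """Split [x1, y1, v1, x2, y2, v2, ...] into {name: (x, y, v)}."""
    triples = zip(flat[0::3], flat[1::3], flat[2::3])
    return dict(zip(names, triples))


def count_labeled(flat):
    """Number of keypoints with v > 0, i.e. labeled ones."""
    return sum(1 for v in flat[2::3] if v > 0)


def skeleton_edges(skeleton, names=KEYPOINT_NAMES):
    """Translate 1-based index pairs into pairs of keypoint names."""
    return [(names[a - 1], names[b - 1]) for a, b in skeleton]


# Only the first five triples shown above; the rest is elided in the excerpt.
partial = [433, 94, 2, 434, 90, 2, 0, 0, 0, 443, 98, 2, 0, 0, 0]
parsed = parse_keypoints(partial)
print(parsed)
print(count_labeled(partial))
print(skeleton_edges([[16, 14], [14, 12], [5, 7]]))
```

On a complete annotation, `count_labeled` applied to the full 51-value list should agree with the `num_keypoints` field.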

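Image Caption annotation format

Run the script one_image_json.py to get the JSON for a single image.

Most of the content is the same as in the Object Instance format; the differences are that the annotations field has different content and there is no categories field.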

{
    "info": {
        "description": "COCO 2017 Dataset",
        "url": "http://cocodataset.org",
        "version": "1.0",
        "year": 2017,
        "contributor": "COCO Consortium",
        "date_created": "2017/09/01"
    },
    "licenses": [
        {
            "url": "http://creativecommons.org/licenses/by-nc-sa/2.0/",
            "id": 1,
            "name": "Attribution-NonCommercial-ShareAlike License"
        },
       ……
       ……
    ],
    "images": [
        {
            "license": 4,
            "file_name": "000000397133.jpg",
            "coco_url": "http://images.cocodataset.org/val2017/000000397133.jpg",
            "height": 427,
            "width": 640,
            "date_captured": "2013-11-14 17:02:52",
            "flickr_url": "http://farm7.staticflickr.com/6116/6255196340_da26cf2c9e_z.jpg",
            "id": 397133
        }
    ],
    "annotations": [
        {
            "image_id": 397133, # image ID (unique per image)
            "id": 370509, # annotation ID (unique); there is no category ID
            "caption": "A man is in a kitchen making pizzas." # caption describing the image
        },
    ……
    ……  
        {
            "image_id": 397133,
            "id": 375891,
            "caption": "A table with pies being made and a person standing near a wall with pots and pans hanging on the wall."
        }
    ]
}
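
Since each image has several caption records, a common first step is to group them by `image_id`. A minimal sketch, with a hypothetical helper name and the two records shown above as input:

```python
from collections import defaultdict


def captions_by_image(annotations):
    """Map each image_id to the list of its caption strings."""
    grouped = defaultdict(list)
    for ann in annotations:
        grouped[ann["image_id"]].append(ann["caption"].strip())
    return dict(grouped)


# The two caption records visible in the excerpt above.
caption_annotations = [
    {"image_id": 397133, "id": 370509,
     "caption": "A man is in a kitchen making pizzas."},
    {"image_id": 397133, "id": 375891,
     "caption": "A table with pies being made and a person standing near "
                "a wall with pots and pans hanging on the wall."},
]
captions = captions_by_image(caption_annotations)
for image_id, texts in captions.items():
    print(image_id, len(texts))
```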

The info, licenses, and images fields are the same in all three annotation types.
