PASCAL VOC資料集The PASCAL Object Recognition Database Collection

阿新 • • 發佈：2019-02-04

The PASCAL Object Recognition Database Collection

News

04-Apr-07: The VOC2007 challenge development kit is now available.

Objectives

To compile a standardised collection of object recognition databases
To provide standardised ground truth object annotations across all databases
To provide a common set of tools for accessing and managing the database annotations

Standardising Each Database

All images available in PNG format
Annotated with ground truth
Ground truth information in each annotated image includes a bounding box for the objects of interest and might also include pixel segmentation masks or polygonal boundaries. The class label provided by the authors who originally compiled the database is retained and a PASCAL

class label added.

MATLAB Database Tools

PASannotateimg.m - GUI tool to annotate an image and mark out objects of interest
PASreadrecord.m - Load annotation information into MATLAB
PASwriterecord.m - Write annotation information to disk
PASviewannotation.m - Display annotated image and objects

Jump to ...

Fully Annotated Databases

1.1 The TU Darmstadt Database (formerly the ETHZ Database)

PASCAL download	Download tar.gz file of annotated PNG images
Categories	Side views of motorbikes, cars and cows
Number of images	115 motorbikes + 50 x 2 cars + 112 cows = 327
Number of annotated images	326 (cow-pic530-sml-lt discarded because of incorrect segmentation mask)
Object annotation statistics	125 PASmotorbikeSide objects + 100 PAScarSide objects + 111 PAScowSide objects
Annotation notes	The original ground truth data provided by the authors is given in terms of bounding boxes for the motorbikes and pixel segmentation masks for the cows and the cars The original segmentation masks, which are non-binary RGB triplets, have been converted to PASCAL masks (0=background, 1=object) by converting the colour masks to grey scale values between 0 and 255 and then thresholding at 128
Browse	Browse all images
Acknowledgements	Cow data provided by Derek Magee, University of Leeds. This work has been part of the CogVis project, funded in part by the Commission of the European Union (IST-2000-29375), and the Swiss Federal Office for Education and Science (BBW 00.0617)
Publications	B. Leibe, A. Leonardis and B. Schiele. Combined object categorization and segmentation with an implicit shape model. InProceedings of the Workshop on Statistical Learning in Computer Vision. Prague, Czech Republic, May 2004. Bibtex source \| Download in pdf format D. Magee and R. Boyle. Detecting Lameness using "Re-sampling Condensation" and "Multi-stream Cyclic Hidden Markov Models." Image and Vision Computing, vol 20(8), pp 581-594, 2002.

Click on images to see annotations.

Database variability and level of difficulty for objection recognition:

The database has side views of 50 cars which have been mirrored to give a total of 100. All cars have roughly the same scale and occur in the centre of the image. There is only one car per image
All cows have roughly the same scale and orientation (side view, facing left)
The 111 cow images have only 3 distinct backgrounds and many of the cow images are quite similar to at least one other cow image in the database
The motorbike images are more varied and include everyday scenes of people riding their bikes in cluttered environments
The motorbikes appear at different scales, can have large illumination changes, can have partial occlusion and there can be multiple instances per image
The annotations are fairly comprehensive as all visible cows and cars, and most motorbikes, have been labelled

1.2 The UIUC Image Database for Car Detection

PASCAL download	Download tar.gz file of annotated PNG images
Categories	Side views of cars, negative examples (images without cars)
Number of images	550 training cars + 170 test cars + 108 multi-scale test cars + 500 negative training examples = 1328
Number of annotated images	1328
Object annotation statistics	889 PAScarSide objects + 500 PASbackground objects
Annotation notes	The original ground truth data provided by the authors is given in terms of bounding boxes for the cars All cars have been assigned the label PAScarSide Some test images have multiple cars Following the original annotations, bounding boxes which lie outside the image have not been clipped All negative examples have been assigned the label PASbackground (with the original label field being set to "Not carSide") and their bounding boxes have been set to the image dimensions
Browse	Browse all images
Acknowledgements	This research, including the collection of this database, was supported by NSF grants ITR IIS 00-85980 and ITR IIS 00-85836
Publications	S. Agarwal and D. Roth. Learning a sparse representation for object detection. In Proceedings of the European Conference on Computer Vision, volume 4, pages 113--130. Copenhagen, Denmark, May 2002. Springer-Verlag. Bibtex source \| Download in pdf format S. Agarwal, A. Awan and D. Roth. Learning to detect objects in images via a sparse, part-based representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(11):1475--1490, November 2004. Bibtex source \| Download in pdf format

Click on images to see annotations.

Database variability and level of difficulty for objection recognition:

The database has only a single object category
Only side views of cars are present and the database has no rotated or frontal / rear views
The database does have 108 images with scale variations though the aspect ratio (y/x) of the bounding box always equals 0.4
However, most cars in the database occur at a single fixed scale
The images are fairly clean with little occlusion. However, there is a small amount of background clutter
For the 550 training images, the car is always the dominant object present in the middle of the image and occurring at a fixed scale. Thus, these images are good for training, but not for testing.
The annotations are comprehensive as all visible cars have been labelled

1.3 The VOC 2005 Database: Dataset 1

PASCAL download	Download tar.gz file of annotated PNG images
Categories	Views of motorbikes, bicycles, people, and cars in arbitrary pose.
Number of images	1578
Number of annotated images	1578
Object annotation statistics	Total number of labelled objects = 2209
Annotation notes	The images in this database are a subset of the other image databases on this page. The images were manually selected as an "easier" dataset for the 2005 VOC challenge. Annotations were taken verbatim from the source databases.
Browse	Browse all images
Acknowledgements	Images in this database were taken from the TU-Darmstadt, Caltech, TU-Graz and UIUC databases. Additional images were provided by INRIA. Funding was provided by PASCAL.
Publications	M. Everingham, A. Zisserman, C. K. I. Williams, L. Van Gool, et al. The 2005 PASCAL Visual Object Classes Challenge. InMachine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Textual Entailment., eds. J. Quinonero-Candela, I. Dagan, B. Magnini, and F. d'Alche-Buc, LNAI 3944, pages 117-176, Springer-Verlag, 2006. Download in pdf format

Click on images to see annotations.

1.3 The VOC 2005 Database: Testset 2

PASCAL download	Download tar.gz file of annotated PNG images
Categories	Views of motorbikes, bicycles, people, and cars in arbitrary pose.
Number of images	654
Number of annotated images	654
Object annotation statistics	Total number of labelled objects = 1293
Annotation notes	These images were collected from Google for the 2005 VOC challenge. The images were chosen to provide a "harder" test set for the challenge. All images are annotated with instances of all four categories: motorbikes, bicycles, people and cars.
Browse	Browse all images
Acknowledgements	Funding was provided by PASCAL.
Publications	M. Everingham, A. Zisserman, C. K. I. Williams, L. Van Gool, et al. The 2005 PASCAL Visual Object Classes Challenge. InMachine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Textual Entailment., eds. J. Quinonero-Candela, I. Dagan, B. Magnini, and F. d'Alche-Buc, LNAI 3944, pages 117-176, Springer-Verlag, 2006. Download in pdf format

Click on images to see annotations.

1.4 The VOC 2006 Database

PASCAL download	Download tar file of annotated PNG images: train+val sets Download tar file of annotated PNG images: test set
License	By downloading the test data you are agreeing to abide by the licenses for the "flickr" and Microsoft Research Cambridge images contained in the database: "flickr" terms of use MSR Cambridge License (RTF) Details of the contributor of each image can be found in the file "contrib.txt" included in the database.
Categories	Views of bicycles, buses, cats, cars, cows, dogs, horses, motorbikes, people, sheep in arbitrary pose.
Number of images	5,304
Number of annotated images	5,304
Object annotation statistics	Total number of labelled objects = 9,507
Annotation notes	These images were collected from personal photographs, "flickr", and the Microsoft Research Cambridge database for the 2006 VOC challenge. All images are annotated with instances of all ten categories: bicycles, buses, cats, cars, cows, dogs, horses, motorbikes, people, sheep. Guidelines used for the annotation are available here.
Browse	Browse all images
Acknowledgements	Funding was provided by PASCAL. Images were contributed and/or annotated by Moray Allen, James Bednar, Matthijs Douze, Mark Everingham, Stefan Harmeling, Juan Huo, Lindsay Hutchison, Fiona Jamieson, Maria-Elena Nilsback, John Quinn, Florian Schroff, Kira Smyllie, Mark Van Rossum, Chris Williams, John Winn, Andrew Zisserman.
Publications	Bibtex source \| Download in pdf format M. Everingham, A. Zisserman, C. K. I. Williams, L. Van Gool. The 2006 PASCAL Visual Object Classes Challenge (VOC2006) Results. http://www.pascal-network.org/challenges/VOC/voc2006/results.pdf.

Click on images to see annotations.

Partially Annotated Databases

2.1 The Caltech Database

PASCAL download	Download tar.gz file of annotated PNG images
Categories	Profile views of aeroplanes and motorbikes, rear views of cars, car backgrounds, front views of faces and general background scenes
Number of images	1074 aeroplanes + 1155 cars + 450 faces + 826 motorbikes + 1370 car backgrounds + 900 general backgrounds = 5775
Number of annotated images	4620
Object annotation statistics	Total number of labelled objects = 1293
Annotation notes	The original ground truth data provided by the authors is given in terms of a bounding quadrilateral which is converted into a bounding rectangle Following the original annotations, in some cases only one object per image is labelled even though there are multiple instances present.
Browse	Browse images selected for annotation
Acknowledgements	Funding was provided by the UK EPSRC; Caltech Center for Neuromorphic Systems Engineering and EC CogViSys project. Michalis Titsias annotated the car images.
Publications	R. Fergus, P. Perona and A. Zisserman. Object Class recognition by unsupervised scale-invariant learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, volume 2, pages 264--271. Madison, Wisconsin, June 2003. Bibtex source \| Download in pdf format

Click on images to see annotations.

Database variability and level of difficulty for objection recognition:

The database has many images for each object allowing for sufficient intra-class variability
However, there is very little pose, orientation, scale or illumination variation in the images
There is almost no occlusion and only a limited number of images have background clutter
The object of interest generally tends to occur in the middle of the image and can quite often fill the image
For some categories only one object has been labelled per image even though there are multiple objects present

2.2 The MIT-CSAIL Database of Objects and Scenes

PASCAL download	Download tar.gz file of annotated PNG images: part 1, part 2
Number of images	Over 72,000
Number of annotated images	2873
Object annotation statistics	Total number of labelled objects = 10,358. View number of objects in each class
Annotation notes	The original ground truth data provided by the authors is given in terms of boundary polygons for each labelled object Some very minor errors in the original annotations relating to image sizes and colour depth have been corrected All the original object labels have been retained. Their PASCAL equivalents are obtained by adding a PAS in front of them 180 of the original annotated images which did not contain any objects have been assigned the label PASbackground
Browse	Browse, by subdirectory, images selected for annotation Browse, by object class, images selected for annotation
Acknowledgements	Egon Pasztor made many contribution in the early stages of the database. We also want to give thanks to the flight delays and specially to the bad television programs who motivated us very much into annotating more images every day
Publications	A. Torralba, K. P. Murphy and W. T. Freeman. Sharing features: efficient boosting procedures for multiclass object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, volume 2, pages 762--769. Washington, DC, June 2004. Bibtex source \| Download in pdf format

Click on images to see annotations.

Database variability and level of difficulty for objection recognition:

The database has many different objects taken under a variety of viewing and illumination conditions
The images have been taken from different sources such as web cams, digital cameras and over the web
While the annotations are generally of a good quality, they can sometimes lack consistency in the labelling and some instances of objects have been missed altogether when performing the annotations

2.3 The TU Graz-02 Database

PASCAL download	Download tar.gz file of annotated PNG images
Categories	Bikes, cars, people and backgrounds
Number of images	365 bikes + 420 cars + 311 people + 380 negative examples = 1476
Number of annotated images	1280
Object annotation statistics	404 PASbicycle objects + 451 PAScar + 581 PASperson objects + 380 PASbackground objects
Annotation notes	The original ground truth data provided by the authors is given in terms of pixel segmentation masks The original masks have values between 0 and 255. These are converted to PASCAL masks (0 = background, 1 = 1st object, 2 = 2nd object, etc.) by (a) setting all values greater than 0 as background, (b) finding the connected components of the foreground and (c) rejecting all components smaller than 100 pixels. Following the original annotations, overlapping objects have been merged into a single object while disconnected objects parts have been labelled as separate objects. The original negative examples had been labelled as "none". These are now labelled as PASbackground with the bounding box set to the dimensions of the image.
Browse	Browse images selected for annotation
Acknowledgements	The EU project LAVA (IST-2001-34405) and the Austrian Science Foundation (project S9103-N04)
Publications	A. Opelt, M. Fussenegger, A. Pinz and P. Auer. Generic object recognition with boosting. Technical Report TR-EMT-2004-01, EMT, TU Graz, Austria, 2004. Submitted to the IEEE Transactions on Pattern Analysis and Machine Intelligence. Bibtex source \| Download in pdf format A. Opelt and A. Pinz. Object localization with boosting and weak supervision for generic object recognition. InProceedings of the 14th Scandinavian Conference on Image Analysis (SCIA). Bibtex source \| Download in pdf format

Click on images to see annotations.

Database variability and level of difficulty for objection recognition:

The database has only three classes.
However, images from each class display a large variability in scale, viewpoint and illumination.
There are significant occlusions and background clutter.
There is a large intra-class variability within the objects.
Often, the objects of interest are not the dominant objects in the scene.
The annotations are quite comprehensive and most objects of interest have been marked.

Unannotated Databases

3.1 101 Object Categories

PASCAL download	Download tar.gz file of unannotated PNG images
Number of images	9197
Browse	Browse unannotated images from the database
Publications	L. Fei-Fei, R. Fergus, and P. Perona. Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In Proceedings of the Workshop on Generative-Model Based Vision. Washington, DC, June 2004. Bibtex source \| Download in pdf format

Some sample images from the 101 Object Categories database.

from: http://host.robots.ox.ac.uk/pascal/VOC/databases.html#VOC2005_1

PASCAL VOC資料集The PASCAL Object Recognition Database Collection

The PASCAL Object Recognition Database Collection News 04-Apr-07: The VOC2007 challenge development kit is now available.Objectives To

Pascal Voc資料集詳細介紹

1.首先了解VOC2012資料集的內容資料集下載後解壓得到一個名為VOCdevkit的資料夾，該資料夾結構如下： . └── VOCdevkit #根目錄 └── VOC2012 #不同年份的資料集，這裡只下載了2012的，還有2007等其它年份的

計算機視覺標準資料集整理—PASCAL VOC資料集

資料集下載 PASCAL VOC為影象識別和分類提供了一整套標準化的優秀的資料集，從2005年到2012年每年都會舉行一場影象識別challenge。此資料集可以用於影象分類、目標檢測、影象分割。資料集下載映象地址如下（包括VOC2007和VOC2012）：

【計算機視覺】PASCAL VOC資料集分析

PASCAL VOC 資料集

PASCAL VOC為影象識別和分類提供了一整套標準化的優秀的資料集，從2005年到2012年每年都會舉行一場影象識別challenge。下載完之後解壓，可以在VOCdevkit目錄下的VOC20

Pascal Voc資料集詳細分析

前言做深度學習目標檢測方面的同學怎麼都會接觸到PASCAL VOC這個資料集。也許很少用到整個資料集，但是一般都會按照它的格式準備自己的資料集。所以這裡就來詳細的記錄一下PASCAL VOC的格式，包括目錄構成以及各個資料夾的內容格式，方便以後自己按照VOC

PASCAL VOC資料集分析

接下來需要研究的是如何自己生成訓練資料和測試資料，將在下一篇中闡述。2. 轉載2 現在深度學習很多框架都在使用VOC資料集，來研究一下voc資料集的具體內容一般voc解壓出來後都包括Annotations,ImageSets,JPEFImages,SegmentationClass ,Segmentatio

PASCAL VOC資料集分析（分類部分）

VOC2007資料集共包含：訓練集（5011幅），測試集（4952幅），共計9963幅圖，共包含20個種類。資料集的組成架構如下： Annotations —目標真值區域ImageSets —-類別標籤JPEGImages —–影象SegmentationClassSegmentationObje

PASCAL VOC資料集分析（檢測部分）

Pascal VOC 資料集介紹介紹Pascal VOC資料集：Challenge and tasks，只介紹Detection與Segmentation相關內容。資料格式 Dataset衡量方式 Evaluationvoc2007, voc2012Challenge

【Detection】物體識別-製作PASCAL VOC資料集

程式碼下載：github PASCAL VOC資料集 PASCAL VOC為影象識別和分類提供了一整套標準化的優秀的資料集，從2005年到2012年每年都會舉行一場影象識別challenge 預設為20類物體 1 資料集結構 ①JPEGImages JPEGImages資料夾中包含了PASCAL

針對pascal voc2012資料集語義分割各類物件的顏色表示

最近一直在做語義分割的工作，終於將結果調試出來了。在最後一步中，將fc8中分割得到的.mat格式的結果，轉換成.png格式的最終分割影象，但是發現並不是很清楚各顏色代表的類別，通過將creat

機器學習(7) PASCAL VOC2012資料集

因為最近在做語義分割，需要用到這個資料集。而通過對於官網的檢視和了解，發現這個資料集的用途非常的多，特此詳細記錄一下。下載地址:點選下載這個資料集的檔案目錄如下： Annotation *.xml ImageSets Action *_tr

caffe隨記（八）---使用caffe訓練FCN的pascalcontext-fcn32s模型（pascal-context資料集）

本篇討論利用caffe進行FCN訓練（採用的是pascal-context資料集） 1、下載FCN的框架如圖是我在windows下的檢視因為不同的資料來源和不同的FCN型別的網路結構並不同，對資料來源的讀取方式也不同，因此有很多分支，本篇博文以pascalcont

影象語義分割：從頭開始訓練deeplab v2系列之三【pascal-context資料集】

在之前的部落格已經講過deeplab v2原始碼解析與基於VOC2012資料集的訓練，本部落格基於pascal-context資料集進行fine tuning txt檔案：檔案中有資料集的名字列表的txt檔案,此處不同於原始碼，在pascal-con

GluonCV：用Pascal VOC資料訓練YOLO v3（下）訓練部分

本教程介紹了訓練GluonCV提供的YOLOv3目標檢測模型的基本步驟。具體來說，展示瞭如何通過堆疊GluonCV元件來構建state-of-the-art的YOLOv3模型。首先，關於訓練有三點說明：（1）初始學習率預設是0.001，我訓練loss是nan

GluonCV：用Pascal VOC資料訓練YOLO v3（上）準備資料+簡單訓練命令

1.GluonCV簡介 GluonCV提供計算機視覺中state-of-the-art（SOTA）深度學習模型的實現。它專為工程師，研究人員和學生設計，可基於這些模型快速產生原型產品和研究思路。該工具包提供四個主要功能：（1）訓練指令碼用來複現研究論

VOC資料集顏色對應關係與程式碼

VOC顏色和分類的對於關係: code: def voc_colormap(N=256): def bitget(val, idx): return ((val & (1 << idx)) != 0)

VOC資料集用於製作tensorflow所需格式

VOC資料集 VOC主要有三個重要的資料夾：Annotations、ImageSets和JPEGImages。分別是標註資訊、分割資訊和原始圖片等。公開資料集有現成的對應檔案，前人已經做好標註工作。在初步訓練網路時，選擇該組比較完備的資料，確實是一個比較快捷的方法。 XML到CS

PACAL VOC資料集格式的相關問題

獲取指定畫素值的座標 image_path = 'C:\\Users\\Yeh Chih-En\\Desktop\\test_images\\0bb0672f1afd6baaf94c516bcfb7dfae.png' image = Image.open(i

yolo 學習筆記分享之--VOC資料集

參考： <Pascal VOC & COCO資料集介紹 & 轉換> http://www.cnblogs.com/pprp/p/9629752.html 《Pascal Voc資料集詳細分析》https://blog.csdn.net/u01383270

PASCAL VOC資料集The PASCAL Object Recognition Database Collection

The PASCAL Object Recognition Database Collection

News

Objectives

Standardising Each Database

MATLAB Database Tools

Jump to ...

Fully Annotated Databases

1.1 The TU Darmstadt Database (formerly the ETHZ Database)

1.2 The UIUC Image Database for Car Detection

1.3 The VOC 2005 Database: Dataset 1

1.3 The VOC 2005 Database: Testset 2

1.4 The VOC 2006 Database

Partially Annotated Databases

2.1 The Caltech Database

2.2 The MIT-CSAIL Database of Objects and Scenes

2.3 The TU Graz-02 Database

Unannotated Databases

3.1 101 Object Categories

相關推薦