計算機視覺著名資料集CV Datasets

阿新 • • 發佈：2019-01-09

This dataset consists of a set of actions collected from various sports which are typically featured on broadcast television channels such as the BBC and ESPN. The video sequences were obtained from a wide range of stock footage websites including BBC Motion gallery, and GettyImages.; This dataset features video sequences that were obtained using a R/C-controlled blimp equipped with an HD camera mounted on a gimbal.The collection represents a diverse pool of actions featured at different heights and aerial viewpoints. Multiple instances of each action were recorded at different flying altitudes which ranged from 400-450 feet and were performed by different actors.; It contains 11 action categories collected from YouTube.; Walk, Run, Jump, Gallop sideways, Bend, One-hand wave, Two-hands wave, Jump in place, Jumping Jack, Skip.
UCF50: UCF50 is an action recognition dataset with 50 action categories, consisting of realistic videos taken from YouTube.
ASLAN: The Action Similarity Labeling (ASLAN) Challenge.; The dataset was captured by a Kinect device. There are 12 dynamic American Sign Language (ASL) gestures, and 10 people. Each person performs each gesture 2-3 times.; Contains six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several times by 25 subjects in four different scenarios: outdoors, outdoors with scale variation, outdoors with different clothes and indoors.; Hollywood-2 datset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video in total.; This dataset contains 5 different collective activities : crossing, walking, waiting, talking, and queueing and 44 short video sequences some of which were recorded by consumer hand-held digital camera with varying view point.; The Olympic Sports Dataset contains YouTube videos of athletes practicing different sports.; Surveillance-type videos; The dataset is designed to be realistic, natural and challenging for video surveillance domains in terms of its resolution, background clutter, diversity in scenes, and human activity/event categories than existing action recognition datasets.; Collected from various sources, mostly from movies, and a small proportion from public databases, YouTube and Google videos. The dataset contains 6849 clips divided into 51 action categories, each containing a minimum of 101 clips.; Dataset of 9,532 images of humans performing 40 different actions, annotated with bounding-boxes.; Fully annotated dataset of RGB-D video data and data from accelerometers attached to kitchen objects capturing 25 people preparing two mixed salads each (4.5h of annotated data). Annotated activities correspond to steps in the recipe and include phase (pre-/ core-/ post) and the ingredient acted upon.; The dataset contains 2326 video sequences of 15 different sport actions and human body joint annotations for all sequences.; A Kinect dataset for hand detection in naturalistic driving settings as well as a challenging 19 dynamic hand gesture recognition dataset for human machine interfaces.; Observations of several subjects setting a table in different ways. Contains videos, motion capture data, RFID tag readings,...; This dataset comprises of 10 actions related to breakfast preparation, performed by 52 different individuals in 18 different kitchens.; Cooking Activities dataset.; This dataset consists of seven meal-preparation activities, each performed by 10 subjects. Subjects perform the activities based on the given cooking recipes.

計算機視覺著名資料集CV Datasets

This dataset consists of a set of actions collected from various sports which are typically featured on broadcast television channels such as the BBC and E

計算機視覺相關資料集和比賽

Imagenet資料集是目前深度學習影象領域應用得非常多的一個數據集，關於影象分類、定位、檢測等研究工作大多基於此資料集展開。Imagenet資料集有1400多萬幅圖片，涵蓋2萬多個類別；其中有超過百萬的圖片有明確的類別標註和影象中物體位置的標註。Image

CVonline: Image Databases 計算機視覺影象資料集

Index by Topic Another helpful site is the YACVID page. Action Databases Biological/Medical 2008 MICCAI MS Lesion Segmentation Challe

計算機視覺標準資料集整理—PASCAL VOC資料集

資料集下載 PASCAL VOC為影象識別和分類提供了一整套標準化的優秀的資料集，從2005年到2012年每年都會舉行一場影象識別challenge。此資料集可以用於影象分類、目標檢測、影象分割。資料集下載映象地址如下（包括VOC2007和VOC2012）：

計算機視覺標準資料集整理—CIFAR-100資料集

CIFAR-100資料集（用作100類的影象分類）這個資料集和CIFAR-10相比，它具有100個類，大約600張/類，每類500張訓練，500張測試；這100類又可以grouped成20

計算機視覺標準資料集整理—COCO資料集

COCO資料集由微軟贊助，其對於影象的標註資訊不僅有類別、位置資訊，還有對影象的語義文字描述，COCO資料集的開源使得近兩三年來影象分割語義理解取得了巨大的進展，也幾乎成為了影象語義理解演算法效能評價的“標準”資料集。Google的開源show and tell生成模型就是在此資料集上測試的。這個資料

計算機視覺技術資料留存

2018年11月《機器學習100天》深度學習影象識別的未來：機遇與挑戰並存英偉達的“千人摩擦計劃” 預訓練模型遷移學習 2018年10月為中共中央政治局講授新一代人工智慧課程高文院士：從大資料時代來到人工智慧時代，我們走了多遠了？高文院士：國家新一

CS231n 卷積神經網路與計算機視覺 6 資料預處理權重初始化規則化損失函式等常用方法總結

1 資料處理首先註明我們要處理的資料是矩陣X，其shape為[N x D] (N =number of data, D =dimensionality). 1.1 Mean subtraction 去均值去均值是一種常用的資料處理方式.它是將各個特徵值減去其均

基於深度學習的計算機視覺學習資料彙編（英）

轉載自:http://www.open-open.com/lib/view/open1452776149855.html Awesome Deep Vision A curated list of deep learning resources for comput

分享《深度學習與計算機視覺演算法原理框架應用》《大資料架構詳解從資料獲取到深度學習》PDF資料集

下載：https://pan.baidu.com/s/12-s95JrHek82tLRk3UQO_w 更多資料分享：http://blog.51cto.com/3215120 《深度學習與計算機視覺演算法原理、框架應用》PDF，帶書籤，347頁。《大資料架構詳解：從資料獲取到深度學習》PDF，帶書籤，3

分享《深度學習與計算機視覺演算法原理框架應用》PDF《大資料架構詳解從資料獲取到深度學習》PDF +資料集

下載：https://pan.baidu.com/s/12-s95JrHek82tLRk3UQO_w 更多分享資料：https://www.cnblogs.com/javapythonstudy/ 《深度學習與計算機視覺演算法原理、框架應用》PDF，帶書籤，347頁。《大資料架構詳解：從資料獲取到深度學

深度學習常用資料集資源（計算機視覺領域）

目錄 1、MNIST 2、ImageNet 4、COCO 5、PASCAL VOC 6、FDDB 1、MNIST 深度學習領域的入門資料集，當前主流的深度學習框架幾乎都將MNIST資料集的處理

深度學習與計算機視覺(PB-09)-使用HDF5儲存大資料集

到目前為止，我們使用的資料集都能夠全部載入到記憶體中。對於小資料集，我們可以載入全部影象資料到記憶體中，進行預處理，並進行前向傳播處理。然而，對於大規模資料集(比如ImageNet),我們需要建立資料生成器，每次只訪問一小部分資料集（比如mini-batch），然後對batch資料進行預處理

資料集 | 開源資料集（計算機視覺影象、定位、識別）

博主github：https://github.com/MichaelBeechan 博主CSDN：https://blog.csdn.net/u011344545 計算機視覺資料集：https://github.com/Michael

計算機視覺資料大合集

專利：如何查到一篇文獻的DOI號或通過DOI找到原始文獻? | 參考諮詢知識庫 Resolve a DOI 中文DOI crossref.org 中國專利下載 FPO IP Research & Communi

計算機視覺（影象分類、檢測、分割）資料集和比賽

1 ImageNet資料集和ILSVRC Imagenet資料集是目前深度學習影象領域應用得非常多的一個數據集，關於影象分類、定位、檢測等研究工作大多基於此資料集展開。Imagenet資料集有1400多萬幅圖片，涵蓋2萬多個類別；其中有超過百萬的圖片有明確的類

計算機視覺（八）：提取Cifar-10資料集的HOG、HSV特徵並使用神經網路進行分類

1 - 引言之前我們都是將整張圖片輸入進行分類，要想進一步提升準確率，我們就必須提取出圖片更容易區分的特徵，再將這些特徵當做特徵向量進行分類。在之前我們學了一些常用的影象特徵，在這次實驗中，我們使用了兩種特徵梯度方向直方圖（HOG）顏色直方圖（HSV）

計算機視覺（七）：構建兩層的神經網路來分類Cifar-10資料集

1 - 引言之前我們學習了神經網路的理論知識，現在我們要自己搭建一個結構為如下圖所示的神經網路，對Cifar-10資料集進行分類前向傳播比較簡單，就不在贅述反向傳播需要注意的是，softmax的反向傳播與之前寫的softmax程式碼一樣。神經網路內部的反向傳播權重偏導就是前面

計算機視覺（六）：使用Softmax分類Cifar-10資料集

1 - 引言這次，我們將使用Softmax來分類Cifar-10，過程其實很之前使用的SVM過程差不多，主要區別是在於損失函式的不同，而且Softmax分類器輸出的結果是輸入樣本在不同類別上的概率值大小,Softmax分類器也叫多項Logistic迴歸線性模型:

計算機視覺（五）：使用SVM分類Cifar-10資料集

1 - 引言之前我們使用了K-NN對Cifar-10資料集進行了圖片分類，正確率只有不到30%，但是還是比10%高的[手動滑稽]，這次我們將學習使用SVM分類器來對Cafi-10資料集實現分類，但是正確率應該也不會很高要想繼續提高正確率，就要對影象進行預處理和特徵的選取工作，而不

計算機視覺著名資料集CV Datasets

相關推薦