[Deep Learning][CIFAR Data Processing] Introduction to the CIFAR-10 and CIFAR-100 Datasets
阿新 • Published: 2019-01-10
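CIFAR-10 and CIFAR-100 are mainstream image-classification datasets. This post covers how to download them and how to work with the Python version.
Dataset download page: http://www.cs.toronto.edu/~kriz/cifar.html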
Dataset overview
CIFAR-10 and CIFAR-100 are labeled subsets of the 80 Million Tiny Images dataset. They were collected by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton (three heavyweights of the field).
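CIFAR-10 dataset
The CIFAR-10 dataset consists of 60,000 32x32 color images in 10 classes ('airplane', 'automobile', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck'), with 6,000 images per class. It is split into 50,000 training images and 10,000 test images.
The dataset is divided into five training batches (data_batch) and one test batch (test_batch), each containing 10,000 images. The test batch contains exactly 1,000 randomly selected images from each class. The training batches contain the remaining images in random order, so an individual training batch may hold more images of one class than of another. Taken together, the five training batches contain exactly 5,000 images from each class.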
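The batch layout described above can be explored with a short Python sketch. This is a minimal example, not the official loader. It assumes the format documented on the dataset page for the Python version: each batch file is a pickled dict whose `b"data"` entry is a 10000x3072 `uint8` array (1024 red values, then 1024 green, then 1024 blue, each plane stored row by row) and whose `b"labels"` entry is a list of integers from 0 to 9. The function names `load_batch`, `rows_to_images`, and `class_counts` are made up for illustration.

```python
import pickle

import numpy as np


def load_batch(path):
    """Unpickle one batch file (e.g. a data_batch or the test_batch)."""
    with open(path, "rb") as f:
        # The files were written by Python 2, so decode keys as bytes.
        return pickle.load(f, encoding="bytes")


def rows_to_images(data):
    """Turn N flat rows of 3072 bytes into an (N, 32, 32, 3) image array.

    Assumes each row stores the R, G, B planes one after another,
    each plane in row-major order.
    """
    data = np.asarray(data, dtype=np.uint8)
    if data.ndim != 2 or data.shape[1] != 3 * 32 * 32:
        raise ValueError("expected an array of shape (N, 3072)")
    # (N, 3072) -> (N, channel, row, col) -> (N, row, col, channel)
    return data.reshape(-1, 3, 32, 32).transpose(0, 2, 3, 1)


def class_counts(labels, num_classes=10):
    """Count how many images of each class a batch contains."""
    return np.bincount(np.asarray(labels, dtype=np.int64), minlength=num_classes)
```

Calling `class_counts(batch[b"labels"])` on a single training batch is a quick way to see the per-batch class imbalance mentioned above, while summing the counts over all five training batches should give 5,000 per class.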
Downloads
Version | Size | md5sum |
---|---|---|
CIFAR-10 python version | 163 MB | c58f30108f718f92721af3b95e74349a |
CIFAR-10 Matlab version | 175 MB | 70270af85842c9e89bb428ec9976c926 |
CIFAR-10 binary version (suitable for C programs) | 162 MB | c32a1d4ab5d03f1284b67883e8d87530 |
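The md5sum column can be used to check a downloaded archive before unpacking it. The following is a minimal sketch using only the standard library. The helper names `md5sum` and `verify` are hypothetical, and the local file path is whatever name the archive was saved under.

```python
import hashlib

# Expected checksum of the CIFAR-10 python version, copied from the table above.
CIFAR10_PYTHON_MD5 = "c58f30108f718f92721af3b95e74349a"


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """True if the file's MD5 digest matches the expected hex string."""
    return md5sum(path) == expected.strip().lower()
```

For example, `verify(local_archive_path, CIFAR10_PYTHON_MD5)` should return `True` for an intact download of the Python version. Reading in chunks keeps memory use flat even though the archives are over 160 MB.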
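CIFAR-100 dataset
This dataset is similar to CIFAR-10, except that it has 100 classes with 600 images each. Every class is split into 500 training images and 100 test images. The 100 classes are grouped into 20 superclasses. Each image carries a "fine" label (the class it belongs to) and a "coarse" label (the superclass it belongs to).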
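To make the fine/coarse labeling concrete, here is a small sketch that pairs the two labels of every image and derives a class-to-superclass lookup from them. It assumes the key names `b"fine_labels"` and `b"coarse_labels"` that the dataset page documents for the Python version. They are not stated in this post, so treat them as assumptions. The function names are hypothetical.

```python
import pickle


def load_pickle(path):
    """Unpickle a CIFAR-100 file; the real files also need NumPy installed."""
    with open(path, "rb") as f:
        return pickle.load(f, encoding="bytes")


def label_pairs(batch):
    """Return a list of (fine_label, coarse_label) tuples, one per image."""
    fine = batch[b"fine_labels"]      # assumed key name
    coarse = batch[b"coarse_labels"]  # assumed key name
    if len(fine) != len(coarse):
        raise ValueError("fine and coarse label lists differ in length")
    return list(zip(fine, coarse))


def fine_to_coarse(batch):
    """Build {fine_label: coarse_label}, checking that the mapping is consistent."""
    mapping = {}
    for fine, coarse in label_pairs(batch):
        # setdefault stores the first coarse label seen for this fine label.
        if mapping.setdefault(fine, coarse) != coarse:
            raise ValueError(f"fine label {fine} maps to two coarse labels")
    return mapping
```

On the full training set the resulting dict should have 100 entries covering 20 distinct coarse labels, which matches the 100-class, 20-superclass structure described above.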
The classes in CIFAR-100 are listed below:
Superclass | Classes |
---|---|
aquatic mammals | beaver, dolphin, otter, seal, whale |
fish | aquarium fish, flatfish, ray, shark, trout |
flowers | orchids, poppies, roses, sunflowers, tulips |
food containers | bottles, bowls, cans, cups, plates |
fruit and vegetables | apples, mushrooms, oranges, pears, sweet peppers |
household electrical devices | clock, computer keyboard, lamp, telephone, television |
household furniture | bed, chair, couch, table, wardrobe |
insects | bee, beetle, butterfly, caterpillar, cockroach |
large carnivores | bear, leopard, lion, tiger, wolf |
large man-made outdoor things | bridge, castle, house, road, skyscraper |
large natural outdoor scenes | cloud, forest, mountain, plain, sea |
large omnivores and herbivores | camel, cattle, chimpanzee, elephant, kangaroo |
medium-sized mammals | fox, porcupine, possum, raccoon, skunk |
non-insect invertebrates | crab, lobster, snail, spider, worm |
people | baby, boy, girl, man, woman |
reptiles | crocodile, dinosaur, lizard, snake, turtle |
small mammals | hamster, mouse, rabbit, shrew, squirrel |
trees | maple, oak, palm, pine, willow |
vehicles 1 | bicycle, bus, motorcycle, pickup truck, train |
vehicles 2 | lawn-mower, rocket, streetcar, tank, tractor |
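The table above can be turned into a lookup structure directly. The sketch below parses the same rows into a dict and builds the reverse mapping from class to superclass. The names `parse_superclasses`, `SUPERCLASSES`, and `CLASS_TO_SUPERCLASS` are illustrative only. The class names are the display names from the table, which may differ from the label strings stored inside the dataset files.

```python
# Rows copied verbatim from the table above.
TABLE = """\
aquatic mammals | beaver, dolphin, otter, seal, whale |
fish | aquarium fish, flatfish, ray, shark, trout |
flowers | orchids, poppies, roses, sunflowers, tulips |
food containers | bottles, bowls, cans, cups, plates |
fruit and vegetables | apples, mushrooms, oranges, pears, sweet peppers |
household electrical devices | clock, computer keyboard, lamp, telephone, television |
household furniture | bed, chair, couch, table, wardrobe |
insects | bee, beetle, butterfly, caterpillar, cockroach |
large carnivores | bear, leopard, lion, tiger, wolf |
large man-made outdoor things | bridge, castle, house, road, skyscraper |
large natural outdoor scenes | cloud, forest, mountain, plain, sea |
large omnivores and herbivores | camel, cattle, chimpanzee, elephant, kangaroo |
medium-sized mammals | fox, porcupine, possum, raccoon, skunk |
non-insect invertebrates | crab, lobster, snail, spider, worm |
people | baby, boy, girl, man, woman |
reptiles | crocodile, dinosaur, lizard, snake, turtle |
small mammals | hamster, mouse, rabbit, shrew, squirrel |
trees | maple, oak, palm, pine, willow |
vehicles 1 | bicycle, bus, motorcycle, pickup truck, train |
vehicles 2 | lawn-mower, rocket, streetcar, tank, tractor |
"""


def parse_superclasses(text):
    """Parse 'superclass | class, class, ... |' rows into {superclass: [classes]}."""
    groups = {}
    for line in text.strip().splitlines():
        parts = [part.strip() for part in line.split("|")]
        name, classes = parts[0], parts[1]
        groups[name] = [c.strip() for c in classes.split(",")]
    return groups


SUPERCLASSES = parse_superclasses(TABLE)

# Reverse lookup: class display name -> superclass name.
CLASS_TO_SUPERCLASS = {
    cls: superclass
    for superclass, classes in SUPERCLASSES.items()
    for cls in classes
}
```

A lookup such as `CLASS_TO_SUPERCLASS["dolphin"]` then returns the superclass name, and the sizes of the two dicts give a quick sanity check on the 20 x 5 = 100 structure.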
Downloads
Version | Size | md5sum |
---|---|---|
CIFAR-100 python version | 161 MB | eb9058c3a382ffc7106e4002c42a8d85 |
CIFAR-100 Matlab version | 175 MB | 6a4bfa1dcd5c9453dda6bb54194911f4 |
CIFAR-100 binary version (suitable for C programs) | 161 MB | 03b5dce01913d631647c71ecec9e9cb8 |