基於yolo的口罩識別(開原始碼和資料集)

阿新 • • 發佈：2022-05-11

2020年開頭真的很人意外，開年爆發了疫情。此次疫情牽動了各行各業，在這裡衷心的感謝奮鬥在一線的醫療工作者：您們辛苦了。作為一名非醫專業的學生，在這樣情況下，除了不亂跑以外，我也想以另一種方式去致敬那些保護著全國人民安全的工作人員。接下來就來介紹本專案的開發過程。

1. 開發環境

筆者的開發環境如下：
ubuntu16.04
PyTorch 1.1.0
anaconda
opencv-python
tqdm
matplotlib
pycocotools

2. 收集資料集

本專案的資料集是筆者花了一個晚上通過網路爬蟲的方式收集的（ps:沒有違反爬蟲協定）。筆者通過使用python寫了一個小爬蟲，爬蟲的程式碼如下：


import re
import requests
from urllib import error
from bs4 import BeautifulSoup
import os

num = 0
numPicture = 0
file = ''
List = []


def Find(url):
    global List
    print('正在檢測圖片總數，請稍等.....')
    t = 0
    i = 1
    s = 0
    while t < 1000:
        Url = url + str(t)
        try:
            Result = requests.get(Url, timeout=7)
        except BaseException:
            t = t + 60
            continue
        else:
            result = Result.text
            pic_url = re.findall('"objURL":"(.*?)",', result, re.S)  # 先利用正則表示式找到圖片url
            s += len(pic_url)
            if len(pic_url) == 0:
                break
            else:
                List.append(pic_url)
                t = t + 60
    return s


def recommend(url):
    Re = []
    try:
        html = requests.get(url)
    except error.HTTPError as e:
        return
    else:
        html.encoding = 'utf-8'
        bsObj = BeautifulSoup(html.text, 'html.parser')
        div = bsObj.find('div', id='topRS')
        if div is not None:
            listA = div.findAll('a')
            for i in listA:
                if i is not None:
                    Re.append(i.get_text())
        return Re


def dowmloadPicture(html, keyword):
    global num
    # t =0
    pic_url = re.findall('"objURL":"(.*?)",', html, re.S)  # 先利用正則表示式找到圖片url
    print('找到關鍵詞:' + keyword + '的圖片，即將開始下載圖片...')
    for each in pic_url:
        print('正在下載第' + str(num + 1) + '張圖片，圖片地址:' + str(each))
        try:
            if each is not None:
                pic = requests.get(each, timeout=7)
            else:
                continue
        except BaseException:
            print('錯誤，當前圖片無法下載')
            continue
        else:
            string = file + r'\\' + keyword + '_' + str(num) + '.jpg'
            fp = open(string, 'wb')
            fp.write(pic.content)
            fp.close()
            num += 1
        if num >= numPicture:
            return


if __name__ == '__main__':  # 主函式入口
    tm = int(input('請輸入每類圖片的下載數量 '))
    numPicture = tm
    line_list = []
    with open('./name.txt', encoding='utf-8') as file:
        line_list = [k.strip() for k in file.readlines()]  # 用 strip()移除末尾的空格

    for word in line_list:
        url = 'http://image.baidu.com/search/flip?tn=baiduimage&ie=utf-8&word=' + word + '&pn='
        tot = Find(url)
        Recommend = recommend(url)  # 記錄相關推薦
        print('經過檢測%s類圖片共有%d張' % (word, tot))
        file = word
        y = os.path.exists(file)
        if y == 1:
            print('該檔案已存在，請重新輸入')
            file = word
            os.mkdir(file)
        else:
            os.mkdir(file)
        t = 0
        tmp = url
        while t < numPicture:
            try:
                url = tmp + str(t)
                result = requests.get(url, timeout=10)
                print(url)
            except error.HTTPError as e:
                print('網路錯誤，請調整網路後重試')
                t = t + 60
            else:
                dowmloadPicture(result.text, word)
                t = t + 60
        numPicture = numPicture + tm

    print('任務完成')

執行此爬蟲需要配以一個name.txt,在其中寫下所需要爬取的資訊即可。，rename.py是對獲取的資料進行命名整理的。程式碼如下：

import os

path_name='./JPEGImages'
#path_name :表示你需要批量改的資料夾
i=0
for item in os.listdir(path_name):#進入到資料夾內，對每個檔案進行迴圈遍歷
    os.rename(os.path.join(path_name,item),os.path.join(path_name,('masks_'+'00'+str(i)+'.jpg')))#os.path.join(path_name,item)表示找到每個檔案的絕對路徑並進行拼接操作
    i+=1

檔案結構如下：

3.資料的清洗

這裡筆者採用的資料清洗就比較的笨了，就是人為一個一個的篩選，整理了一個晚上，請原諒筆者的愚蠢。

4. 標註資料集

標註資料集同資料的清洗而言就比較簡單了，比較的機械化。這裡筆者使用labelImg這個標註工具進行標註。這裡做的是簡單的二分類任務，即：Mask和unMasked.標註比資料清洗要快一點。

5. 相關準備

準備工作的話就是使用一個版本的大體框架，這裡使用的是yolov3，yolov3框架。下載後可以重新命名，當然也可不命名，這個隨性就可以了。

6. 資料裝載

申明一下最後會將資料集提供給大家。
我們需要將資料集Annotations、JPEGImages複製到YOLOV3工程目錄下的data檔案下；同時新建兩個資料夾，分別命名為ImageSets和labels，最後我們將JPEGImages資料夾複製貼上一下，並將資料夾重新命名為images

7. 程式碼的構建

這裡需要在專案的根目錄下新建連個python檔案：makeTxt.py和voc_label.py 其中makeTxt.py是用於生成data/ImageSets下的test.txt、train.txt、trainval.txt以及val.txt的;而voc_label.py是將Annotations裡資料集的標註檔案轉為data/label下txt格式的。
makeTxt.py的程式碼如下：

import os
import random
 
trainval_percent = 0.1
train_percent = 0.9
xmlfilepath = 'data/Annotations'
txtsavepath = 'data/ImageSets'
total_xml = os.listdir(xmlfilepath)
 
num = len(total_xml)
list = range(num)
tv = int(num * trainval_percent)
tr = int(tv * train_percent)
trainval = random.sample(list, tv)
train = random.sample(trainval, tr)
 
ftrainval = open('data/ImageSets/trainval.txt', 'w')
ftest = open('data/ImageSets/test.txt', 'w')
ftrain = open('data/ImageSets/train.txt', 'w')
fval = open('data/ImageSets/val.txt', 'w')
 
for i in list:
    name = total_xml[i][:-4] + '\n'
    if i in trainval:
        ftrainval.write(name)
        if i in train:
            ftest.write(name)
        else:
            fval.write(name)
    else:
        ftrain.write(name)
 
ftrainval.close()
ftrain.close()
fval.close()
ftest.close()

voc_label.py的程式碼如下：

import xml.etree.ElementTree as ET
import pickle
import os
from os import listdir, getcwd
from os.path import join
 
sets = ['train', 'test','val']
 
classes = ["mask","unmask"]   #我們只是檢測細胞，因此只有一個類別
 
 
def convert(size, box):
    dw = 1. / size[0]
    dh = 1. / size[1]
    x = (box[0] + box[1]) / 2.0
    y = (box[2] + box[3]) / 2.0
    w = box[1] - box[0]
    h = box[3] - box[2]
    x = x * dw
    w = w * dw
    y = y * dh
    h = h * dh
    return (x, y, w, h)
 
 
def convert_annotation(image_id):
    in_file = open('data/Annotations/%s.xml' % (image_id))
    out_file = open('data/labels/%s.txt' % (image_id), 'w')
    tree = ET.parse(in_file)
    root = tree.getroot()
    size = root.find('size')
    w = int(size.find('width').text)
    h = int(size.find('height').text)
 
    for obj in root.iter('object'):
        difficult = obj.find('difficult').text
        cls = obj.find('name').text
        if cls not in classes or int(difficult) == 1:
            continue
        cls_id = classes.index(cls)
        xmlbox = obj.find('bndbox')
        b = (float(xmlbox.find('xmin').text), float(xmlbox.find('xmax').text), float(xmlbox.find('ymin').text),
             float(xmlbox.find('ymax').text))
        bb = convert((w, h), b)
        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')
 
 
wd = getcwd()
print(wd)
for image_set in sets:
    if not os.path.exists('data/labels/'):
        os.makedirs('data/labels/')
    image_ids = open('data/ImageSets/%s.txt' % (image_set)).read().strip().split()
    list_file = open('data/%s.txt' % (image_set), 'w')
    for image_id in image_ids:
        list_file.write('data/images/%s.jpg\n' % (image_id))
        convert_annotation(image_id)
    list_file.close()

分別執行makeTxt.py和voc_label.py會在data/ImageSets的四個檔案中出現如下變化

再執行voc_label.py會在label資料夾下產生如下的變化。

接著還要配置兩個檔案
在data檔案下新建rbc.data，配置內容如下：

classes=2
train=data/train.txt
valid=data/test.txt
names=data/rbc.names
backup=backup/
eval=coco

值得注意的是這裡是對人是否佩戴口罩進行識別的，即分為兩類佩戴口罩和為佩戴口罩。所以這裡需要將classes設定為2.
再在data檔案下新建rbc.names，配置內容如下：

Masking
unMasked

8. 修改配置檔案

這裡需要將cfg下的yolov3-tiny.cfg檔案進行修改，修改內容如下：

[net]
# Testing
batch=1
subdivisions=1
# Training
# batch=64
# subdivisions=2
width=416
height=416
channels=3
momentum=0.9
decay=0.005
angle=0
saturation = 1.5
exposure = 1.5
hue=.1

learning_rate=0.005
burn_in=1000
max_batches = 500200
policy=steps
steps=400000,450000
scales=.1,.1

[convolutional]
batch_normalize=1
filters=16
size=3
stride=1
pad=1
activation=leaky

[maxpool]
size=2
stride=2

[convolutional]
batch_normalize=1
filters=32
size=3
stride=1
pad=1
activation=leaky

[maxpool]
size=2
stride=2

[convolutional]
batch_normalize=1
filters=64
size=3
stride=1
pad=1
activation=leaky

[maxpool]
size=2
stride=2

[convolutional]
batch_normalize=1
filters=128
size=3
stride=1
pad=1
activation=leaky

[maxpool]
size=2
stride=2

[convolutional]
batch_normalize=1
filters=256
size=3
stride=1
pad=1
activation=leaky

[maxpool]
size=2
stride=2

[convolutional]
batch_normalize=1
filters=512
size=3
stride=1
pad=1
activation=leaky

[maxpool]
size=2
stride=1

[convolutional]
batch_normalize=1
filters=1024
size=3
stride=1
pad=1
activation=leaky

###########

[convolutional]
batch_normalize=1
filters=256
size=1
stride=1
pad=1
activation=leaky

[convolutional]
batch_normalize=1
filters=512
size=3
stride=1
pad=1
activation=leaky

[convolutional]
size=1
stride=1
pad=1
filters=21   ####因為這裡是二分類，根據論文中的介紹這裡需要根據3*(5+classnum),應此為21
activation=linear



[yolo]
mask = 3,4,5
anchors = 10,14,  23,27,  37,58,  81,82,  135,169,  344,319
classes=2    ### 這裡因為是二分類，所以改為2
num=6
jitter=.3
ignore_thresh = .7
truth_thresh = 1
random=1

[route]
layers = -4

[convolutional]
batch_normalize=1
filters=128
size=1
stride=1
pad=1
activation=leaky

[upsample]
stride=2

[route]
layers = -1, 8

[convolutional]
batch_normalize=1
filters=256
size=3
stride=1
pad=1
activation=leaky

[convolutional]
size=1
stride=1
pad=1
filters=21   ### 因為這裡是二分類，根據論文中的介紹這裡需要根據3*(5+classnum),應此為21
activation=linear

[yolo]
mask = 0,1,2
anchors = 10,14,  23,27,  37,58,  81,82,  135,169,  344,319
classes=2    ### 這裡因為是二分類，所以改為2
num=10
jitter=.3
ignore_thresh = .7
truth_thresh = 1
random=1

9. 訓練資料集

完成了上述操作後，即可進行資料集的訓練操作了。
在根目錄下開啟命令列，輸入如下命令：

python train.py --data-cfg data/rbc.data --cfg cfg/yolov3-tiny.cfg --epochs 100

引數說明：
rbc.data 是data資料夾下
yolov3-tiny.cfg 是cfg中的yolov3的預訓練引數
epochs 100 迭代訓練資料集的次數

訓練完成後會在weights資料夾下產生訓練的模型，包括最好的訓練模型和最後一次訓練得到的模型還有每個每10次迭代產生的模型。這裡我只保留了最好的以及最後一次的訓練模型。

10. 測試模型

在完成了訓練後，即可對訓練的模型進行測試了，看一下這個其對佩戴口罩和未佩戴口罩者的預測效果
將要預測的實際照片放在data/sample資料夾下
在根目錄下開啟命令列，在其中輸入如下命令：

python detect.py --data-cfg data/rbc.data --cfg cfg/yolov3-tiny.cfg --weights weights/best.pt

預測的結果如下：

如果需要對視訊進行檢測的話，則可以輸入如下命令：

python detect.py --source file.mp4

11. 可能出現的bug

在操作的過程中難免會出現一些小毛病，這裡呢，筆者已經為這個專案總結了幾個容易出現的問題，請參考筆者的另一篇部落格：專案可能出現的問題及其解決方法

專案原始碼和資料集

這裡筆者將本專案的原始碼和資料集都已經上傳到github上，歡迎大家擴充套件，為保護我們的工作者減少工作壓力。
連結： 原始碼和資料集

總結：

由於筆者知識能力有限，在描述上可能存在不準確的地方，還請諒解。
如遇到什麼問題歡迎新增筆者qq:1017190168
進行討論。
最後對一線的工作者們說一聲你們辛苦了！！！！
祝願疫情早日結束，祝祖國繁榮富強。

基於yolo的口罩識別(開原始碼和資料集)

1. 開發環境

2. 收集資料集

3.資料的清洗

4. 標註資料集

5. 相關準備

6. 資料裝載

7. 程式碼的構建

8. 修改配置檔案

9. 訓練資料集

10. 測試模型

11. 可能出現的bug

專案原始碼和資料集

總結：

申明近期筆者接計算機視覺方面的畢業設計、比賽，有需要的歡迎聯絡！！！

基於yolo的口罩識別(開原始碼和資料集)

三維點雲配準簡介、工具和資料集 - 1

行為分析(商用級別)01 - 所有資源連結分享：論文，原始碼，資料集，預訓練模型等

模型搜尋器和資料集

詠南全新封裝的鍵值對序列和資料集序列（包括二進位制和JSON）助手類

基於類的檢視APIView, GenericAPIView和檢視集(ViewSet)

基於Python的身份證驗證識別和資料處理詳解

YOLO-V4 實現口罩識別（附加資料、資料批量處理程式）

nlp365第112天，nlp論文總結了基於方面的挑戰資料集和有效模型

天池 - “零基礎入門資料探勘 - 二手車交易價格預測”TOP 2%開原始碼

簡單使用TensorFlow.js在瀏覽器進行視訊實時目標識別(基於YOLO V3)

基於bert命名實體識別（一）資料處理

ABP框架使用Mysql資料庫，以及基於SQLServer建立Mysql資料庫的架構和資料

資料開發_Java基本類和介面原始碼和程式碼

影象識別：基於RetNet網路的cifar-10資料集影象識別分析與引數對比

CWMP開原始碼研究2——easycwmp安裝和學習

Cifar10資料集的下載和匯入，windows和linux（基於tensorflow）

深度學習入門-基於Python的理論入門與實現原始碼加mnist資料集下載推薦

【模式識別】基於PCA手寫數字識別matlab 原始碼

Alamofire4.x開原始碼分析(四)Timeline和cURL Command Output