Tensorflow目標檢測--為視訊中的物品打上標籤

阿新 • • 發佈：2019-01-07

視訊檢測

此程式基於Tensorflow object detection API。

視訊演示：https://www.bilibili.com/video/av32418677/?p=2

# By Bend_Function
# https://space.bilibili.com/275177832
# 可以放在任何資料夾下執行（前提正確配置API[環境變數]）
# 輸出視訊沒有聲音，pr可解決一切

import numpy as np
import os
import sys
import tensorflow as tf
import cv2
import time

from 
 object_detection.utils import label_map_util
from object_detection.utils import visualization_utils as vis_util

start = time.time()
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'
cv2.setUseOptimized(True)           # 加速cv

# This is needed since the notebook is stored in the object_detection folder.
sys. 
path.append("..")

# 可能要改的內容
######################################################
PATH_TO_CKPT = 'model\\ssd_mobilenet_v1_graph.pb'   # 模型及標籤地址
PATH_TO_LABELS = 'model\\mscoco_label_map.pbtxt'

video_PATH = "test_video\\cycling.mp4"              # 要檢測的視訊
out_PATH = "OUTPUT\\out_cycling1.mp4"            # 輸出地址(帶輸出檔名) 


NUM_CLASSES = 90            # 檢測物件個數

fourcc = cv2.VideoWriter_fourcc(*'MPEG')            # 編碼器型別（可選）
# 編碼器： DIVX , XVID , MJPG ,MPEG, X264 , WMV1 , WMV2
# 如果發生寫視訊錯誤很可能是編碼器出現問題
######################################################

# Load a (frozen) Tensorflow model into memory.
detection_graph = tf.Graph()
with detection_graph.as_default():
  od_graph_def = tf.GraphDef()
  with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:
    serialized_graph = fid.read()
    od_graph_def.ParseFromString(serialized_graph)
    tf.import_graph_def(od_graph_def, name='')


# Loading label map
label_map = label_map_util.load_labelmap(PATH_TO_LABELS)
categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES, use_display_name=True)
category_index = label_map_util.create_category_index(categories)


# 讀取視訊
video_cap = cv2.VideoCapture(video_PATH)  
fps = int(video_cap.get(cv2.CAP_PROP_FPS))    # 幀率


width = int(video_cap.get(3))         # 視訊長，寬
hight = int(video_cap.get(4))


videoWriter = cv2.VideoWriter(out_PATH, fourcc, fps, (width, hight)) 

config = tf.ConfigProto()
config.gpu_options.allow_growth = True    # 減小視訊記憶體佔用
with detection_graph.as_default():
  with tf.Session(graph=detection_graph, config=config) as sess:
    # Definite input and output Tensors for detection_graph
    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
    # Each box represents a part of the image where a particular object was detected.
    detection_boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
    # Each score represent how level of confidence for each of the objects.
    # Score is shown on the result image, together with the class label.
    detection_scores = detection_graph.get_tensor_by_name('detection_scores:0')
    detection_classes = detection_graph.get_tensor_by_name('detection_classes:0')
    num_detections = detection_graph.get_tensor_by_name('num_detections:0')
    num = 0
    while True:
        ret, frame = video_cap.read()
        if ret == False:        # 沒檢測到就跳出
            break
        num += 1
        print(num)  # 輸出檢測到第幾幀了
        # print(num/fps) # 檢測到第幾秒了
        
        image_np = frame

        image_np_expanded = np.expand_dims(image_np, axis=0)
        image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
        boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
        scores = detection_graph.get_tensor_by_name('detection_scores:0')
        classes = detection_graph.get_tensor_by_name('detection_classes:0')
        num_detections = detection_graph.get_tensor_by_name('num_detections:0')

        # Actual detection.
        (boxes, scores, classes, num_detections) = sess.run(
            [boxes, scores, classes, num_detections],
            feed_dict={image_tensor: image_np_expanded})

        # Visualization of the results of a detection.
        vis_util.visualize_boxes_and_labels_on_image_array(
            image_np,
            np.squeeze(boxes),
            np.squeeze(classes).astype(np.int32),
            np.squeeze(scores),
            category_index,
            use_normalized_coordinates=True,
            line_thickness=4)

        # 寫視訊
        videoWriter.write(image_np)
        
videoWriter.release()
end = time.time()
print("Execution Time: ", end - start)

Tensorflow目標檢測--為視訊中的物品打上標籤

視訊檢測此程式基於Tensorflow object detection API。視訊演示：https://www.bilibili.com/video/av32418677/?p=2 # By Bend_Function # https://space.bilibili.

==2==Ubuntu 16.04下安裝TensorFlow 目標檢測 API(物件檢測API)

由於最近剛看了rcnn，faster_rcnn,mask_rcnn的原文，想著做一下實驗，所以就如題，在ubuntu下安裝TensorFlow的目標識別API！！！！在此之前很少用Ubuntu，所以犯的錯很齊全環境配置參考部落格連結物件檢測API參考的部落格主要參照上面的兩個部落格

SSD-Tensorflow 目標檢測（自定義資料集（VOC2007格式））

一、準備搭建SSD框架，下載解壓即可下載pascalvoc資料，自己的資料根據voc格式改寫（圖片的名稱，不用拘泥於6位數字，其他命名也可以）資料集下載點選解壓後不要混合在一個資料夾下 VOCtrainval用來訓練，VOCtest用來測試。 VOCtrai

Ubuntu 16.04下安裝TensorFlow 目標檢測 API(物件檢測API)

由於最近剛看了rcnn，faster_rcnn,mask_rcnn的原文，想著做一下實驗，所以就如題，在ubuntu下安裝TensorFlow的目標識別API！！！！宣告本人在此之前很少用Ubuntu，所以犯的錯很齊全~~哭環境配置參考部落格連結物件檢測AP

如何盯住梅西：TensorFlow目標檢測實戰

近日，一篇題為《Following Messi with TensorFlow and Object Detection》的教程文章展示瞭如何通過 TensorFlow 訓練定製的目標檢測模型，以專門定位和識別足球巨星梅西；同時作者也希望這一技術有助於催生出足球新戰術，提升賽事水平。我們之前曾把 Ten

tensorflow 目標檢測訓練及評估

基於tensorflow訓練車輛檢測器原始碼已上傳github,裡面集成了一鍵式訓練的指令碼。 0.硬體，一塊1080Ti及以上顯示卡的機器，不建議用CPU訓練。 1.安裝gpu版tensorflow,並搭建訓練環境 sudo pip install tensorflow

目標檢測模型（不用在ImageNet上預訓練）

論文：DSOD： Learning Deeply Supervised Object Detectors from Scratch 目標檢測的難點目前，所有基於深度學習的目標檢測方法都需要預先在 ImageNet 分類任務上預訓練的模型作為初始權重。這種預

目標檢測（Google object_detection） API 上訓練自己的資料集

應公司要求，利用谷歌最近開源的Google object_detection API對公司收集的資料集進行訓練，並檢測訓練效果。通過一兩天的研究以及維持四天的訓練（GTX 1060 6GB），終於成功的在自己資料集上訓練的任務。測試效果感覺還行，雖沒有達到谷歌官方公佈的

在谷歌目標檢測（Google object_detection） API 上訓練自己的資料集

知乎連結：https://zhuanlan.zhihu.com/p/28218410應公司要求，利用谷歌最近開源的Google object_detection API對公司收集的資料集進行訓練，並檢測訓練效果。通過一兩天的研究以及維持四天的訓練（GTX 1060 6GB）

為什麼目標檢測中要將全連線層轉化為卷積層？

參考文章： VGG網路中測試時為什麼全連結層改成卷積層為什麼使用卷積層替代CNN末尾的全連線層首先看一下卷積層的特點：區域性連線：提取資料區域性特徵，比如卷積核的感受野權值共享：一個卷積核只需提取一個特徵，降低了網路訓練的難度究竟使用卷積層代替全連線層會帶來什麼好處呢？

tensorflow利用預訓練模型進行目標檢測（四）：檢測中的精度問題以及evaluation

一、tensorflow提供的evaluation Inference and evaluation on the Open Images dataset：https://github.com/tensorflow/models/blob/master/research/object_detection/g

目標檢測中tensorflow常用API以及備選框篩選程式碼分析

目標檢測演算法中，因為產生的備選框特別多，需要刪減。而刪減的方法是NMS（非極大抑制演算法）。網上很多演算法是自己編寫功能程式碼。但是這不是tensorflow中自帶的功能，所以在使用tensorflow恢復模型的時候，sess並不能hold住他們。因此別人需要

初識TensorFlow之將自己訓練好的模型遷移到電腦攝像頭和外接海康攝像頭上,並在視訊中實時檢測

有了訓練好的模型之後，可以將模型遷移到電腦或者手機上電腦： # -*- coding: utf-8 -*- """ @author: Terry n """ # Imports import numpy as np import os import sys impor

Python實現ImageAI ，視訊中目標檢測10行程式碼

Python實現Imageai，視訊中目標檢測10行程式碼 ImageAI 提供方便，靈活和強大的方法來對視訊進行物件檢測和跟蹤。目前僅支援當前最先進的 RetinaNet 演算法進行物件檢測和跟蹤 from imageai.Detection import VideoObjectDetec

紅外視訊中的移動目標檢測

紅外視訊移動目標檢測的應用背景：通常的視訊目標檢測，往往都是可見光下的目標檢測，這種檢測識別以及跟蹤技術已經很成熟了，但卻有一個無法避免的缺陷，那就是無法在光線不足的情況下進行有效的檢測，在無光的晚上根本就不能進行檢測。而紅外檢測就可以彌補這樣的不足。紅

基於Tensorflow的視訊目標檢測API實現

最近在看如何實現視訊中道路目標的檢測的相關博文，過程遇坑，簡單總結。原文在此測試環境：Win10、TF-CPU、Opencv、Anaconda一、Anaconda下Tensorflow安裝由於僅做測試，不用訓練，簡裝CPU版本，Anaconda官網下載即可，開啟cmd：pip

針對無人機航拍視訊中動態背景下的目標檢測

目錄傳統目標檢測技術 1、幀間差分通過連續兩幀相同位置畫素點間的灰度差來確定目標移動。但只適用於靜態背景和目標單一條件的目標檢測。僅適用於無人機懸停狀態下的目標檢測。 2、背景差分法通過預先設定背景，然後通過對檢測影象和背

視訊目標檢測中關於對檢測出的目標進行”安全處理“問題

最近做視訊的目標識別和追蹤計數，編譯連線均沒有問題，但是在測試時出現了問題，只要標出的box與視訊的邊界接觸就會出現程式崩潰，並提示出opencv的斷言提示：OpenCV Error: Assertion failed (0 <= roi.x && 0

目標檢測SSD+Tensorflow 轉資料為tfrecord

用tensorflow做深度學習的目標檢測真是艱難困苦啊！ 1.程式碼地址：https://github.com/balancap/SSD-Tensorflow，下載該程式碼到本地 2.解壓ssd_300_vgg.ckpt.zip 到checkpoint資料夾

語義分割(semantic segmentation) 常用神經網絡介紹對比-FCN SegNet U-net DeconvNet，語義分割,簡單來說就是給定一張圖片,對圖片中的每一個像素點進行分類；目標檢測只有兩類,目標和非目標，就是在一張圖片中找到並用box標註出所有的目標.

avi projects div 般的 ict 中間接受 img dense from：https://blog.csdn.net/u012931582/article/details/70314859 2017年04月21日 14:54:10 閱讀數：4369

Tensorflow目標檢測--為視訊中的物品打上標籤

視訊檢測

相關推薦