Faster R-CNN test code explained
阿新 • Posted 2019-01-22
# (Imports as in the repo's surrounding test.py: os, cv2, numpy as np, cPickle,
# matplotlib.pyplot as plt, plus Timer, cfg, nms, im_detect, get_output_dir and
# vis_detections from the lib/ modules. This is Python 2 code, hence xrange and
# the print statements.)
def test_net(sess, net, imdb, weights_filename, max_per_image=300, thresh=0.05, vis=False):
    """Test a Fast R-CNN network on an image database."""
    num_images = len(imdb.image_index)
    # all detections are collected into:
    #     all_boxes[cls][image] = N x 5 array of detections in
    #     (x1, y1, x2, y2, score)
    all_boxes = [[[] for _ in xrange(num_images)]
                 for _ in xrange(imdb.num_classes)]
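    # Example: with imdb.num_classes == 3 and num_images == 2, all_boxes is a
    # 3 x 2 nested list; all_boxes[j][i] will later hold an (N, 5) float32
    # array of the N surviving detections for class j in image i.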
    output_dir = get_output_dir(imdb, weights_filename)
    # timers
    _t = {'im_detect': Timer(), 'misc': Timer()}

    if not cfg.TEST.HAS_RPN:
        roidb = imdb.roidb

    det_file = os.path.join(output_dir, 'detections.pkl')
    # if os.path.exists(det_file):
    #     with open(det_file, 'rb') as f:
    #         all_boxes = cPickle.load(f)
    # First loop over every image, feed it to the detection function, and get
    # back the detections and scores for that image's proposal regions
    for i in xrange(num_images):
        # filter out any ground truth boxes
        if cfg.TEST.HAS_RPN:
            box_proposals = None
        else:
            # The roidb may contain ground-truth rois (for example, if the roidb
            # comes from the training or val split). We only want to evaluate
            # detection on the *non*-ground-truth rois. We select only the rois
            # that have the gt_classes field set to 0, which means there's no
            # ground truth.
            box_proposals = roidb[i]['boxes'][roidb[i]['gt_classes'] == 0]
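            # Example: gt_classes == [15, 0, 0, 7] keeps only rows 1 and 2,
            # i.e. the proposals that are not ground-truth boxes.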
        im = cv2.imread(imdb.image_path_at(i))
        _t['im_detect'].tic()
        # scores (ndarray): R x K array of object class scores
        #                   (K includes background as object category 0)
        # boxes (ndarray): R x (4*K) array of predicted bounding boxes
        # the inputs used to be lists; what comes back are ndarrays
        scores, boxes = im_detect(sess, net, im, box_proposals)  # detections and scores for this image
        detect_time = _t['im_detect'].toc(average=False)
        _t['misc'].tic()
        if vis:
            image = im[:, :, (2, 1, 0)]  # BGR -> RGB for matplotlib
            plt.cla()
            plt.imshow(image)
        # skip j = 0, because it's the background class
        for j in xrange(1, imdb.num_classes):
            # an image has R proposal regions and there are K classes in all;
            # everything below handles class j only
            inds = np.where(scores[:, j] > thresh)[0]  # indices of detections whose score beats the threshold
            cls_scores = scores[inds, j]               # pull out the corresponding scores
            cls_boxes = boxes[inds, j*4:(j+1)*4]       # pull out the corresponding boxes
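            # boxes packs one 4-coordinate refinement per class into each row,
            # so columns [4*j, 4*j + 4) are (x1, y1, x2, y2) for class j.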
            cls_dets = np.hstack((cls_boxes, cls_scores[:, np.newaxis])) \
                .astype(np.float32, copy=False)  # merge the boxes and scores into a single new matrix
            keep = nms(cls_dets, cfg.TEST.NMS)   # NMS keeps the best boxes, i.e. the objects present in the image
            cls_dets = cls_dets[keep, :]
            if vis:
                vis_detections(image, imdb.classes[j], cls_dets)
            all_boxes[j][i] = cls_dets
        if vis:
            plt.show()
        # Limit to max_per_image detections *over all classes*
        if max_per_image > 0:
            image_scores = np.hstack([all_boxes[j][i][:, -1]
                                      for j in xrange(1, imdb.num_classes)])  # concatenate the scores of every detection in this image
            if len(image_scores) > max_per_image:  # if too many regions survive NMS, keep only the top max_per_image (usually only one or two remain anyway)
                image_thresh = np.sort(image_scores)[-max_per_image]
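                # e.g. image_scores = [0.9, 0.8, 0.3] with max_per_image = 2
                # gives image_thresh = 0.8, so only the top two detections survive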
                for j in xrange(1, imdb.num_classes):
                    keep = np.where(all_boxes[j][i][:, -1] >= image_thresh)[0]
                    all_boxes[j][i] = all_boxes[j][i][keep, :]
        nms_time = _t['misc'].toc(average=False)

        print 'im_detect: {:d}/{:d} {:.3f}s {:.3f}s' \
            .format(i + 1, num_images, detect_time, nms_time)
    with open(det_file, 'wb') as f:
        cPickle.dump(all_boxes, f, cPickle.HIGHEST_PROTOCOL)

    print 'Evaluating detections'
    imdb.evaluate_detections(all_boxes, output_dir)
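The nms call above dispatches to the repo's compiled implementation (Cython/CUDA under lib/nms). As a rough sketch of what it computes, here is a minimal pure-NumPy version of greedy IoU suppression; the name nms_sketch and the standalone form are illustrative, not the repo's actual code.

import numpy as np

def nms_sketch(dets, thresh):
    """Greedy NMS: repeatedly keep the highest-scoring box and drop every
    remaining box whose IoU with it exceeds thresh."""
    x1, y1, x2, y2 = dets[:, 0], dets[:, 1], dets[:, 2], dets[:, 3]
    scores = dets[:, 4]
    areas = (x2 - x1 + 1) * (y2 - y1 + 1)
    order = scores.argsort()[::-1]  # indices sorted by descending score
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        # intersection of the winning box with every remaining box
        xx1 = np.maximum(x1[i], x1[order[1:]])
        yy1 = np.maximum(y1[i], y1[order[1:]])
        xx2 = np.minimum(x2[i], x2[order[1:]])
        yy2 = np.minimum(y2[i], y2[order[1:]])
        inter = np.maximum(0.0, xx2 - xx1 + 1) * np.maximum(0.0, yy2 - yy1 + 1)
        iou = inter / (areas[i] + areas[order[1:]] - inter)
        # keep only the boxes that overlap the winner by no more than thresh
        order = order[np.where(iou <= thresh)[0] + 1]
    return keep

cfg.TEST.NMS is the IoU threshold (typically 0.3 in this codebase's default config), so keep holds the row indices of cls_dets that survive suppression.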
Illustrated explanation
all_boxes = [[[] for _ in xrange(num_images)]
             for _ in xrange(imdb.num_classes)]

for j in xrange(1, imdb.num_classes):
    # an image has R proposal regions and there are K classes in all;
    # everything below handles class j only
    inds = np.where(scores[:, j] > thresh)[0]  # indices of detections whose score beats the threshold
    cls_scores = scores[inds, j]               # pull out the corresponding scores
    cls_boxes = boxes[inds, j*4:(j+1)*4]       # pull out the corresponding boxes
    cls_dets = np.hstack((cls_boxes, cls_scores[:, np.newaxis])) \
        .astype(np.float32, copy=False)        # merge the boxes and scores into a single new matrix
    keep = nms(cls_dets, cfg.TEST.NMS)         # NMS keeps the best boxes, i.e. the objects present in the image
    cls_dets = cls_dets[keep, :]
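To make the indexing concrete, here is the same flow on a made-up image with R = 3 proposals and K = 3 classes; all values below are invented for illustration.

import numpy as np

# hypothetical im_detect output: R = 3 proposals, K = 3 classes (0 = background)
scores = np.array([[0.70, 0.20, 0.10],
                   [0.05, 0.90, 0.05],
                   [0.10, 0.60, 0.30]])
boxes = np.arange(3 * 12, dtype=np.float32).reshape(3, 12)  # R x (4*K)

thresh = 0.5
j = 1                                       # look at class 1 only
inds = np.where(scores[:, j] > thresh)[0]   # -> array([1, 2])
cls_scores = scores[inds, j]                # -> array([0.9, 0.6])
cls_boxes = boxes[inds, j*4:(j+1)*4]        # columns 4..7 of rows 1 and 2
cls_dets = np.hstack((cls_boxes, cls_scores[:, np.newaxis]))
print(cls_dets.shape)                       # (2, 5): two class-1 candidates

nms(cls_dets, cfg.TEST.NMS) would then drop whichever of the two boxes overlaps the higher-scoring one too heavily, and the survivors land in all_boxes[1][i].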