Detectron 儲存faster rcnn 測試結果，類別置信度座標

阿新 • • 發佈：2018-12-18

由於專案需要，要求獲取每張圖片中每個box的類別、置信度得分和bbox的座標資訊。

思路在inference階段，每infer一張圖片就新開一個txt檔案，txt檔案的每一行代表一個bbox得檢測資訊包括bbox的類別置信度和四個座標值

需要修改兩個檔案

1，修改 detectron-master/detectron/utils/vis.py 檔案

2，修改 detectron-master/tools/infer_simple.py 檔案

對於detectron-master/detectron/utils/vis.py 檔案主要修改vis_one_image()函式,添加了一個引數用於傳入txt的檔名字。

另外新新增一個函式get_class_and_confidence用於獲取類別和置信度，該函式放在get_class_string杉樹下面如下圖：

函式get_class_and_confidence在函式vis_one_image中呼叫獲取類別和置信度得分，修改後的vis_one_image函式如下

def vis_one_image(
        im, im_name, output_dir, boxes, segms=None, keypoints=None, thresh=0.9,
        kp_thresh=2, dpi=200, box_alpha=0.0, dataset=None, show_class=False,
        ext='pdf', out_when_no_box=False,tested_txt=None):
    """Visual debugging of detections."""
    assert not (tested_txt==None),"please give the output full txt name"
    print("save detect reselt in : %s",tested_txt)
    save_to_txt = open(tested_txt,'w',encoding='utf-8')
    if not os.path.exists(output_dir):
        os.makedirs(output_dir)

    if isinstance(boxes, list):
        boxes, segms, keypoints, classes = convert_from_cls_format(
            boxes, segms, keypoints)

    if (boxes is None or boxes.shape[0] == 0 or max(boxes[:, 4]) < thresh) and not out_when_no_box:
        return

    dataset_keypoints, _ = keypoint_utils.get_keypoints()

    if segms is not None and len(segms) > 0:
        masks = mask_util.decode(segms)

    color_list = colormap(rgb=True) / 255

    kp_lines = kp_connections(dataset_keypoints)
    cmap = plt.get_cmap('rainbow')
    colors = [cmap(i) for i in np.linspace(0, 1, len(kp_lines) + 2)]

    fig = plt.figure(frameon=False)
    fig.set_size_inches(im.shape[1] / dpi, im.shape[0] / dpi)
    ax = plt.Axes(fig, [0., 0., 1., 1.])
    ax.axis('off')
    fig.add_axes(ax)
    ax.imshow(im)

    if boxes is None:
        sorted_inds = [] # avoid crash when 'boxes' is None
    else:
        # Display in largest to smallest order to reduce occlusion
        areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
        sorted_inds = np.argsort(-areas)

    mask_color_id = 0
    for i in sorted_inds:
        bbox = boxes[i, :4]
        score = boxes[i, -1]
        if score < thresh:
            continue

        # show box (off by default)
        ax.add_patch(
            plt.Rectangle((bbox[0], bbox[1]),
                          bbox[2] - bbox[0],
                          bbox[3] - bbox[1],
                          fill=False, edgecolor='g',
                          linewidth=0.5, alpha=box_alpha))
        # print(classes[i],score,bbox)
        mycls,confidence = get_class_and_confidence(classes[i], score, dataset)
        write_linedata = "class:"+mycls+" "+"score:"+confidence+" "+"xmin:"+bbox[0]+" "+"ymin:"+bbox[1]+" "+"xmax:"+bbox[2]+" "+"ymax:"+bbox[3]
        save_to_txt.write(write_linedata + '\n')
        if show_class:
            ax.text(
                bbox[0], bbox[1] - 2,
                get_class_string(classes[i], score, dataset),
                fontsize=3,
                family='serif',
                bbox=dict(
                    facecolor='g', alpha=0.4, pad=0, edgecolor='none'),
                color='white')

        # show mask
        if segms is not None and len(segms) > i:
            img = np.ones(im.shape)
            color_mask = color_list[mask_color_id % len(color_list), 0:3]
            mask_color_id += 1

            w_ratio = .4
            for c in range(3):
                color_mask[c] = color_mask[c] * (1 - w_ratio) + w_ratio
            for c in range(3):
                img[:, :, c] = color_mask[c]
            e = masks[:, :, i]

            _, contour, hier = cv2.findContours(
                e.copy(), cv2.RETR_CCOMP, cv2.CHAIN_APPROX_NONE)

            for c in contour:
                polygon = Polygon(
                    c.reshape((-1, 2)),
                    fill=True, facecolor=color_mask,
                    edgecolor='w', linewidth=1.2,
                    alpha=0.5)
                ax.add_patch(polygon)

        # show keypoints
        if keypoints is not None and len(keypoints) > i:
            kps = keypoints[i]
            plt.autoscale(False)
            for l in range(len(kp_lines)):
                i1 = kp_lines[l][0]
                i2 = kp_lines[l][1]
                if kps[2, i1] > kp_thresh and kps[2, i2] > kp_thresh:
                    x = [kps[0, i1], kps[0, i2]]
                    y = [kps[1, i1], kps[1, i2]]
                    line = plt.plot(x, y)
                    plt.setp(line, color=colors[l], linewidth=1.0, alpha=0.7)
                if kps[2, i1] > kp_thresh:
                    plt.plot(
                        kps[0, i1], kps[1, i1], '.', color=colors[l],
                        markersize=3.0, alpha=0.7)

                if kps[2, i2] > kp_thresh:
                    plt.plot(
                        kps[0, i2], kps[1, i2], '.', color=colors[l],
                        markersize=3.0, alpha=0.7)

            # add mid shoulder / mid hip for better visualization
            mid_shoulder = (
                kps[:2, dataset_keypoints.index('right_shoulder')] +
                kps[:2, dataset_keypoints.index('left_shoulder')]) / 2.0
            sc_mid_shoulder = np.minimum(
                kps[2, dataset_keypoints.index('right_shoulder')],
                kps[2, dataset_keypoints.index('left_shoulder')])
            mid_hip = (
                kps[:2, dataset_keypoints.index('right_hip')] +
                kps[:2, dataset_keypoints.index('left_hip')]) / 2.0
            sc_mid_hip = np.minimum(
                kps[2, dataset_keypoints.index('right_hip')],
                kps[2, dataset_keypoints.index('left_hip')])
            if (sc_mid_shoulder > kp_thresh and
                    kps[2, dataset_keypoints.index('nose')] > kp_thresh):
                x = [mid_shoulder[0], kps[0, dataset_keypoints.index('nose')]]
                y = [mid_shoulder[1], kps[1, dataset_keypoints.index('nose')]]
                line = plt.plot(x, y)
                plt.setp(
                    line, color=colors[len(kp_lines)], linewidth=1.0, alpha=0.7)
            if sc_mid_shoulder > kp_thresh and sc_mid_hip > kp_thresh:
                x = [mid_shoulder[0], mid_hip[0]]
                y = [mid_shoulder[1], mid_hip[1]]
                line = plt.plot(x, y)
                plt.setp(
                    line, color=colors[len(kp_lines) + 1], linewidth=1.0,
                    alpha=0.7)
    save_to_txt.close()

對於detectron-master/tools/infer_simple.py 檔案，主要修改main函式，建立了一個資料夾（預設在detectron-master目錄一下）output_txts用來存放每一張圖片的txt。

修改後的main()函式

def main(args):
    logger = logging.getLogger(__name__)

    merge_cfg_from_file(args.cfg)
    cfg.NUM_GPUS = 1
    args.weights = cache_url(args.weights, cfg.DOWNLOAD_CACHE)
    assert_and_infer_cfg(cache_urls=False)

    assert not cfg.MODEL.RPN_ONLY, \
        'RPN models are not supported'
    assert not cfg.TEST.PRECOMPUTED_PROPOSALS, \
        'Models that require precomputed proposals are not supported'

    model = infer_engine.initialize_model_from_cfg(args.weights)
    dummy_coco_dataset = dummy_datasets.get_coco_dataset()

    if os.path.isdir(args.im_or_folder):
        im_list = glob.iglob(args.im_or_folder + '/*.' + args.image_ext)
    else:
        im_list = [args.im_or_folder]

    script_path = os.path.dirname(os.path.abspath(__file__))
    txt_path = "{0}/../output_txts/".format(script_path)
    if not os.path.exists():
        od.makedirs(txt_path)
    else:
        print("path exists,it should be removed!!")
    for i, im_name in enumerate(im_list):
        out_name = os.path.join(
            args.output_dir, '{}'.format(os.path.basename(im_name) + '.' + args.output_ext)
        )
        logger.info('Processing {} -> {}'.format(im_name, out_name))
        im = cv2.imread(im_name)
        timers = defaultdict(Timer)
        t = time.time()
        with c2_utils.NamedCudaScope(0):
            cls_boxes, cls_segms, cls_keyps = infer_engine.im_detect_all(
                model, im, None, timers=timers
            )
        print
        logger.info('Inference time: {:.3f}s'.format(time.time() - t))
        for k, v in timers.items():
            logger.info(' | {}: {:.3f}s'.format(k, v.average_time))
        if i == 0:
            logger.info(
                ' \ Note: inference on the first image will be slower than the '
                'rest (caches and auto-tuning need to warm up)'
            )
        #add save txt path
        txt_name = im_name.split(".")[0] + '.txt'
        save_txt_path = txt_path + txt_name
        vis_utils.vis_one_image(
            im[:, :, ::-1],  # BGR -> RGB for visualization
            im_name,
            args.output_dir,
            cls_boxes,
            cls_segms,
            cls_keyps,
            dataset=dummy_coco_dataset,
            box_alpha=0.3,
            show_class=True,
            thresh=args.thresh,
            kp_thresh=args.kp_thresh,
            ext=args.output_ext,
            out_when_no_box=args.out_when_no_box,
            tested_txt=save_txt_path
        )

Detectron 儲存faster rcnn 測試結果，類別置信度座標

由於專案需要，要求獲取每張圖片中每個box的類別、置信度得分和bbox的座標資訊。思路在inference階段，每infer一張圖片就新開一個txt檔案，txt檔案的每一行代表一個bbox得檢測資訊包括bbox的類別置信度和四個座標值需要修改兩個檔案 1

儲存faster-rcnn的檢測結果

為了分析faster-Rcnn的測試結果，需要先將測試結果儲存起來，效果如下：（圖片名類別 bbox座標）程式碼如下： #!/usr/bin/env python # --------------------------------------------

輸出yolo的測試結果，根據座標裁剪原圖並儲存

因為專案中的需要，本篇博文實現輸出（儲存）yolo的測試結果，並測試結果的座標位置切割原圖，並不需要知道每個框的類別，儲存了top9。主要對src/image.c檔案中的draw_detections函式做了修改。 //添加了 char *filena

faster rcnn 測試程式碼解釋

def test_net(sess, net, imdb, weights_filename , max_per_image=300, thresh=0.05, vis=False): """Test a Fast R-CNN network on an

SmallCorgi/TF-Faster RCNN測試

環境配置 Github上給出SmallCorgi的連結TF-Faster RCNN，按照要求配置環境。 sudo pip install cython sudo pip install easydict sudo pip install opencv-p

caffe學習（四）：py-faster-rcnn配置，執行測試程式（Ubuntu）

上一篇部落格中講了在Ubuntu下安裝caffe的經驗總結（各種問題，簡直懷疑人生了）。部落格連結：點我開啟 faster-rcnn有兩個版本，分別是Python的和MATLAB的。這裡介紹python版本的faster-rcnn的配置。網上有很多相關的教程，起初我在配置

Faster RCNN修改demo.py檔案實現圖片的批量測試與儲存

關於Faster R-CNN Tensorflow+python 3.5 在Windows10環境下配置實現，可以參看這裡。執行在demo.py檔案中測試資料中原始碼設定僅檢測幾張圖片供參考，原始的程式碼段如下。 im_names = ['000456.jpg', '000

學習筆記-目標檢測、定位、識別（RCNN，Fast-RCNN, Faster-RCNN，Mask-RCNN，YOLO，SSD 系列）

0. 前言說到深度學習的目標檢測，就要提到傳統的目標檢測方法。傳統的目標檢測流程： 1）區域選擇（窮舉策略：採用滑動視窗，且設定不同的大小，不同的長寬比對影象進行遍歷，時間複雜度高） 2）特徵提取（SIFT、HOG等；形態多樣性、光照變化多樣性、背景多樣性使得特徵魯棒性差）

Faster-RCNN-tf使用訓練好的模型驗證測試集 test_net.py

對應原始碼地址：https://github.com/endernewton/tf-faster-rcnn 1、開啟tools目錄下的test_net.py檔案修改（1）： parser.add_argument('--model', dest='model', help='mo

mysql儲存過程舉例：100以內的整數除以2、4、6、8的結果，相加等於多少

學習儲存過程：首先知道它是幹嘛的，概念：將一組sql語句，完成一個特定的功能，稱之為儲存過程，寫儲存過程：只能建立、替換、刪除 DROP PROCEDURE IF EXISTS sum; -- procedure 存在則先刪除 create procedure `su

faster-rcnn中新增Mask中的RoiAlign層，使迴歸框更精確（ roi_align_layer.cu:240] Check failed: error == cudaSuccess *）

版權宣告：本文為博主原創文章，未經博主允許不得轉載。 https://blog.csdn.net/e01528/article/details/80265118 具體的操作為什麼這樣做，可參照： 1.Caffe學習之自定義建立新的Layer層 2.如何在caffe中自定

建立一棵用二叉樹連結串列方式儲存的二叉樹，並對其進行遍歷（先序，中序和後序），列印輸出遍歷結果

題目如下程式碼如下 #include<stdio.h> #include<stdlib.h> #include<malloc.h> typedef struct Node//結構體 {

輸入某二叉樹的前序遍歷和中序遍歷的結果，請重建出該二叉樹(java實現並測試)

假設輸入的前序遍歷和中序遍歷的結果中都不含重複的數字。例如輸入前序遍歷序列{1,2,4,7,3,5,6,8}和中序遍歷序列{4,7,2,1,5,3,8,6}，則重建二叉樹並返回。 package ssp; class TreeNode { int val; TreeNod

ASP.NET MVC + EF 利用儲存過程讀取大資料，1億資料測試很OK

看到本文的標題，相信你會忍不住進來看看！沒錯，本文要講的就是這個重量級的東西，這個不僅僅支援單表查詢，更能支援連線查詢，加入一個表10W資料，另一個表也是10萬資料，當你用linq建立一個連線查詢然後

Ubuntu 16.04 測試 tf-faster-rcnn 在CPU下執行

參考連結： git clone https://github.com/endernewton/tf-faster-rcnn.git 2、執行和修改配置檔案 cd tf-faster-rcnn/lib vim setup.py make clean mak

faster rcnn原始碼理解imdb，roidb，blob很關鍵

原 faster rcnn原始碼理解 2016年12月12日 23:07:19 zbxzc 閱讀數：15173 &

Faster批量測試且所有類檢測結果都顯示在一張圖上。

endernewton版本tensorflow實現的faster-rcnn 原來demo.py：實現的是檢測一張圖片，然後對該圖片的每一類檢測結果，單獨顯示。修改之後：從txt中讀取要檢測的圖片名稱，進行批量檢測，並把所有類的檢測結果都放到一張圖上，然後儲存到dat

faster-rcnn demo程式碼修改進行視訊實時性測試

參考部落格原址：https://blog.csdn.net/qq_37124237/article/details/81087505 #!/usr/bin/

mysql關於資料庫事務隔離級別測試（包含例項測試語句，及測試結果對比）

1、知識點；事務的四大特性 ACID ；原子性(Atomic):事務是一個整體（無論在該事務中操作任何CRUD），要不全部執行，要不全部不執行。（資料庫能夠進行操作的最小的邏輯單元）一致性(Consistent):組成一個事務的操作是CRUD，要麼全部成功，要

使用coco資料集，faster rcnn類方法訓練出錯解決

問題：在caffe框架下，使用coco資料集進行faster rcnn類方法訓練，得到如下錯誤： File "/data/zn/light_head_rcnn/script/py-RFCN-priv/tools/../lib/rpn/anchor_target_layer.

Detectron 儲存faster rcnn 測試結果，類別 置信度 座標

相關推薦

Detectron 儲存faster rcnn 測試結果，類別置信度座標