
Choosing an Object Detection Framework Written in TensorFlow

Recently I needed to train a DNN model for an object detection task. In the past, I used the object detection framework from TensorFlow's 'models' repository. But there are two reasons I couldn't use it for my own project:

First, it's too big. There are more than two hundred Python files in this subproject. It contains different types of models, such as ResNet and MobileNet, and different types of detection algorithms, such as R-CNN and SSD (Single Shot Detector). A large codebase is usually hard to understand and customize, and building a new project by referencing a big old one can cost a lot of time.
Second, it uses too many configuration files; even understanding these configs is tedious work.

Then I found a better project on GitHub: SSD-Tensorflow. It's simple: it contains fewer than fifty Python files. You can start training with a single command:

Shell
DATASET_DIR=./train_tfrecords
TRAIN_DIR=./logs/
CHECKPOINT_PATH=./logs/
CUDA_VISIBLE_DEVICES=0,1
python train_ssd_network.py \
    --train_dir=${TRAIN_DIR} \
    --dataset_dir=${DATASET_DIR} \
    --dataset_name=pascalvoc_2007 \
    --dataset_split_name=train \
    --model_name=ssd_512_vgg \
    --checkpoint_path=${CHECKPOINT_PATH} \
    --save_summaries_secs=60 \
    --save_interval_secs=600 \
    --weight_decay=0.0005 \
    --optimizer=rmsprop \
    --learning_rate=0.001 \
    --end_learning_rate=0.000001 \
    --learning_rate_decay_factor=0.96 \
    --num_epochs_per_decay=5.0 \
    --batch_size=8 \
    --gpu_memory_fraction=1.0 \
    --num_clones=2

No configuration files, no ProtoBuf formats.
Although simple and easy to use, this project can still be extended. We can add new datasets by changing the code in the '/datasets' directory and add new networks in '/nets'. Let's look at the basic implementation of its VGG backbone for SSD:

Python
# nets/ssd_vgg_512.py
with tf.variable_scope(scope, 'ssd_512_vgg', [inputs], reuse=reuse):
    # Original VGG-16 blocks.
    net = slim.repeat(inputs, 2, slim.conv2d, 64, [3, 3], scope='conv1')
    end_points['block1'] = net
    net = slim.max_pool2d(net, [2, 2], scope='pool1')
    # Block 2.
    net = slim.repeat(net, 2, slim.conv2d, 128, [3, 3], scope='conv2')
    end_points['block2'] = net
    net = slim.max_pool2d(net, [2, 2], scope='pool2')
    # Block 3.
    net = slim.repeat(net, 3, slim.conv2d, 256, [3, 3], scope='conv3')
    end_points['block3'] = net
    net = slim.max_pool2d(net, [2, 2], scope='pool3')
    # Block 4.
    net = slim.repeat(net, 3, slim.conv2d, 512, [3, 3], scope='conv4')
    end_points['block4'] = net
    net = slim.max_pool2d(net, [2, 2], scope='pool4')
    # Block 5.
    net = slim.repeat(net, 3, slim.conv2d, 512, [3, 3], scope='conv5')
    end_points['block5'] = net
    net = slim.max_pool2d(net, [3, 3], 1, scope='pool5')
    ...
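Most of the lines above lean on slim.repeat, which applies the same layer operation N times and auto-numbers the scopes. Here is a minimal, TF-free sketch of that behavior (the repeat helper and the fake layer below are illustrative stand-ins, not TF-Slim's actual implementation, which also nests the numbered scopes under a parent scope):

```python
# A simplified, TF-free sketch of what slim.repeat does: call the same
# layer function `repetitions` times, threading the output of one call
# into the next and numbering each scope.
def repeat(inputs, repetitions, layer_fn, *args, scope='repeat'):
    net = inputs
    for i in range(repetitions):
        net = layer_fn(net, *args, scope='%s_%d' % (scope, i + 1))
    return net

# A fake conv layer that just records how it was called.
applied = []
def fake_conv2d(net, num_outputs, kernel_size, scope=None):
    applied.append((scope, num_outputs, kernel_size))
    return net

# Mirrors slim.repeat(inputs, 2, slim.conv2d, 64, [3, 3], scope='conv1')
# from the block above:
repeat('x', 2, fake_conv2d, 64, [3, 3], scope='conv1')
# applied == [('conv1_1', 64, [3, 3]), ('conv1_2', 64, [3, 3])]
```

So 'conv1' in the network definition actually stands for two stacked 3x3 convolutions, which is why the VGG blocks read so compactly.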

Quite straightforward, isn't it?
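As for extending the project, the '/datasets' and '/nets' directories boil down to a simple factory/registry pattern: each module is registered in a name-to-implementation map, and the --dataset_name and --model_name flags pick an entry at training time. A TF-free sketch of that pattern (the function and variable names here are illustrative, not the project's exact API):

```python
# Sketch of the registry pattern behind --dataset_name: each dataset
# module exposes a loader function and registers it in one map.
# Names are illustrative placeholders.

def pascalvoc_2007(split_name, dataset_dir):
    """Stand-in for an existing dataset module's loader."""
    return {'name': 'pascalvoc_2007', 'split': split_name, 'dir': dataset_dir}

def my_new_dataset(split_name, dataset_dir):
    """Adding a dataset = writing a loader like this and registering it."""
    return {'name': 'my_new_dataset', 'split': split_name, 'dir': dataset_dir}

# The registry: one entry per module in /datasets.
datasets_map = {
    'pascalvoc_2007': pascalvoc_2007,
    'my_new_dataset': my_new_dataset,
}

def get_dataset(name, split_name, dataset_dir):
    """Dispatch on the dataset name, as the training script does."""
    if name not in datasets_map:
        raise ValueError('Unknown dataset name: %s' % name)
    return datasets_map[name](split_name, dataset_dir)

get_dataset('my_new_dataset', 'train', './train_tfrecords')
```

The '/nets' directory can be extended the same way: write a new network module and register it under the name you want to pass via --model_name.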