resnet50訓練cifar10，請各位高手指正

阿新 • • 發佈：2018-12-05

使用resnet50從頭訓練cifar10，最終結果只有84%左右，貌似和論文差很多，請各位高手指正。

首先加入cifar10的資料結構程式碼：

import cifar10,cifar10_input
import tensorflow as tf
import numpy as np
import time

#max_steps = 100000
max_steps = 100
data_dir = 'cifar-10-batches-bin'
batch_size = 128

# 配置每個 GPU 上佔用的記憶體的比例
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0 
.95)
sess = tf.Session(config=tf.ConfigProto(gpu_options=gpu_options))

images_train, labels_train = cifar10_input.distorted_inputs(data_dir=data_dir,batch_size=batch_size)
images_test, labels_test = cifar10_input.inputs(eval_data=True,data_dir=data_dir,batch_size=batch_size)

sess = tf.InteractiveSession()
tf.global_variables_initializer().run()

tf.train.start_queue_runners()

start_time = time 
.time()
image_batch,label_batch = sess.run([images_train,labels_train])
duration = time.time() - start_time

print('Use Time = %.3f sec'%duration)

start_time = time.time()
image_batch,label_batch = sess.run([images_train,labels_train])
duration = time.time() - start_time

print('Use Time = %.3f sec' 
%duration)

加入resnet50的程式碼：

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import tensorflow as tf

import resnet_utils


resnet_arg_scope = resnet_utils.resnet_arg_scope
slim = tf.contrib.slim


class NoOpScope(object):
  """No-op context manager."""

  def __enter__(self):
    return None

  def __exit__(self, exc_type, exc_value, traceback):
    return False


@slim.add_arg_scope
def bottleneck(inputs,
               depth,
               depth_bottleneck,
               stride,
               rate=1,
               outputs_collections=None,
               scope=None,
               use_bounded_activations=False):
  """Bottleneck residual unit variant with BN after convolutions.

  This is the original residual unit proposed in [1]. See Fig. 1(a) of [2] for
  its definition. Note that we use here the bottleneck variant which has an
  extra bottleneck layer.

  When putting together two consecutive ResNet blocks that use this unit, one
  should use stride = 2 in the last unit of the first block.

  Args:
    inputs: A tensor of size [batch, height, width, channels].
    depth: The depth of the ResNet unit output.
    depth_bottleneck: The depth of the bottleneck layers.
    stride: The ResNet unit's stride. Determines the amount of downsampling of
      the units output compared to its input.
    rate: An integer, rate for atrous convolution.
    outputs_collections: Collection to add the ResNet unit output.
    scope: Optional variable_scope.
    use_bounded_activations: Whether or not to use bounded activations. Bounded
      activations better lend themselves to quantized inference.

  Returns:
    The ResNet unit's output.
  """
  with tf.variable_scope(scope, 'bottleneck_v1', [inputs]) as sc:
    depth_in = slim.utils.last_dimension(inputs.get_shape(), min_rank=4)
    if depth == depth_in:
      shortcut = resnet_utils.subsample(inputs, stride, 'shortcut')
    else:
      shortcut = slim.conv2d(
          inputs,
          depth, [1, 1],
          stride=stride,
          activation_fn=tf.nn.relu6 if use_bounded_activations else None,
          scope='shortcut')

    residual = slim.conv2d(inputs, depth_bottleneck, [1, 1], stride=1,
                           scope='conv1')
    residual = resnet_utils.conv2d_same(residual, depth_bottleneck, 3, stride,
                                        rate=rate, scope='conv2')
    residual = slim.conv2d(residual, depth, [1, 1], stride=1,
                           activation_fn=None, scope='conv3')

    if use_bounded_activations:
      # Use clip_by_value to simulate bandpass activation.
      residual = tf.clip_by_value(residual, -6.0, 6.0)
      output = tf.nn.relu6(shortcut + residual)
    else:
      output = tf.nn.relu(shortcut + residual)

    return slim.utils.collect_named_outputs(outputs_collections,
                                            sc.name,
                                            output)

def resnet_v1(inputs,
              blocks,
              num_classes=None,
              is_training=True,
              global_pool=True,
              output_stride=None,
              include_root_block=True,
              spatial_squeeze=True,
              store_non_strided_activations=False,
              reuse=None,
              scope=None):
  """Generator for v1 ResNet models.

  This function generates a family of ResNet v1 models. See the resnet_v1_*()
  methods for specific model instantiations, obtained by selecting different
  block instantiations that produce ResNets of various depths.

  Training for image classification on Imagenet is usually done with [224, 224]
  inputs, resulting in [7, 7] feature maps at the output of the last ResNet
  block for the ResNets defined in [1] that have nominal stride equal to 32.
  However, for dense prediction tasks we advise that one uses inputs with
  spatial dimensions that are multiples of 32 plus 1, e.g., [321, 321]. In
  this case the feature maps at the ResNet output will have spatial shape
  [(height - 1) / output_stride + 1, (width - 1) / output_stride + 1]
  and corners exactly aligned with the input image corners, which greatly
  facilitates alignment of the features to the image. Using as input [225, 225]
  images results in [8, 8] feature maps at the output of the last ResNet block.

  For dense prediction tasks, the ResNet needs to run in fully-convolutional
  (FCN) mode and global_pool needs to be set to False. The ResNets in [1, 2] all
  have nominal stride equal to 32 and a good choice in FCN mode is to use
  output_stride=16 in order to increase the density of the computed features at
  small computational and memory overhead, cf. http://arxiv.org/abs/1606.00915.

  Args:
    inputs: A tensor of size [batch, height_in, width_in, channels].
    blocks: A list of length equal to the number of ResNet blocks. Each element
      is a resnet_utils.Block object describing the units in the block.
    num_classes: Number of predicted classes for classification tasks.
      If 0 or None, we return the features before the logit layer.
    is_training: whether batch_norm layers are in training mode. If this is set
      to None, the callers can specify slim.batch_norm's is_training parameter
      from an outer slim.arg_scope.
    global_pool: If True, we perform global average pooling before computing the
      logits. Set to True for image classification, False for dense prediction.
    output_stride: If None, then the output will be computed at the nominal
      network stride. If output_stride is not None, it specifies the requested
      ratio of input to output spatial resolution.
    include_root_block: If True, include the initial convolution followed by
      max-pooling, if False excludes it.
    spatial_squeeze: if True, logits is of shape [B, C], if false logits is
        of shape [B, 1, 1, C], where B is batch_size and C is number of classes.
        To use this parameter, the input images must be smaller than 300x300
        pixels, in which case the output logit layer does not contain spatial
        information and can be removed.
    store_non_strided_activations: If True, we compute non-strided (undecimated)
      activations at the last unit of each block and store them in the
      `outputs_collections` before subsampling them. This gives us access to
      higher resolution intermediate activations which are useful in some
      dense prediction problems but increases 4x the computation and memory cost
      at the last unit of each block.
    reuse: whether or not the network and its variables should be reused. To be
      able to reuse 'scope' must be given.
    scope: Optional variable_scope.

  Returns:
    net: A rank-4 tensor of size [batch, height_out, width_out, channels_out].
      If global_pool is False, then height_out and width_out are reduced by a
      factor of output_stride compared to the respective height_in and width_in,
      else both height_out and width_out equal one. If num_classes is 0 or None,
      then net is the output of the last ResNet block, potentially after global
      average pooling. If num_classes a non-zero integer, net contains the
      pre-softmax activations.
    end_points: A dictionary from components of the network to the corresponding
      activation.

  Raises:
    ValueError: If the target output_stride is not valid.
  """
  with tf.variable_scope(scope, 'resnet_v1', [inputs], reuse=reuse) as sc:
    end_points_collection = sc.original_name_scope + '_end_points'
    with slim.arg_scope([slim.conv2d, bottleneck,
                         resnet_utils.stack_blocks_dense],
                        outputs_collections=end_points_collection):
      with (slim.arg_scope([slim.batch_norm], is_training=is_training)
            if is_training is not None else NoOpScope()):
        net = inputs
        if include_root_block:
          if output_stride is not None:
            if output_stride % 4 != 0:
              raise ValueError('The output_stride needs to be a multiple of 4.')
            output_stride /= 4
          net = resnet_utils.conv2d_same(net, 64, 7, stride=2, scope='conv1')
          net = slim.max_pool2d(net, [3, 3], stride=2, scope='pool1')
        net = resnet_utils.stack_blocks_dense(net, blocks, output_stride,
                                              store_non_strided_activations)
        # Convert end_points_collection into a dictionary of end_points.
        end_points = slim.utils.convert_collection_to_dict(
            end_points_collection)

        if global_pool:
          # Global average pooling.
          net = tf.reduce_mean(net, [1, 2], name='pool5', keep_dims=True)
          end_points['global_pool'] = net
        if num_classes:
          net = slim.conv2d(net, num_classes, [1, 1], activation_fn=None,
                            normalizer_fn=None, scope='logits')
          end_points[sc.name + '/logits'] = net
          if spatial_squeeze:
            net = tf.squeeze(net, [1, 2], name='SpatialSqueeze')
            end_points[sc.name + '/spatial_squeeze'] = net
          end_points['predictions'] = slim.softmax(net, scope='predictions')
        return net, end_points
resnet_v1.default_image_size = 224

def resnet_v1_block(scope, base_depth, num_units, stride):
  """Helper function for creating a resnet_v1 bottleneck block.

  Args:
    scope: The scope of the block.
    base_depth: The depth of the bottleneck layer for each unit.
    num_units: The number of units in the block.
    stride: The stride of the block, implemented as a stride in the last unit.
      All other units have stride=1.

  Returns:
    A resnet_v1 bottleneck block.
  """
  return resnet_utils.Block(scope, bottleneck, [{
      'depth': base_depth * 4,
      'depth_bottleneck': base_depth,
      'stride': 1
  }] * (num_units - 1) + [{
      'depth': base_depth * 4,
      'depth_bottleneck': base_depth,
      'stride': stride
  }])


def resnet_v1_50(inputs,
                 num_classes=None,
                 is_training=True,
                 global_pool=True,
                 output_stride=None,
                 spatial_squeeze=True,
                 store_non_strided_activations=False,
                 reuse=None,
                 scope='resnet_v1_50'):
  """ResNet-50 model of [1]. See resnet_v1() for arg and return description."""
  blocks = [
      resnet_v1_block('block1', base_depth=64, num_units=3, stride=2),
      resnet_v1_block('block2', base_depth=128, num_units=4, stride=2),
      resnet_v1_block('block3', base_depth=256, num_units=6, stride=2),
      resnet_v1_block('block4', base_depth=512, num_units=3, stride=1),
  ]
  return resnet_v1(inputs, blocks, num_classes, is_training,
                   global_pool=global_pool, output_stride=output_stride,
                   include_root_block=True, spatial_squeeze=spatial_squeeze,
                   store_non_strided_activations=store_non_strided_activations,
                   reuse=reuse, scope=scope)

resnet_v1_50.default_image_size = 24

建立resnet50網路

height, width = 24, 24

image_holder = tf.placeholder(tf.float32, [batch_size, 24, 24, 3])
label_holder = tf.placeholder(tf.int32, [batch_size])

with slim.arg_scope(resnet_arg_scope()):
    net, end_points = resnet_v1_50(image_holder,10)
    print(end_points)

#看看網路情況
tf.global_variables_initializer().run()
see_net,see_end_points= sess.run([net,end_points],feed_dict={image_holder:image_batch,label_holder:label_batch})

print(see_net.shape)#應該是最後輸出值
print(see_end_points)#應該是整個網路節點引數

定義loss函式，優化器：

def loss(logits, labels):
    labels = tf.cast(labels, tf.int64)
    cross_entropy = tf.nn.sparse_softmax_cross_entropy_with_logits(
        logits=logits, labels=labels, name='cross_entropy_per_example')
    cross_entropy_mean = tf.reduce_mean(cross_entropy, name='cross_entropy')
    tf.add_to_collection('losses', cross_entropy_mean)

    return cross_entropy_mean

gloss = loss(net, label_holder)

learnrate = 1e-3
train_op = tf.train.AdamOptimizer(learnrate).minimize(gloss)

#看看網路情況
tf.global_variables_initializer().run()
see_loss= sess.run([gloss],feed_dict={image_holder:image_batch,label_holder:label_batch})
print(see_loss)

訓練網路：

import math
def SeePre():
    top_k_op = tf.nn.in_top_k(net, label_holder, 1)
    num_examples = 10000
    num_iter = int(math.ceil(num_examples / batch_size))
    true_count = 0  
    total_sample_count = num_iter * batch_size
    step = 0
    while step < num_iter:
        image_batch,label_batch = sess.run([images_test,labels_test])
        predictions = sess.run([top_k_op],feed_dict={image_holder: image_batch,
                                                 label_holder:label_batch})
        true_count += np.sum(predictions)
        step += 1
    precision = true_count / total_sample_count
    print('precision @ 1 = %.3f' % precision)

max_steps = 300000
SeePre()
for step in range(max_steps):
    start_time = time.time()
    image_batch,label_batch = sess.run([images_train,labels_train])
    _, nowloss = sess.run([train_op, gloss],feed_dict={image_holder: image_batch, 
                                                         label_holder:label_batch})
    duration = time.time() - start_time
    if step % 100 == 0:
        print('step = %d Use Time = %.3f sec loss=%.6f'%(step,duration,nowloss))
    if step % 1000 == 0:
        SeePre()
    if step % 10000 == 0:
        learnrate = 0.95*learnrate
SeePre()

最終結果0.84左右，和論文中說的94%相差很大啊，不知道以上程式碼是哪裡出問題了呢？

resnet50訓練cifar10，請各位高手指正

使用resnet50從頭訓練cifar10，最終結果只有84%左右，貌似和論文差很多，請各位高手指正。首先加入cifar10的資料結構程式碼： import cifar10,cifar10_input import tensorflow as tf import numpy as

java求陣列的平衡點，請各位高手看看對否？

2007年5月去一箇中小型外企（在朝陽的甜水園）的上機題，求陣列的平衡點，不知道答對了沒有？ package myAction;public class Balence { /** * 求陣列index左邊的和 * @param a * @param index *

IntelliJ IDEA 在使用manven後的糾結（每次修改程式碼都要重啟tomcat才能看效果嗎？），請各位大俠來看看問題

在加入manven後每次都要從其tomcat 或者重新package才能看到修改的效果這樣對於程式原來說很瘋狂，反正我是快瘋了，都不想用manven了。直接上圖：上圖為沒用manven之前的專案 project Structure的配置也在，每次直接編譯在tomca

(新手)Java課程作業，請各位老哥指教：綜合運用巢狀if選擇結構、switch選擇結構、多重if選擇結構實現商品換購功能

綜合運用巢狀if選擇結構、switch選擇結構、多重if選擇結構實現商品換購功能下面是我自己的程式碼，功能雖然基本滿足，但是感覺好臃腫，很不簡潔，有更好的方法嗎？import java.util.Scanner; public class Homework1_3 { pu

DFT演算法的理解和實現，望各位高手指點指點（謝謝）

DFT的公式：其中X(k)表示DFT變換後的資料，x(n)為取樣的模擬訊號,公式中的x(n)可以為覆信號，實際當中x(n)都是實訊號，即虛部為0,此時公式可以展開為：從這個公式可以看出，變換後的資料就是原訊號對cos和sin的相關操作，即進行相乘求和（

漢諾塔學習筆記，有不正確的地方請小夥伴們指正~·~

學習順序執行 == cab -1 nbsp 什麽猜想 abc 1* n=3.abc; 2* n-1=2,acb; 3* n-1=1,abc 1* n=3,執行hanoi(n-1,A,C,B); =>2* n-1=2,acb執行hanoi

KICKSTART無人值守安裝服務-學藝不精-請各位大神多多指正

watermark bsp ron yum 自動安裝系統 isa sta 9.png first Linux系統批量自動安裝實現原理將手動安裝的所有詳細步驟記錄到一個文件中，然後有一種軟件通過讀取這個文件就可以實現自動化安裝系統；工具這個工具叫做kickstart，kic

原生JS編寫了個簡易進度條，還請各位前輩指教~

classname 學習 UNC TP .com 開始學習 com get 能說剛開始學習JS不久，以及第一次來到博客園，第一次進行分享博文。。。噢，不對，不能說是分享，而是學習請教，請前輩多多指教，各個方面都可以~ 感謝您的路過~ <!DOC

java關於硬件接口操作方面的疑惑，請高手解惑。

人的一生這樣的不同的深深 ofo 無法會有沒有似的 <p>　　人的一生是要；經歷許多階段的，比如說純真無邪的少年時代，激情如火的青春歲月，厚重沈穩的中年時期，從容淡定的人生暮年。每個時候都有獨特的風景，每段歲月都會給人不同的感受。可進入中年的她，突然

Hadoop文章收集彙總 - 如禁止轉載，請及時聯絡本人收集學習網際網路各位前輩分享的文章

工具自動自動整合文章列表與URL 公眾號名稱標題作者釋出時間 Hadoop實操如何使用Sentry管理Hive倉庫目錄外的其他目錄的acl同步 Fays

ACM 給你一個整數Q，找出一個最小的正整數N，使得它的各位之積等於Q，如果不存在，請輸出-1 輸入：第一行為組數，

#include<iostream> #include<stdio.h> using namespace std; bool smallten(int data) { if((data<10)

樹莓派無人機-資料整理，請做過的大佬，多多指正

樹莓派3B和樹莓派3B+有了，無人機有了，PX4飛控有了，還是一臉懵逼，看過老外的樹莓派無人機+4G圖傳視訊，用到拓展版，一難買，二窮；國內大神也有做出來的了；參考連結：樹莓派2手工打造Linux APM飛控國內首個Linux開源飛控 0.2秒延遲數

android 3d遊戲研究（二）（邊學邊寫，多謝高手指正，鞠躬）：資料庫

android中的資料庫按儲存位置分為兩種：1，系統目錄下的資料庫；2，sdcard下資料庫首先來說系統目錄下的資料庫：一般位置：/data/data/APK包名/databases/xx.db （xx 資料庫名稱）看下下面的類： import android.c

淺結在OJ中的輸入格式問題（總結可能多處不足與錯誤，發現請各位大咔評論指導）

1#include<stdio.h>int main() { int a,b; scanf("%d %d",&a, &b); printf("%d\n",a+b); //最簡單的輸入 return 0; } 2.A+B Problem (EOF)

自己寫的資料庫連結類，請高手指點一下。

using System; using System.Collections.Generic; using System.Text; using System.Data; using System.Data.OleDb; namespace DatabaseCo

cifar10模型訓練完，用於識別單個圖片報錯解決方案

（InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [72,384] rhs shape= [9216,384

請各位大神幫我看看，這是什麼原因造成的。。。。。。

[INFO] Scanning for projects... [WARNING] [WARNING] Some problems were encountered while building the effective model for org.springfram

asp實現修改access資料庫，但是一點提交就出錯，請高手看看怎麼回事

程式碼：  <% exec="select * from student2 where id="&request.QueryString("id") set rs=server.Crea

vue2.0組件之間的傳值--新入坑，請指教

fine ext sets mode tro exp ted pro -s prop down emit up 嘿嘿如果是第一次接觸vue2.0組件傳值的肯定很疑惑，這是什麽意思（大神總結的，我也就是拿來用用） “down”—>指的是下的意思，即父

DirectX 安裝報錯: 不能信任一個安裝所需的壓縮文件，請檢查加密服務是否啟用並且cabinet文件證書是否有效

建議長時間頁面檢查 ould get 浪費 images 跳轉 DirectX 安裝報錯不能信任一個安裝所需的壓縮文件，請檢查加密服務是否啟用並且cabinet文件證書是否有效是直播軟件open broadcaster software,這個軟件安裝的時候提示“y

resnet50訓練cifar10，請各位高手指正

相關推薦