Pedestrian Classification with a CNN

阿新 • Published: 2019-02-15

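In machine learning the dataset is the key. You will have to search for one yourself; it needs both positive and negative samples, and if it is too small the network will not train.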

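Before training, the training files have to be preprocessed. I covered this, with complete code, in my earlier post on image manipulation in Python.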

You can also use the data I have already processed; I will upload it later.
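
As a rough illustration of the file format that input_data.py expects, here is a minimal sketch of how preprocessed samples could be packed into `train_data.bin` and `train_label.bin`. It is not the code from the earlier post. `write_dataset` is a hypothetical helper, and it assumes the images are already grayscale `uint8` arrays of 128 rows by 64 columns, with labels 0 and 1 standing for the negative and positive class (that mapping is also an assumption).

```python
import numpy


def write_dataset(images, labels, data_path, label_path):
    """Hypothetical helper: dump images and labels as raw uint8 bytes.

    Assumes `images` has shape [n, 128, 64] and `labels` has shape [n].
    """
    images = numpy.asarray(images, dtype=numpy.uint8)
    labels = numpy.asarray(labels, dtype=numpy.uint8)
    assert images.shape[1:] == (128, 64), images.shape
    assert images.shape[0] == labels.shape[0]
    # Raw C-order bytes with no header: this is what numpy.fromfile reads back.
    images.tofile(data_path)
    labels.tofile(label_path)
```

Because the files carry no header, the reader has to know the image size in advance, which is why `rows` and `cols` are hard-coded in `extract_images`.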

input_data.py

"""Functions for reading the pedestrian data set (adapted from the TensorFlow MNIST loader)."""
from __future__ import print_function
import gzip
import os
import numpy


def extract_images(filename):
  """Extract the images into a 4D uint8 numpy array [index, y, x, depth]."""
  print('Extracting', filename)
  # Each sample is a 128x64 (rows x cols) single-channel image stored as raw bytes.
  rows = 128
  cols = 64
  data = numpy.fromfile(filename, dtype=numpy.uint8)
  data = data.reshape(-1, rows, cols, 1)
  return data


def dense_to_one_hot(labels_dense, num_classes=2):
  """Convert class labels from scalars to one-hot vectors."""
  num_labels = labels_dense.shape[0]
  index_offset = numpy.arange(num_labels) * num_classes
  labels_one_hot = numpy.zeros((num_labels, num_classes))
  labels_one_hot.flat[index_offset + labels_dense.ravel()] = 1
  return labels_one_hot


def extract_labels(filename, one_hot=False):
  """Extract the labels into a 1D uint8 numpy array [index]."""
  print('Extracting', filename)
  labels = numpy.fromfile(filename, dtype=numpy.uint8)
  if one_hot:
    return dense_to_one_hot(labels)
  return labels


class DataSet(object):

  def __init__(self, images, labels, fake_data=False):
    if fake_data:
      self._num_examples = 10000
    else:
      assert images.shape[0] == labels.shape[0], (
          "images.shape: %s labels.shape: %s" % (images.shape,
                                                 labels.shape))
      self._num_examples = images.shape[0]

      # Convert shape from [num examples, rows, columns, depth]
      # to [num examples, rows*columns] (assuming depth == 1)
      assert images.shape[3] == 1
      images = images.reshape(images.shape[0],
                              images.shape[1] * images.shape[2])
      # Convert from [0, 255] -> [0.0, 1.0].
      images = images.astype(numpy.float32)
      images = numpy.multiply(images, 1.0 / 255.0)
    self._images = images
    self._labels = labels
    self._epochs_completed = 0
    self._index_in_epoch = 0

  @property
  def images(self):
    return self._images

  @property
  def labels(self):
    return self._labels

  @property
  def num_examples(self):
    return self._num_examples

  @property
  def epochs_completed(self):
    return self._epochs_completed

  def next_batch(self, batch_size, fake_data=False):
    """Return the next `batch_size` examples from this data set."""
    if fake_data:
      # One flattened 128x64 image of ones, matching the real input size.
      fake_image = [1.0 for _ in range(128 * 64)]
      fake_label = 0
      return [fake_image for _ in range(batch_size)], [
          fake_label for _ in range(batch_size)]
    start = self._index_in_epoch
    self._index_in_epoch += batch_size
    if self._index_in_epoch > self._num_examples:
      # Finished epoch
      self._epochs_completed += 1
      # Shuffle the data
      perm = numpy.arange(self._num_examples)
      numpy.random.shuffle(perm)
      self._images = self._images[perm]
      self._labels = self._labels[perm]
      # Start next epoch
      start = 0
      self._index_in_epoch = batch_size
      assert batch_size <= self._num_examples
    end = self._index_in_epoch
    return self._images[start:end], self._labels[start:end]


def read_data_sets(train_dir, fake_data=False, one_hot=False):
  class DataSets(object):
    pass
  data_sets = DataSets()

  if fake_data:
    data_sets.train = DataSet([], [], fake_data=True)
    data_sets.validation = DataSet([], [], fake_data=True)
    data_sets.test = DataSet([], [], fake_data=True)
    return data_sets

  TRAIN_IMAGES = 'train_data.bin'
  TRAIN_LABELS = 'train_label.bin'
  TEST_IMAGES = 'test_data.bin'
  TEST_LABELS = 'test_label.bin'
  VALIDATION_SIZE = 500

  local_file = os.path.join(train_dir, TRAIN_IMAGES)
  train_images = extract_images(local_file)

  local_file = os.path.join(train_dir, TRAIN_LABELS)
  train_labels = extract_labels(local_file, one_hot=one_hot)

  local_file = os.path.join(train_dir, TEST_IMAGES)
  test_images = extract_images(local_file)

  local_file = os.path.join(train_dir, TEST_LABELS)
  test_labels = extract_labels(local_file, one_hot=one_hot)

  validation_images = train_images[:VALIDATION_SIZE]
  validation_labels = train_labels[:VALIDATION_SIZE]
  train_images = train_images[VALIDATION_SIZE:]
  train_labels = train_labels[VALIDATION_SIZE:]

  data_sets.train = DataSet(train_images, train_labels)
  data_sets.validation = DataSet(validation_images, validation_labels)
  data_sets.test = DataSet(test_images, test_labels)

  return data_sets
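
To make the label handling concrete, the sketch below shows what a one-hot conversion produces for a few binary labels. It uses `numpy.eye` indexing as an equivalent formulation for illustration, not the flat-index trick in `dense_to_one_hot`; `one_hot_via_eye` is a hypothetical name.

```python
import numpy


def one_hot_via_eye(labels_dense, num_classes=2):
    """Hypothetical alternative: pick rows of the identity matrix."""
    # Cast to a signed integer type so the labels can be used as row indices.
    labels_dense = numpy.asarray(labels_dense, dtype=numpy.int64)
    return numpy.eye(num_classes)[labels_dense.ravel()]


labels = numpy.array([0, 1, 1], dtype=numpy.uint8)
# Each row has a single 1 in the column of its class.
print(one_hot_via_eye(labels))
```

With `one_hot=True`, the labels handed to the network have shape [num_examples, 2], which matches the `y` placeholder in conv_net.py.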

conv_net.py

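import input_data
import tensorflow as tf

# Load the pedestrian data from ./dataset. The variable keeps the name `mnist`
# from the MNIST example this script is adapted from.
mnist = input_data.read_data_sets('dataset', one_hot=True)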

# Parameters
learning_rate = 0.001
training_iters = 100000
batch_size = 128
display_step = 10

# Network Parameters
n_input = 128*64 #  data input (img shape: 128*64)
n_classes = 2 # total classes (0-1)
dropout = 0.50 # Dropout, probability to keep units

# tf Graph input
x = tf.placeholder(tf.float32, [None, n_input])
y = tf.placeholder(tf.float32, [None, n_classes])
keep_prob = tf.placeholder(tf.float32) #dropout (keep probability)

# Create model
def conv2d(img, w, b):
    return tf.nn.relu(tf.nn.bias_add(tf.nn.conv2d(img, w, strides=[1, 1, 1, 1], padding='SAME'),b))

def max_pool(img, k):
    return tf.nn.max_pool(img, ksize=[1, k, k, 1], strides=[1, k, k, 1], padding='SAME')

def conv_net(_X, _weights, _biases, _dropout):
    # Reshape input picture
    _X = tf.reshape(_X, shape=[-1, 128, 64, 1])

    # Convolution Layer
    conv1 = conv2d(_X, _weights['wc1'], _biases['bc1'])
    # Max Pooling (down-sampling)
    conv1 = max_pool(conv1, k=2)
    # Apply Dropout
    conv1 = tf.nn.dropout(conv1, _dropout)

    # Convolution Layer
    conv2 = conv2d(conv1, _weights['wc2'], _biases['bc2'])
    # Max Pooling (down-sampling)
    conv2 = max_pool(conv2, k=2)
    # Apply Dropout
    conv2 = tf.nn.dropout(conv2, _dropout)

    # Fully connected layer
    dense1 = tf.reshape(conv2, [-1, _weights['wd1'].get_shape().as_list()[0]]) # Reshape conv2 output to fit dense layer input
    dense1 = tf.nn.relu(tf.add(tf.matmul(dense1, _weights['wd1']), _biases['bd1'])) # Relu activation
    dense1 = tf.nn.dropout(dense1, _dropout) # Apply Dropout

    # Output, class prediction
    out = tf.add(tf.matmul(dense1, _weights['out']), _biases['out'])
    return out

# Store layers weight & bias
weights = {
    'wc1': tf.Variable(tf.random_normal([5, 5, 1, 32])), # 5x5 conv, 1 input, 32 outputs
    'wc2': tf.Variable(tf.random_normal([5, 5, 32, 64])), # 5x5 conv, 32 inputs, 64 outputs
    'wd1': tf.Variable(tf.random_normal([32*16*64, 1024])), # fully connected, 32*16*64 inputs (128x64 image after two 2x2 poolings, 64 channels), 1024 outputs
    'out': tf.Variable(tf.random_normal([1024, n_classes])) # 1024 inputs, 2 outputs (class prediction)
}

biases = {
    'bc1': tf.Variable(tf.random_normal([32])),
    'bc2': tf.Variable(tf.random_normal([64])),
    'bd1': tf.Variable(tf.random_normal([1024])),
    'out': tf.Variable(tf.random_normal([n_classes]))
}

# Construct model
pred = conv_net(x, weights, biases, keep_prob)

# Define loss and optimizer
# TensorFlow 1.x only accepts named arguments for this op
cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=pred, labels=y))
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate).minimize(cost)

# Evaluate model
correct_pred = tf.equal(tf.argmax(pred,1), tf.argmax(y,1))
accuracy = tf.reduce_mean(tf.cast(correct_pred, tf.float32))

# Initializing the variables
init = tf.global_variables_initializer()  # replaces the deprecated tf.initialize_all_variables()

# Launch the graph
with tf.Session() as sess:
    sess.run(init)
    step = 1
    # Keep training until reach max iterations
    while step * batch_size < training_iters:
        batch_xs, batch_ys = mnist.train.next_batch(batch_size)
        # Fit training using batch data
        sess.run(optimizer, feed_dict={x: batch_xs, y: batch_ys, keep_prob: dropout})
        if step % display_step == 0:
            # Calculate batch accuracy
            acc = sess.run(accuracy, feed_dict={x: batch_xs, y: batch_ys, keep_prob: 1.})
            # Calculate batch loss
            loss = sess.run(cost, feed_dict={x: batch_xs, y: batch_ys, keep_prob: 1.})
            print("Iter " + str(step*batch_size) + ", Minibatch Loss= " + "{:.6f}".format(loss) + ", Training Accuracy= " + "{:.5f}".format(acc))
        step += 1
    print("Optimization Finished!")
    # Calculate accuracy on the first 256 test images
    test_acc = sess.run(accuracy, feed_dict={x: mnist.test.images[:256], y: mnist.test.labels[:256], keep_prob: 1.})
    print("Testing Accuracy: " + str(test_acc))
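
The first dimension of `wd1` has to match the flattened output of the second pooling layer. Below is a small sketch of that arithmetic, assuming `padding='SAME'` so that the stride-1 convolutions keep the spatial size and each pooling with stride `k` yields `ceil(size / k)`. `flattened_size` is a hypothetical helper, not part of the script.

```python
import math


def flattened_size(rows, cols, channels, num_pools, k=2):
    """Hypothetical helper: feature count after `num_pools` SAME poolings."""
    for _ in range(num_pools):
        # SAME padding with stride k gives ceil(size / k) outputs per axis.
        rows = int(math.ceil(rows / float(k)))
        cols = int(math.ceil(cols / float(k)))
    return rows * cols * channels


# 128x64 input, two 2x2 poolings, 64 channels from the second conv layer.
print(flattened_size(128, 64, 64, num_pools=2))  # 32 * 16 * 64 = 32768
```

If the input size or the number of pooling layers changes, the shape of `wd1` has to change with it, otherwise the reshape before the fully connected layer will mix samples together or fail.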

The training results are shown below.
