Using pooling indices in a Keras SegNet

阿新 • Published: 2019-01-11

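Keras has no built-in way to use pooling indices. I recently studied SegNet, whose downsampling (pooling) and upsampling (unpooling) stages depend on them; many implementations found online are wrong because they skip the pooling indices entirely. The approach here relies on custom layers.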
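To make the idea concrete before the Keras code, here is a minimal NumPy sketch of what "pooling indices" means. It is an illustration only, not the layer implementation: the helper names `max_pool_with_argmax` and `max_unpool` are made up for this example, and it assumes a single 2-D feature map with non-overlapping windows.

```python
import numpy as np


def max_pool_with_argmax(x, size=2):
    """Non-overlapping max pooling on a 2-D array (hypothetical helper).

    Returns the pooled values and, for each window, the flat index of
    its maximum in the original array.
    """
    h, w = x.shape
    assert h % size == 0 and w % size == 0
    # Rearrange into (h // size, w // size) windows of size * size elements.
    windows = (x.reshape(h // size, size, w // size, size)
                .transpose(0, 2, 1, 3)
                .reshape(h // size, w // size, size * size))
    local = windows.argmax(axis=-1)   # position of the max inside each window
    pooled = windows.max(axis=-1)
    # Convert window-local positions to flat indices into x.
    rows = np.arange(h // size)[:, None] * size + local // size
    cols = np.arange(w // size)[None, :] * size + local % size
    return pooled, rows * w + cols


def max_unpool(pooled, indices, out_shape):
    """Scatter pooled values back to their recorded positions; zeros elsewhere."""
    out = np.zeros(out_shape, dtype=pooled.dtype)
    out.flat[indices.ravel()] = pooled.ravel()
    return out


x = np.array([[1, 2, 0, 1],
              [3, 4, 1, 0],
              [0, 1, 9, 2],
              [2, 0, 3, 4]])
pooled, idx = max_pool_with_argmax(x)
restored = max_unpool(pooled, idx, x.shape)
print(pooled)
print(idx)
print(restored)
```

The unpooled map is sparse: each maximum returns to the exact position it came from, which is the information that plain upsampling discards.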
The complete test code follows.

"""
@author: LiShiHang
@software: PyCharm
@file: utils.py
@time: 2018/12/18 14:58
"""
from keras.engine import Layer
import keras.backend as K


class MaxPoolingWithArgmax2D(Layer):

    def __init__(
            self,
            pool_size=(2, 2),
            strides=(2, 2),
            padding='same',
            **kwargs):
        super(MaxPoolingWithArgmax2D, self).__init__(**kwargs)
        self.padding = padding
        self.pool_size = pool_size
        self.strides = strides

    def call(self, inputs, **kwargs):
        padding = self.padding
        pool_size = self.pool_size
        strides = self.strides
        if K.backend() == 'tensorflow':
            ksize = [1, pool_size[0], pool_size[1], 1]
            padding = padding.upper()
            strides = [1, strides[0], strides[1], 1]
            output, argmax = K.tf.nn.max_pool_with_argmax(
                inputs,
                ksize=ksize,
                strides=strides,
                padding=padding)
        else:
            errmsg = '{} backend is not supported for layer {}'.format(
                K.backend(), type(self).__name__)
            raise NotImplementedError(errmsg)
        argmax = K.cast(argmax, K.floatx())
        return [output, argmax]

    def compute_output_shape(self, input_shape):
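        # Per-axis downsampling ratio (batch, height, width, channels), taken from
        # the configured strides; assumes height and width divide evenly.
        ratio = (1, self.strides[0], self.strides[1], 1)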
        output_shape = [
            dim // ratio[idx]
            if dim is not None else None
            for idx, dim in enumerate(input_shape)]
        output_shape = tuple(output_shape)
        return [output_shape, output_shape]

    def compute_mask(self, inputs, mask=None):
        return 2 * [None]


class MaxUnpooling2D(Layer):
    def __init__(self, up_size=(2, 2), **kwargs):
        super(MaxUnpooling2D, self).__init__(**kwargs)
        self.up_size = up_size

    def call(self, inputs, output_shape=None):

        updates, mask = inputs[0], inputs[1]
        with K.tf.variable_scope(self.name):
            mask = K.cast(mask, 'int32')
            input_shape = K.tf.shape(updates, out_type='int32')
            # compute the output shape when none is given
            if output_shape is None:
                output_shape = (
                    input_shape[0],
                    input_shape[1] * self.up_size[0],
                    input_shape[2] * self.up_size[1],
                    input_shape[3])

            # compute the batch, row, column and channel index of every value
            one_like_mask = K.ones_like(mask, dtype='int32')
            batch_shape = K.concatenate(
                [[input_shape[0]], [1], [1], [1]],
                axis=0)
            batch_range = K.reshape(
                K.tf.range(output_shape[0], dtype='int32'),
                shape=batch_shape)
            b = one_like_mask * batch_range
            y = mask // (output_shape[2] * output_shape[3])
            x = (mask // output_shape[3]) % output_shape[2]
            feature_range = K.tf.range(output_shape[3], dtype='int32')
            f = one_like_mask * feature_range

            # transpose indices & reshape update values to one dimension
            updates_size = K.tf.size(updates)
            indices = K.transpose(K.reshape(
                K.stack([b, y, x, f]),
                [4, updates_size]))
            values = K.reshape(updates, [updates_size])
            ret = K.tf.scatter_nd(indices, values, output_shape)
            return ret

    def compute_output_shape(self, input_shape):
        mask_shape = input_shape[1]
        return (
            mask_shape[0],
            mask_shape[1] * self.up_size[0],
            mask_shape[2] * self.up_size[1],
            mask_shape[3]
        )


if __name__ == '__main__':

    import keras
    import numpy as np

    # input = keras.layers.Input((4, 4, 3))
    # o = MaxPoolingWithArgmax2D()(input)
    # model = keras.Model(inputs=input, outputs=o)  # outputs=o
    # model.compile(optimizer="adam", loss='categorical_crossentropy')
    # x = np.random.randint(0, 100, (3, 4, 4, 3))  # set a breakpoint here
    # m = model.predict(x)  # set a breakpoint here
    # print(m)

    input = keras.layers.Input((4, 4, 3))
    o = MaxPoolingWithArgmax2D()(input)
    o2 = MaxUnpooling2D()(o)
    model = keras.Model(inputs=input, outputs=o2)  # outputs=o
    model.compile(optimizer="adam", loss='categorical_crossentropy')
    x = np.random.randint(0, 100, (3, 4, 4, 3))  # set a breakpoint here
    m = model.predict(x)  # set a breakpoint here
    print(m)

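If you are interested, uncomment the block above and step through it in a debugger to inspect the pooled values and their indices.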
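The index arithmetic inside `MaxUnpooling2D.call` can be checked by hand. The sketch below assumes that each index is flattened per sample as `(y * width + x) * channels + c`, which is what the layer's expressions for `y` and `x` imply; `decode_flat_index` is a hypothetical helper written for this illustration and is not part of the code above.

```python
import numpy as np


def decode_flat_index(flat, width, channels):
    """Recover (row, column, channel) from a per-sample flat index.

    Assumes flat == (y * width + x) * channels + c.
    """
    y = flat // (width * channels)
    x = (flat // channels) % width
    c = flat % channels
    return int(y), int(x), int(c)


height, width, channels = 4, 4, 3
# Flat index of position (y=2, x=1, c=0) in a 4x4x3 feature map.
flat = int(np.ravel_multi_index((2, 1, 0), (height, width, channels)))
print(flat)
print(decode_flat_index(flat, width, channels))
```

The layer does not compute the channel from the index; it uses a plain `range` over the channels instead. Under the assumption above the two agree, because max pooling operates on each channel independently.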