
MXNet: Defining a New Operator (Layer), Using the Fully Connected Layer as an Example

When working with a deep learning framework, the built-in operators are sometimes not enough for practical needs, and we have to define new operators (layers) ourselves. However, most frameworks are distributed pre-compiled, which makes it hard to modify and rebuild the core code. Taking MXNet as an example, the official documentation describes four ways to define a new operator, using respectively:

1. mx.operator.CustomOp

2. mx.operator.NDArrayOp

3. mx.operator.NumpyOp

4. Implementing the operator in C++ at the backend level

The documentation also gives an example that re-implements the softmax layer. However, that softmax operator is just a forward transformation with no learnable parameters, which does not match the typical use case, and the official docs contain no example of an intermediate layer that carries parameters. This post therefore walks through an example that re-implements the fully connected (dense) operator, which I hope will be helpful.
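For reference, with an input x of shape (batch_size, input_dim), a weight matrix W of shape (num_hidden, input_dim) and a bias b of shape (num_hidden,), the layer computes

    forward:  y  = x · Wᵀ + b
    backward: dW = dyᵀ · x,   dx = dy · W,   db = sum of dy over the batch

and these are exactly the expressions implemented by the forward and backward methods in the code below.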

# pylint: skip-file
import logging

import mxnet as mx
import numpy as np

# mnist_iterator comes from data.py in the MXNet example directory
from data import mnist_iterator

class Dense(mx.operator.CustomOp):

    def __init__(self, num_hidden):
        super(Dense, self).__init__()
        self.num_hidden = num_hidden

    def forward(self, is_train, req, in_data, out_data, aux):
        x = in_data[0]          # (batch_size, input_dim)
        w = in_data[1]          # (num_hidden, input_dim)
        b = in_data[2]          # (num_hidden,)
        # y = x * W^T + b
        y = mx.nd.add(mx.nd.dot(x, w.T), b)
        self.assign(out_data[0], req[0], y)

    def backward(self, req, out_grad, in_data, out_data, in_grad, aux):
        x = in_data[0]
        w = in_data[1]
        dy = out_grad[0]
        # gradients of y = x * W^T + b
        dw = mx.nd.dot(dy.T, x)        # dL/dW = dy^T * x
        dx = mx.nd.dot(dy, w)          # dL/dx = dy * W
        db = mx.nd.sum(dy, axis=0)     # dL/db = sum of dy over the batch
        self.assign(in_grad[0], req[0], dx)
        self.assign(in_grad[1], req[1], dw)
        self.assign(in_grad[2], req[2], db)
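
# Optional sanity check of the backward formulas with plain NumPy (illustrative
# only; the array sizes and the underscore-prefixed names are arbitrary):
# compare the analytical dW against a numerical gradient of L = sum(y * dy).
_x = np.random.randn(4, 5)
_w = np.random.randn(3, 5)
_b = np.random.randn(3)
_dy = np.random.randn(4, 3)            # pretend upstream gradient
_dw = _dy.T.dot(_x)                    # analytical dL/dW = dy^T * x

def _loss(w_):
    return ((_x.dot(w_.T) + _b) * _dy).sum()

_w_eps = _w.copy()
_w_eps[0, 0] += 1e-6
print(abs((_loss(_w_eps) - _loss(_w)) / 1e-6 - _dw[0, 0]))   # should be ~0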


@mx.operator.register("dense")
class DenseProp(mx.operator.CustomOpProp):
    def __init__(self, num_hidden):
        super(DenseProp, self).__init__(need_top_grad=True)
        # All custom-op arguments are passed in as strings, so convert
        # num_hidden back to an integer before using it.
        self.num_hidden = int(num_hidden)

    def list_arguments(self):
        return ['data', 'weight', 'bias']

    def list_outputs(self):
        #  this can be omitted if you only have 1 output.
        return ['output']

    def infer_shape(self, in_shapes):
        data_shape = in_shapes[0]
        weight_shape = (self.num_hidden, in_shapes[0][1])
        bias_shape = (self.num_hidden,)
        output_shape = (data_shape[0], self.num_hidden)
        return [data_shape, weight_shape, bias_shape], [output_shape], []

    def infer_type(self, in_type):
        dtype = in_type[0]
        return [dtype, dtype, dtype], [dtype], []

    def create_operator(self, ctx, in_shapes, in_dtypes):
        #  create and return the CustomOp class.
        return Dense(self.num_hidden)
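
# Quick check (illustrative; the (100, 784) input shape is arbitrary) that the
# registered "dense" op and its shape inference behave as expected:
x_sym = mx.symbol.Variable('x')
fc_test = mx.symbol.Custom(data=x_sym, name='fc_test', op_type='dense', num_hidden=128)
arg_shapes, out_shapes, _ = fc_test.infer_shape(x=(100, 784))
print(arg_shapes)   # expected: [(100, 784), (128, 784), (128,)]
print(out_shapes)   # expected: [(100, 128)]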


# define the MLP
data = mx.symbol.Variable('data')
# fc1 uses the newly defined custom "dense" operator
fc1 = mx.symbol.Custom(data, name='fc1', op_type='dense', num_hidden=128)
act1 = mx.symbol.Activation(data=fc1, name='relu1', act_type='relu')
fc2 = mx.symbol.FullyConnected(data=act1, name='fc2', num_hidden=64)
act2 = mx.symbol.Activation(data=fc2, name='relu2', act_type='relu')
fc3 = mx.symbol.FullyConnected(data=act2, name='fc3', num_hidden=10)
mlp = mx.symbol.SoftmaxOutput(data=fc3, name='softmax')

# MNIST iterators (batch size 100, flattened 784-dim images)
train, val = mnist_iterator(batch_size=100, input_shape=(784,))

logging.basicConfig(level=logging.DEBUG)
model = mx.model.FeedForward(
    ctx=mx.gpu(1), symbol=mlp, num_epoch=20,   # use mx.cpu() if no GPU is available
    learning_rate=0.1, momentum=0.9, wd=0.00001)
model.fit(X=train, eval_data=val,
          batch_end_callback=mx.callback.Speedometer(100, 100))
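
# After training, the old FeedForward API can also report validation accuracy
# (a minimal sketch; FeedForward.score is assumed to be available in the MXNet
# versions that still ship this deprecated API).
val.reset()
print('validation accuracy: %f' % model.score(val))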