Common tips for building and using TensorFlow models
In TensorFlow-based deep-learning experiments you often need to customize parts of the model: loss functions, activation functions, attention layers, and so on. This post records the custom-model techniques I use most often.
1. Custom activation functions
Defining a custom gelu activation function
# gelu activation function (tanh approximation)
import numpy as np
import tensorflow as tf

def gelu(x):
    return 0.5 * x * (1 + tf.tanh(np.sqrt(2 / np.pi) * (x + 0.044715 * tf.pow(x, 3))))
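A quick sanity check of the function (my addition; it assumes TensorFlow 2.x eager execution): gelu(0) should be exactly 0, and inputs far from 0 should pass through almost unchanged on the positive side and be suppressed on the negative side.
# expected output is roughly [-0.004, 0.0, 2.996]
print(gelu(tf.constant([-3.0, 0.0, 3.0])).numpy())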
Using the custom activation function in a model
from keras.layers import Input, Dense, Dropout, Embedding, LSTM

# image-feature branch (2048-d feature vector)
inputs1 = Input(shape=(2048,))
fe1 = Dropout(0.5)(inputs1)
fe2 = Dense(256, activation=gelu)(fe1)

# text branch; max_length1, vocab_size and embedding_dim are defined elsewhere
inputs2 = Input(shape=(max_length1,))
se1 = Embedding(vocab_size, embedding_dim, mask_zero=True)(inputs2)
se2 = Dropout(0.5)(se1)
x = LSTM(128, return_sequences=True, activation=gelu)(se2)
x = Dropout(0.5)(x)
x = LSTM(256)(x)
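The snippet above stops at the LSTM output. A minimal sketch of one way to finish and compile it (the merge head, the softmax output over vocab_size and the compile/save settings are my assumptions, not the original code), so that the loading step below has a file to read:
from keras.layers import Add
from keras.models import Model

# hypothetical decoder head: merge the two 256-d branches and predict the next token
decoder1 = Add()([fe2, x])
decoder2 = Dense(256, activation=gelu)(decoder1)
outputs = Dense(vocab_size, activation='softmax')(decoder2)
model = Model(inputs=[inputs1, inputs2], outputs=outputs)
model.compile(loss='categorical_crossentropy', optimizer='adam')
# after training, save the model so it can be reloaded below
model.save('saved_model/inceptionV3_LSTM2_200.h5')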
Loading a model that uses the custom activation function
from keras.models import load_model
from keras.utils import get_custom_objects
from keras.layers import Activation

# register gelu before loading: dict.update() returns None, so its result must
# not be passed inline as the custom_objects argument
get_custom_objects().update({'gelu': Activation(gelu)})
model = load_model('saved_model/inceptionV3_LSTM2_200.h5')
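Equivalently, the name-to-function mapping can be passed straight to load_model instead of updating the global registry:
model = load_model('saved_model/inceptionV3_LSTM2_200.h5',
                   custom_objects={'gelu': gelu})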
2. Building a model with a custom layer class
Defining the custom class
# TPA attention: project the stacked hidden states with a learned kernel
from keras import backend as K
from keras.layers import Layer

class CalculateScoreMatrix(Layer):
    def __init__(self, output_dim=None, **kwargs):
        self.output_dim = output_dim
        super(CalculateScoreMatrix, self).__init__(**kwargs)

    def get_config(self):
        # keep output_dim in the config so the layer can be rebuilt from a saved model
        config = super().get_config().copy()
        config.update({'output_dim': self.output_dim})
        return config

    def build(self, input_shape):
        self.kernel = self.add_weight(name='kernel',
                                      shape=(input_shape[-1], self.output_dim),
                                      initializer='uniform',
                                      trainable=True)
        super(CalculateScoreMatrix, self).build(input_shape)

    def call(self, x):
        # project the input onto the kernel: x · W
        return K.dot(x, self.kernel)
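A quick standalone shape check of the layer on dummy data (my addition; the batch size of 2, the 64 output units and the 5 time steps are arbitrary, and it assumes the backend lets a layer be called directly on a constant tensor):
import numpy as np

layer = CalculateScoreMatrix(64)
out = layer(K.constant(np.random.rand(2, 64, 5)))   # input shape: (batch, hidden, time)
print(K.int_shape(out))                             # expected: (2, 64, 64) via the (5, 64) kernel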
Building the model
# TPA-LSTM: temporal pattern attention over the LSTM hidden states
from keras import backend as K
from keras.layers import (Input, Dense, Dropout, Embedding, LSTM, Lambda,
                          Reshape, Activation, Multiply, Add, Flatten)

# feature branch, as in section 1
# (tf_swish is another custom activation defined like gelu above; definition omitted)
inputs1 = Input(shape=(2048,))
fe1 = Dropout(0.5)(inputs1)
fe2 = Dense(256, activation=tf_swish)(fe1)

# sequence branch
inputs2 = Input(shape=(max_length1,))
se1 = Embedding(vocab_size, embedding_dim, mask_zero=True)(inputs2)
se2 = Dropout(0.5)(se1)
x = LSTM(64, return_sequences=True)(se2)

# split the hidden states: H = steps 1..t-1, ht = the last step t
H = Lambda(lambda x: x[:, :-1, :])(x)
ht = Lambda(lambda x: x[:, -1, :])(x)
ht = Reshape((64, 1))(ht)

# transpose H to (batch, hidden, time) to form HC, then score it against ht
HC = Lambda(lambda x: K.permute_dimensions(x, [0, 2, 1]))(H)
score_mat = CalculateScoreMatrix(64)(HC)
score_mat = Lambda(lambda x: K.batch_dot(x[0], x[1]))([score_mat, ht])

# attention weights and context vector
score_mat = Activation("sigmoid")(score_mat)
attn_mat = Multiply()([HC, score_mat])
attn_vec = Lambda(lambda x: K.sum(x, axis=-1))(attn_mat)
wvt = Dense(units=64 * 4, activation=None)(attn_vec)
wht = Dense(units=64 * 4, activation=None)(Flatten()(ht))
yht = Add()([wht, wvt])
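As in section 1, the snippet stops at the attention output yht. A minimal sketch of one way to finish the model (the fusion with fe2, the softmax head and the compile/save settings are my assumptions, not the original code):
from keras.models import Model

# hypothetical head: combine the attention summary with the image features
decoder = Add()([fe2, yht])          # both are 256-d, since 64 * 4 == 256
outputs = Dense(vocab_size, activation='softmax')(decoder)
model = Model(inputs=[inputs1, inputs2], outputs=outputs)
model.compile(loss='categorical_crossentropy', optimizer='adam')
model.save('saved_model/model_inception_TPA_lstm1_30.h5')   # reloaded below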
Loading a model that uses the custom class
from keras.models import load_model
from keras.utils import get_custom_objects

# as above, register the custom layer first rather than passing the None
# returned by dict.update() as the custom_objects argument
get_custom_objects().update({'CalculateScoreMatrix': CalculateScoreMatrix})
model = load_model('saved_model/model_inception_TPA_lstm1_30.h5')
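Another option, assuming a Keras version that exports CustomObjectScope, is to limit the registration to a with-block instead of changing the global registry:
from keras.utils import CustomObjectScope

with CustomObjectScope({'CalculateScoreMatrix': CalculateScoreMatrix}):
    model = load_model('saved_model/model_inception_TPA_lstm1_30.h5')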
3. Fancy learning-rate schedules and custom loss functions: to be filled in over the next few days.
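Until that section is written, here is a minimal sketch of the general pattern (the hand-written cross-entropy and the step-decay schedule are placeholders of my own, not the eventual write-up): a custom loss is just a function of (y_true, y_pred) that returns a per-sample tensor, and a learning-rate schedule can be attached with the LearningRateScheduler callback.
from keras import backend as K
from keras.callbacks import LearningRateScheduler

def custom_categorical_crossentropy(y_true, y_pred):
    # placeholder: categorical cross-entropy written out by hand, with clipping
    y_pred = K.clip(y_pred, K.epsilon(), 1.0 - K.epsilon())
    return -K.sum(y_true * K.log(y_pred), axis=-1)

def step_decay(epoch):
    # placeholder: start at 1e-3 and halve the learning rate every 10 epochs
    return 0.001 * (0.5 ** (epoch // 10))

model.compile(loss=custom_categorical_crossentropy, optimizer='adam')
lr_callback = LearningRateScheduler(step_decay)
# model.fit([X_img, X_seq], y, epochs=30, callbacks=[lr_callback])   # X_img, X_seq, y are hypothetical names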