Implementing Bi-LSTM+CRF in Keras

阿新 • Published: 2019-02-18

The official Keras release does not yet implement a CRF layer, but the community extension package keras-contrib provides one, so we use that.

Installation method 1

git clone https://www.github.com/farizrahman4u/keras-contrib.git
cd keras-contrib
python setup.py install

Installation method 2

pip install git+https://www.github.com/farizrahman4u/keras-contrib.git

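Method 1 kept failing for me, and a long search on Google did not turn up a fix, so I switched to method 2.
Once installed, the package can be used right away. The repository linked above includes an example, and it is easy to get started.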
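
Before moving on, it can help to confirm that the package is importable in the current environment. The following is a minimal sketch using only the Python 3 standard library. The helper name `is_installed` is my own, and `keras_contrib` is assumed to be the import name of the package installed above.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be found on the import path."""
    return importlib.util.find_spec(module_name) is not None


# `keras_contrib` is assumed to be the import name of keras-contrib.
for name in ("keras", "keras_contrib"):
    print(name, "found" if is_installed(name) else "NOT found")
```

If `keras_contrib` is reported as not found, the install most likely went into a different Python environment than the one running the script.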

Code
The code below was written earlier as a baseline for an experiment.

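from keras import optimizers
from keras.callbacks import ModelCheckpoint
from keras.layers import Input, Embedding, Bidirectional, LSTM, Dropout, TimeDistributed, Dense
from keras.models import Model
from keras_contrib.layers import CRF

# Layer definitions. Index 0 is reserved for padding (mask_zero = True),
# hence len(vocabulary) + 1 rows in the embedding table.
input_layer = Input(shape = (max_len, ))
embedding_layer = Embedding(len(vocabulary) + 1, output_dim = embedding_dim, mask_zero = True)
bi_lstm_layer = Bidirectional(LSTM(64, return_sequences = True))
bi_lstm_drop_layer = Dropout(0.5)
dense_layer = TimeDistributed(Dense(len(tag_list)))
# sparse_target = True: labels are integer tag ids of shape (batch, max_len, 1).
crf_layer = CRF(len(tag_list), sparse_target = True)

# Wire the layers together.
inputs = input_layer
embedding = embedding_layer(inputs)
bi_lstm = bi_lstm_layer(embedding)
bi_lstm_drop = bi_lstm_drop_layer(bi_lstm)
dense = dense_layer(bi_lstm_drop)
crf = crf_layer(dense)

# Keras 2 uses the keyword arguments inputs / outputs.
model = Model(inputs = [inputs], outputs = [crf])
model.summary()

optmr = optimizers.Adam(lr = 0.001, beta_1 = 0.5)
model.compile(optimizer = optmr, loss = crf_layer.loss_function, metrics = [crf_layer.accuracy])

# Save only the model with the best validation loss (the default monitor), then reload it.
check_pointer = ModelCheckpoint(filepath = 'best_model.hdf5', verbose = 1, save_best_only = True)
hist = model.fit(train_x, train_y, batch_size = 32, epochs = 20, verbose = 2, validation_data = (val_x, val_y), callbacks = [check_pointer])
model.load_weights('best_model.hdf5')

# Predicted tag ids per token; the true tag ids sit in the last axis of test_y.
test_y_pred = model.predict(test_x).argmax(-1)
test_y_true = test_y[:, :, 0]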
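
The model above implies a particular data layout: `mask_zero = True` reserves index 0 for padding (hence `len(vocabulary) + 1` embedding rows), and `sparse_target = True` expects integer tag ids with a trailing axis of size 1, which is why `test_y[:, :, 0]` recovers the true tags. Below is a minimal sketch of how such inputs might be built in plain Python. The helper `encode_example`, the toy vocabulary and tag list, and the choice to pad labels with tag id 0 are all assumptions for illustration, not the original preprocessing.

```python
def encode_example(tokens, tags, word_to_id, tag_to_id, max_len):
    """Encode one sentence as (x, y), padded or truncated to max_len.

    x: word ids, with 0 reserved for padding (matches mask_zero = True).
    y: one-element lists of tag ids (matches sparse_target = True).
    """
    tokens, tags = tokens[:max_len], tags[:max_len]
    # Unknown words are not handled in this sketch; they raise KeyError.
    x = [word_to_id[t] for t in tokens]
    y = [[tag_to_id[t]] for t in tags]
    pad = max_len - len(x)
    # Assumption: padded label positions get tag id 0; the mask coming
    # from the embedding layer is expected to exclude them.
    return x + [0] * pad, y + [[0] for _ in range(pad)]


vocabulary = ["John", "lives", "in", "Paris"]  # toy data
tag_list = ["O", "B-PER", "B-LOC"]             # toy data
word_to_id = {w: i + 1 for i, w in enumerate(vocabulary)}  # 0 = padding
tag_to_id = {t: i for i, t in enumerate(tag_list)}

x, y = encode_example(["John", "lives", "in", "Paris"],
                      ["B-PER", "O", "O", "B-LOC"],
                      word_to_id, tag_to_id, max_len=6)
print(x)  # [1, 2, 3, 4, 0, 0]
print(y)  # [[1], [0], [0], [2], [0], [0]]
```

Stacking such pairs with `numpy.array` gives a `train_x` of shape `(n, max_len)` and a `train_y` of shape `(n, max_len, 1)`.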

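Note
keras-contrib does not work on Linux with Python 2.7 and the Theano backend!
I do not know the reason, but after switching the backend to TensorFlow everything worked without problems.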
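
With multi-backend Keras, the backend can be chosen through the `KERAS_BACKEND` environment variable (or the `backend` field in `~/.keras/keras.json`), as long as it is set before `keras` is first imported. A minimal sketch, assuming a multi-backend Keras 2.x installation:

```python
import os

# Must run before the first `import keras`; it has no effect afterwards.
os.environ["KERAS_BACKEND"] = "tensorflow"

# import keras  # uncomment in a real script; Keras then loads TensorFlow
print(os.environ["KERAS_BACKEND"])
```

Setting the variable in the script keeps the choice local to that run, whereas editing `keras.json` changes the default for every script run by that user.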
