Pytorch在NLP中的簡單應用詳解

阿新 • • 發佈：2020-01-09

因為之前在專案中一直使用Tensorflow，最近需要處理NLP問題，對Pytorch框架還比較陌生，所以特地再學習一下pytorch在自然語言處理問題中的簡單使用，這裡做一個記錄。

一、Pytorch基礎

首先，第一步是匯入pytorch的一系列包

import torch
import torch.autograd as autograd #Autograd為Tensor所有操作提供自動求導方法
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim

1）Tensor張量

a) 建立Tensors

#tensor
x = torch.Tensor([[1,2,3],[4,5,6]])
#size為2x3x4的隨機數隨機數
x = torch.randn((2,3,4))

b) Tensors計算

x = torch.Tensor([[1,2],[3,4]])
y = torch.Tensor([[5,6],[7,8]])
z = x+y

c) Reshape Tensors

x = torch.randn(2,4)
#拉直
x = x.view(-1)
#4*6維度
x = x.view(4,6)

2）計算圖和自動微分

a) Variable變數

#將Tensor變為Variable
x = autograd.Variable(torch.Tensor([1,3]),requires_grad = True)
#將Variable變為Tensor
y = x.data

b) 反向梯度演算法

x = autograd.Variable(torch.Tensor([1,2]),requires_grad=True)
y = autograd.Variable(torch.Tensor([3,4]),requires_grad=True)
z = x+y
#求和
s = z.sum()
#反向梯度傳播
s.backward()
print(x.grad)

c) 線性對映

linear = nn.Linear(3,5) #三維線性對映到五維
x = autograd.Variable(torch.randn(4,3))
#輸出為（4,5）維
y = linear(x)

d) 非線性對映（啟用函式的使用）

x = autograd.Variable(torch.randn(5))
#relu啟用函式
x_relu = F.relu(x)
print(x_relu)
x_soft = F.softmax(x)
#softmax啟用函式
print(x_soft)
print(x_soft.sum())

output:

Variable containing:
-0.9347
-0.9882
 1.3801
-0.1173
 0.9317
[torch.FloatTensor of size 5]
 
Variable containing:
 0.0481
 0.0456
 0.4867
 0.1089
 0.3108
[torch.FloatTensor of size 5]
 
Variable containing:
 1
[torch.FloatTensor of size 1]
 
Variable containing:
-3.0350
-3.0885
-0.7201
-2.2176
-1.1686
[torch.FloatTensor of size 5]

二、Pytorch建立網路

1) word embedding詞嵌入

通過nn.Embedding(m,n)實現，m表示所有的單詞數目，n表示詞嵌入的維度。

word_to_idx = {'hello':0,'world':1}
embeds = nn.Embedding(2,5) #即兩個單詞，單詞的詞嵌入維度為5
hello_idx = torch.LongTensor([word_to_idx['hello']])
hello_idx = autograd.Variable(hello_idx)
hello_embed = embeds(hello_idx)
print(hello_embed)

output:

Variable containing:
-0.6982 0.3909 -1.0760 -1.6215 0.4429
[torch.FloatTensor of size 1x5]

2) N-Gram 語言模型

先介紹一下N-Gram語言模型，給定一個單詞序列，計算，其中是序列的第個單詞。

import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.autograd as autograd
import torch.optim as optim
 
from six.moves import xrange

對句子進行分詞：

context_size = 2
embed_dim = 10
text_sequence = """When forty winters shall besiege thy brow,And dig deep trenches in thy beauty's field,Thy youth's proud livery so gazed on now,Will be a totter'd weed of small worth held:
Then being asked,where all thy beauty lies,Where all the treasure of thy lusty days;
To say,within thine own deep sunken eyes,Were an all-eating shame,and thriftless praise.
How much more praise deserv'd thy beauty's use,If thou couldst answer 'This fair child of mine
Shall sum my count,and make my old excuse,'
Proving his beauty by succession thine!
This were to be new made when thou art old,And see thy blood warm when thou feel'st it cold.""".split()
#分詞
trigrams = [ ([text_sequence[i],text_sequence[i+1]],text_sequence[i+2]) for i in xrange(len(text_sequence) - 2) ]
trigrams[:10]

分詞的形式為：

#建立vocab索引
vocab = set(text_sequence)
word_to_ix = {word: i for i,word in enumerate(vocab)}

建立N-Gram Language model

#N-Gram Language model
class NGramLanguageModeler(nn.Module): 
 def __init__(self,vocab_size,embed_dim,context_size):
  super(NGramLanguageModeler,self).__init__()
  #詞嵌入
  self.embedding = nn.Embedding(vocab_size,embed_dim)
  #兩層線性分類器
  self.linear1 = nn.Linear(embed_dim*context_size,128)
  self.linear2 = nn.Linear(128,vocab_size)
  
 def forward(self,input):
  embeds = self.embedding(input).view((1,-1)) #2,10拉直為20
  out = F.relu(self.linear1(embeds))
  out = F.relu(self.linear2(out))
  log_probs = F.log_softmax(out)
  return log_probs

輸出模型看一下網路結構

#輸出模型看一下網路結構
model = NGramLanguageModeler(96,10,2)
print(model)

定義損失函式和優化器

#定義損失函式以及優化器
loss_function = nn.NLLLoss()
optimizer = optim.SGD(model.parameters(),lr = 0.01)
model = NGramLanguageModeler(len(vocab),context_size)
losses = []

模型訓練

#模型訓練
for epoch in xrange(10):
 total_loss = torch.Tensor([0])
 for context,target in trigrams:
  #1.處理資料輸入為索引向量
  #print(context)
  #注：python3中map函式前要加上list()轉換為列表形式
  context_idxs = list(map(lambda w: word_to_ix[w],context))
  #print(context_idxs)
  context_var = autograd.Variable( torch.LongTensor(context_idxs) )
 
  
  #2.梯度清零
  model.zero_grad()
  
  #3.前向傳播，計算下一個單詞的概率
  log_probs = model(context_var)
  
  #4.損失函式
  loss = loss_function(log_probs,autograd.Variable(torch.LongTensor([word_to_ix[target]])))
  
  #反向傳播及梯度更新
  loss.backward()
  optimizer.step()
  
  total_loss += loss.data 
 losses.append(total_loss)
print(losses)

以上這篇Pytorch在NLP中的簡單應用詳解就是小編分享給大家的全部內容了，希望能給大家一個參考，也希望大家多多支援我們。

Pytorch在NLP中的簡單應用詳解

pyecharts在資料視覺化中的應用詳解

使用pyecharts進行資料視覺化安裝 pip install pyecharts 也可以在pycharm軟體裡進行下載pyecharts庫包。

C#中FlagsAttribute屬性在enum中的應用詳解

Net C#中列舉的宣告格式如下所示： [attributes] [modifiers] enum identifier [:base-type] {enumerator-list} [;]

C# Winfom 中ListBox的簡單用法詳解

1、如何新增listBox的值 this.listBox1.Items.Add(\"張曉東\"); 2、如何判斷listBox集合是否新增過

MySQL資料庫8——資料庫中函式的應用詳解

資料庫中內建函式的使用該篇主要介紹資料庫中內建函式的使用，主要有日期函式，字串函式，數學函式。

pytorch::Dataloader中的迭代器和生成器應用詳解

在使用pytorch訓練模型，經常需要載入大量圖片資料，因此pytorch提供了好用的資料載入工具Dataloader。

Python collections中的雙向佇列deque簡單介紹詳解

前言在python神書《Python+Cookbook》中有這麼一段話：在佇列兩端插入或刪除元素時間複雜度都是 O(1) ，而在列表的開頭插入或刪除元素的時間複雜度為 O(N)。

python中property屬性的介紹及其應用詳解

Python的property屬性的功能是：property屬性內部進行一系列的邏輯計算，最終將計算結果返回。

RxJS在TypeScript中的簡單使用詳解

1. 安裝 # 安裝 typescript， rxjs 包 npm install -D typescript @types/node npm install rxjs 2. 使用

Python中的特殊方法以及應用詳解

前言 Python 中的特殊方法主要是為了被直譯器呼叫的，因此應該儘量使用 len(my_object) 而不是 my_object.__len__() 這種寫法。在執行 len(my_object) 時，Python 直譯器會自行呼叫 my_object 中實現的 __len__ 方法

JS中佇列和雙端佇列實現及應用詳解

佇列佇列雙端佇列資料結構應用用擊鼓傳花遊戲模擬迴圈佇列用雙端對列檢查一個詞是否構成迴文

SpringBoot中的響應式web應用詳解

簡介在Spring 5中，Spring MVC引入了webFlux的概念，webFlux的底層是基於reactor-netty來的，而reactor-netty又使用了Reactor庫。

SpringBoot中dubbo+zookeeper實現分散式開發的應用詳解

總體實現思路是啟動一個生產者專案註冊,將所含服務註冊到zookeeper的註冊中心,然後在啟動一個消費者專案,將所需服務向zookeeper註冊中心進行訂閱,等待註冊中心的通知

IDEA中Git的基本應用詳解

基於Git的專案操作安裝Git工具 Git是版本控制系統，可以藉助Git實現團隊程式碼版本控制及管理，

Java中lombok的@Builder註解的解析與簡單使用詳解

Lombok中@Builder用法 1、建造者模式簡介：Builder 使用建立者模式又叫建造者模式。簡單來說，就是一步步建立一個物件，它對使用者遮蔽了裡面構建的細節，但卻可以精細地控制物件的構造過程。

JavaScript中正則表示式的實際應用詳解

實際工作中，javascript正則表示式還是經常用到的。所以這部分的知識是非常重要的。

vue中{{}},v-text和v-html區別與應用詳解

{{}}獲取值，不會清空標籤原有內容 v-text 獲取值，會清空標籤原有內容，輸出的是純文字

Java web攔截器inteceptor原理及應用詳解

這篇文章主要介紹了java web攔截器inteceptor原理及應用詳解,文中通過示例程式碼介紹的非常詳細，對大家的學習或者工作具有一定的參考學習價值,需要的朋友可以參考下

DragChartPanel可拖拽曲線應用詳解

DragChartPanel 是java cs架構中的一種圖形展現的開源元件。業務需求需要用到DragChartPanel ，這是一種根據jtable表格中的資料給與展示的圖形元件。它和其他圖形元件區別再與它可以進行拖拽，使用者通過它不僅可以

mysql中workbench例項詳解

MySQL Workbench - 建模和設計工具 1.模型是大多數有效和高效能資料庫的核心。MySQL workbench具有允許開發人員和資料庫管理員視覺化地建立物理資料庫設計模型的工具，這些模型可以使用正向工程輕鬆轉換為MySQL資料庫

Pytorch在NLP中的簡單應用詳解

相關推薦