torch.nn.NLLLoss() and torch.nn.CrossEntropyLoss()

阿新 • Published: 2021-02-16

Tags: pytorch

torch.nn.NLLLoss()

class torch.nn.NLLLoss(weight=None, size_average=None, ignore_index=-100, reduce=None, reduction='mean')

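Formula: loss(input, class) = -input[class]
Reading the formula: with input = [-0.1187, 0.2110, 0.7463] and target = [1], the loss is -0.2110.
My own reading: it is as if target were converted to a one-hot vector, dot-multiplied with input, and the result negated.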
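
The one-hot reading above can be checked by hand. The following is a minimal pure-Python sketch of that arithmetic, not PyTorch's implementation; the variable names are chosen for illustration only.

```python
# Values taken from the example above.
inputs = [-0.1187, 0.2110, 0.7463]
target = 1

# Build the one-hot vector for the target class.
one_hot = [1.0 if k == target else 0.0 for k in range(len(inputs))]

# Dot product of one-hot vector and input, then negate.
loss = -sum(y * x for y, x in zip(one_hot, inputs))
print(loss)  # -0.211
```

Because the one-hot vector is zero everywhere except at the target index, the dot product simply picks out input[target].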

nn.NLLLoss takes a vector of log-probabilities and a target label as input. NLLLoss() is the negative log-likelihood loss (Negative Log Likelihood).

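The NLLLoss() loss function:
$$\text{loss} = -\frac{1}{N}\sum_{n=1}^{N}\sum_{k} y_k^{(n)}\,\log p_k^{(n)}$$

It is commonly used for multi-class classification. Before input is passed to NLLLoss, it must go through log_softmax, i.e. input is converted into a probability distribution $p$ and the natural logarithm (base e) is taken; NLLLoss itself does not do this.
$y_k$ denotes the one-hot encoded label.
The loss is obtained by multiplying $y_k$ with the log_softmax output, averaging over the $N$ samples, and then negating.
In practice, the labels passed to NLLLoss() are class indices and do not need to be one-hot encoded.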
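
As a hedged illustration of the two points above (log_softmax first, integer labels rather than one-hot), the sketch below compares F.nll_loss with a manual one-hot computation. The batch size, seed, and target values are made up for the example.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
logits = torch.randn(4, 3)            # raw network outputs (hypothetical data)
target = torch.tensor([0, 2, 1, 2])   # integer class indices, not one-hot

# Step 1: turn raw outputs into log-probabilities.
log_probs = F.log_softmax(logits, dim=1)

# Step 2: NLLLoss on log-probabilities with integer labels.
loss = F.nll_loss(log_probs, target)

# Manual version: one-hot labels times log-probabilities, mean, negate.
one_hot = F.one_hot(target, num_classes=3).float()
manual = -(one_hot * log_probs).sum(dim=1).mean()

print(loss.item(), manual.item())
```

The two printed values should agree up to floating-point error, which is why the one-hot encoding step can be skipped when calling NLLLoss().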

Example 1:

import torch
import torch.nn as nn
import torch.nn.functional as F


torch.manual_seed(2019)
output = torch.randn(1, 3)  # network output (raw values, used here only to expose the formula)
target = torch.ones(1, dtype=torch.long).random_(3)  # ground-truth label
print(output)
print(target)
 
# call the functional form directly
loss = F.nll_loss(output, target)
print(loss)
 
# instantiate the class
criterion = nn.NLLLoss()
loss = criterion(output, target)
print(loss)
 
"""
tensor([[-0.1187,  0.2110,  0.7463]])
tensor([1])
tensor(-0.2110)
tensor(-0.2110)
"""

Example 2:
If input has shape M x N, the loss defaults to the mean of the M per-sample losses; reduction='none' returns all M losses individually.

import torch
import torch.nn as nn
import torch.nn.functional as F
 
 
torch.manual_seed(2019)
output = torch.randn(2, 3)  # network output
target = torch.ones(2, dtype=torch.long).random_(3)  # ground-truth labels
print(output)
print(target)
 
# call the functional form directly (default reduction='mean')
loss = F.nll_loss(output, target)
print(loss)
 
# instantiate the class with reduction='none'
criterion = nn.NLLLoss(reduction='none')
loss = criterion(output, target)
print(loss)
 
"""
tensor([[-0.1187,  0.2110,  0.7463],
        [-0.6136, -0.1186,  1.5565]])
tensor([2, 0])
tensor(-0.0664)
tensor([-0.7463,  0.6136])
"""
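
To make the relation between the reduction modes explicit, here is a small sketch that reuses the printed values from Example 2 as hard-coded tensors (so it does not depend on the random seed) and compares 'none', 'mean', and 'sum'.

```python
import torch
import torch.nn.functional as F

# Values copied from the Example 2 output above.
output = torch.tensor([[-0.1187, 0.2110, 0.7463],
                       [-0.6136, -0.1186, 1.5565]])
target = torch.tensor([2, 0])

per_sample = F.nll_loss(output, target, reduction='none')  # one loss per row
mean_loss = F.nll_loss(output, target, reduction='mean')   # default
sum_loss = F.nll_loss(output, target, reduction='sum')

print(per_sample)  # per-row losses: -output[0, 2] and -output[1, 0]
print(mean_loss)   # equals per_sample.mean()
print(sum_loss)    # equals per_sample.sum()
```

With no class weights, 'mean' is just the average of the 'none' result and 'sum' is its total.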

Reference: https://blog.csdn.net/weixin_40476348/article/details/94562240

torch.nn.CrossEntropyLoss()

It applies softmax to the data, then takes the log, then applies NLLLoss. Its relationship to nn.NLLLoss can be written as:

log(softmax(x)) + nn.NLLLoss ====> nn.CrossEntropyLoss

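There is no need to apply softmax to the network output yourself; nn.CrossEntropyLoss applies log-softmax internally.
The expression for nn.CrossEntropyLoss():
$$\text{loss}(x, class) = -\log\frac{\exp(x[class])}{\sum_j \exp(x[j])} = -x[class] + \log\sum_j \exp(x[j])$$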
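
Since the softmax is already built into the loss, applying it beforehand changes the result. The pure-Python sketch below mimics the arithmetic -x[class] + log(sum_j exp(x[j])) with hypothetical helper functions (softmax and cross_entropy_from_logits are names invented for this example, not PyTorch APIs) and shows that feeding already-normalized probabilities gives a different loss.

```python
import math

def softmax(values):
    """Normalize a list of scores into probabilities."""
    exps = [math.exp(v) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy_from_logits(logits, target):
    """-x[class] + log(sum_j exp(x[j])) for a single sample."""
    return -logits[target] + math.log(sum(math.exp(v) for v in logits))

logits = [1.0, 2.0, 3.0]
target = 2

correct = cross_entropy_from_logits(logits, target)           # raw logits in
double = cross_entropy_from_logits(softmax(logits), target)   # softmax applied twice

print(round(correct, 4))  # 0.4076
print(double > correct)   # True
```

The second value is inflated because the softmax squashes the scores into [0, 1] before the loss normalizes them again.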

import torch
import torch.nn as nn
 
a = torch.tensor([[1.0, 2.0, 3.0]])  # logits, shape (1, 3)
target = torch.tensor([2])            # class index (dtype long)
logsoftmax = nn.LogSoftmax(dim=1)     # normalize over the class dimension
ce = nn.CrossEntropyLoss()
nll = nn.NLLLoss()
 
# test CrossEntropyLoss
cel = ce(a, target)
print(cel)
# output: tensor(0.4076)
 
# test LogSoftmax + NLLLoss
lsm_a = logsoftmax(a)
nll_lsm_a = nll(lsm_a, target)
print(nll_lsm_a)
# output: tensor(0.4076)

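So using nn.CrossEntropyLoss directly gives the same result as nn.LogSoftmax + nn.NLLLoss. Why is that? Recall the expression for cross entropy:
$$H(y, x) = -\sum_k y_k \log x_k$$
Here y is the label and x is the prediction (the softmax output). Because y is one-hot, the cross-entropy loss is simply -log of the output x at the target position. That computation is exactly LogSoftmax() followed by NLLLoss().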
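
The equivalence can also be worked through by hand for the logits [1, 2, 3] with target 2. The sketch below uses only the standard library and is an illustration of the arithmetic, not of how PyTorch computes it internally.

```python
import math

logits = [1.0, 2.0, 3.0]
target = 2

# Softmax probabilities x_k.
exps = [math.exp(v) for v in logits]
total = sum(exps)
probs = [e / total for e in exps]

# Cross entropy with a one-hot label y: -sum_k y_k * log(x_k).
one_hot = [1.0 if k == target else 0.0 for k in range(len(logits))]
cross_entropy = -sum(y * math.log(p) for y, p in zip(one_hot, probs))

# LogSoftmax followed by NLL: pick the target entry and negate.
log_softmax = [v - math.log(total) for v in logits]
nll = -log_softmax[target]

print(round(cross_entropy, 4), round(nll, 4))  # 0.4076 0.4076
```

Both routes reduce to -log(probs[target]), matching the tensor(0.4076) printed by the PyTorch code.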

References:
https://blog.csdn.net/watermelon1123/article/details/91044856
https://blog.csdn.net/weixin_40522801/article/details/106616295
