
Loss function choice: Dice-coefficient loss function or cross-entropy

Answer source: https://stats.stackexchange.com/questions/321460/dice-coefficient-loss-function-vs-cross-entropy

Cross-entropy is the most widely used loss function, but Dice is often used for segmentation. The differences between the two are as follows:

One compelling reason for using cross-entropy over dice-coefficient or the similar IoU metric is that the gradients are nicer.

The gradient of cross-entropy wrt the logits is something like $p - t$, where $p$ is the softmax output and $t$ is the target. Meanwhile, if we try to write the Dice coefficient in a differentiable form, $\frac{2pt}{p^2+t^2}$ or $\frac{2pt}{p+t}$, then the resulting gradients wrt $p$ are much uglier: $\frac{2t(t^2-p^2)}{(p^2+t^2)^2}$ and $\frac{2t^2}{(p+t)^2}$. It's easy to imagine a case where both $p$ and $t$ are small, and the gradient blows up to some huge value. In general, it seems likely that training will become more unstable.
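
To make the instability concrete, here is a minimal plain-Python sketch evaluating both per-pixel gradient expressions for the $\frac{2pt}{p^2+t^2}$ form; the probability/target values are made up purely for illustration:

```python
# Per-pixel gradients discussed above:
#   cross-entropy wrt the logit:        p - t
#   soft Dice 2pt/(p^2+t^2) wrt p:      2t(t^2 - p^2) / (p^2 + t^2)^2
def ce_grad(p, t):
    return p - t                                    # always bounded in [-1, 1]

def dice_grad(p, t):
    return 2 * t * (t**2 - p**2) / (p**2 + t**2) ** 2

# When both p and t are small, the Dice gradient explodes while the
# cross-entropy gradient stays small.
for p, t in [(0.5, 1.0), (0.001, 0.01), (1e-5, 1e-3)]:
    print(f"p={p:g} t={t:g}  ce_grad={ce_grad(p, t):+.4f}  dice_grad={dice_grad(p, t):+.1f}")
```

With p and t both around 1e-3, the Dice gradient is on the order of 1e3, while the cross-entropy gradient never exceeds 1 in magnitude.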


The main reason that people try to use the Dice coefficient or IoU directly is that the actual goal is maximization of those metrics, and cross-entropy is just a proxy which is easier to maximize using backpropagation. In addition, the Dice coefficient performs better on class-imbalanced problems by design.
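
For reference, here is a minimal sketch of such a differentiable ("soft") Dice loss in PyTorch, assuming binary segmentation with sigmoid outputs; the function name and the epsilon smoothing term are illustrative choices, not taken from the answer above. Because the overlap is normalised by the total foreground mass rather than averaged per pixel, a small foreground class is not drowned out by the background, which is the "by design" robustness to imbalance mentioned above.

```python
import torch

def soft_dice_loss(probs: torch.Tensor, targets: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """1 - soft Dice, averaged over the batch.

    probs:   sigmoid outputs in [0, 1], shape (N, H, W)
    targets: binary ground-truth masks, shape (N, H, W)
    """
    probs = probs.flatten(1)                        # (N, H*W)
    targets = targets.flatten(1).float()
    intersection = (probs * targets).sum(dim=1)     # |P ∩ T| per sample
    denom = probs.sum(dim=1) + targets.sum(dim=1)   # |P| + |T| per sample
    dice = (2 * intersection + eps) / (denom + eps)
    return 1.0 - dice.mean()
```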

However, class imbalance is typically taken care of simply by assigning loss multipliers to each class, such that the network is strongly disincentivized to ignore a class which appears infrequently, so it's unclear that the Dice coefficient is really necessary in these cases.
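
As a sketch of what per-class loss multipliers look like in practice, PyTorch's CrossEntropyLoss accepts a per-class weight tensor; the weight values and tensor shapes below are made up for illustration:

```python
import torch
import torch.nn as nn

# Hypothetical per-class multipliers, e.g. roughly inverse class frequency,
# so the network is penalised heavily for ignoring the rare class.
class_weights = torch.tensor([0.1, 1.0, 5.0])        # background, common, rare

criterion = nn.CrossEntropyLoss(weight=class_weights)

logits = torch.randn(4, 3, 64, 64)                   # (batch, classes, H, W) raw scores
targets = torch.randint(0, 3, (4, 64, 64))           # per-pixel integer labels
loss = criterion(logits, targets)
print(loss.item())
```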


I would start with cross-entropy loss, which seems to be the standard loss for training segmentation networks, unless there is a really compelling reason to use the Dice coefficient.

In short, Dice does not really make the results better; it just makes the results look better.