Smooth L1 loss and focal loss

阿新 • Published: 2018-12-17

import keras
from . import backend  # package-local helpers (where, gather_nd) used below


def focal(alpha=0.25, gamma=2.0):
    """ Create a functor for computing the focal loss.

    Args
        alpha: Scale the focal weight with alpha.
        gamma: Take the power of the focal weight with gamma.

    Returns
        A functor that computes the focal loss using the alpha and gamma.
    """
    def _focal(y_true, y_pred):
        """ Compute the focal loss given the target tensor and the predicted tensor.

        As defined in https://arxiv.org/abs/1708.02002

        Args
            y_true: Tensor of target data from the generator with shape (B, N, num_classes + 1); the last channel holds the anchor state.
            y_pred: Tensor of predicted data from the network with shape (B, N, num_classes).

        Returns
            The focal loss of y_pred w.r.t. y_true.
        """
        labels         = y_true[:, :, :-1]
        anchor_state   = y_true[:, :, -1]  # -1 for ignore, 0 for background, 1 for object
        classification = y_pred

        # filter out "ignore" anchors
        indices        = backend.where(keras.backend.not_equal(anchor_state, -1))
        labels         = backend.gather_nd(labels, indices)
        classification = backend.gather_nd(classification, indices)

        # compute the focal loss
        alpha_factor = keras.backend.ones_like(labels) * alpha
        alpha_factor = backend.where(keras.backend.equal(labels, 1), alpha_factor, 1 - alpha_factor)
        focal_weight = backend.where(keras.backend.equal(labels, 1), 1 - classification, classification)
        focal_weight = alpha_factor * focal_weight ** gamma

        cls_loss = focal_weight * keras.backend.binary_crossentropy(labels, classification)

        # compute the normalizer: the number of positive anchors
        normalizer = backend.where(keras.backend.equal(anchor_state, 1))
        normalizer = keras.backend.cast(keras.backend.shape(normalizer)[0], keras.backend.floatx())
        normalizer = keras.backend.maximum(1.0, normalizer)

        return keras.backend.sum(cls_loss) / normalizer

    return _focal


def smooth_l1(sigma=3.0):
    """ Create a smooth L1 loss functor.

    Args
        sigma: Controls where the loss changes from L2 to L1, namely at |x| = 1 / sigma^2.

    Returns
        A functor for computing the smooth L1 loss given target data and predicted data.
    """
    sigma_squared = sigma ** 2

    def _smooth_l1(y_true, y_pred):
        """ Compute the smooth L1 loss of y_pred w.r.t. y_true.

        Args
            y_true: Tensor from the generator of shape (B, N, 5). The last value for each box is the state of the anchor (ignore, negative, positive).
            y_pred: Tensor from the network of shape (B, N, 4).

        Returns
            The smooth L1 loss of y_pred w.r.t. y_true.
        """
        # separate target and state
        regression        = y_pred
        regression_target = y_true[:, :, :-1]
        anchor_state      = y_true[:, :, -1]

        # keep only positive anchors (background and "ignore" anchors are dropped)
        indices           = backend.where(keras.backend.equal(anchor_state, 1))
        regression        = backend.gather_nd(regression, indices)
        regression_target = backend.gather_nd(regression_target, indices)

        # compute smooth L1 loss
        # f(x) = 0.5 * (sigma * x)^2          if |x| < 1 / sigma / sigma
        #        |x| - 0.5 / sigma / sigma    otherwise
        regression_diff = regression - regression_target
        regression_diff = keras.backend.abs(regression_diff)
        regression_loss = backend.where(
            keras.backend.less(regression_diff, 1.0 / sigma_squared),
            0.5 * sigma_squared * keras.backend.pow(regression_diff, 2),
            regression_diff - 0.5 / sigma_squared
        )

        # compute the normalizer: the number of positive anchors
        normalizer = keras.backend.maximum(1, keras.backend.shape(indices)[0])
        normalizer = keras.backend.cast(normalizer, dtype=keras.backend.floatx())
        return keras.backend.sum(regression_loss) / normalizer

    return _smooth_l1
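
To check the two losses by hand, it can help to have a framework-free version. The following is a minimal NumPy sketch of the same computations, not part of the original file: the names `focal_np` and `smooth_l1_np` are hypothetical, and the `eps` clipping is an assumption standing in for whatever numerical safeguards `keras.backend.binary_crossentropy` applies. It follows the same conventions as the code above: the last channel of `y_true` is the anchor state (-1 ignore, 0 background, 1 object), and both losses are divided by the number of positive anchors (at least 1).

```python
import numpy as np


def focal_np(y_true, y_pred, alpha=0.25, gamma=2.0, eps=1e-7):
    """NumPy sketch of the focal loss (hypothetical helper, for checking only)."""
    labels = y_true[:, :, :-1]
    anchor_state = y_true[:, :, -1]

    # drop "ignore" anchors; boolean indexing yields shape (K, num_classes)
    keep = anchor_state != -1
    labels = labels[keep]
    # eps clipping is an assumption to keep log() finite
    p = np.clip(y_pred[keep], eps, 1.0 - eps)

    # alpha for positive labels, 1 - alpha for negative labels
    alpha_factor = np.where(labels == 1, alpha, 1.0 - alpha)
    # (1 - p)^gamma for positive labels, p^gamma for negative labels
    focal_weight = alpha_factor * np.where(labels == 1, 1.0 - p, p) ** gamma
    bce = -(labels * np.log(p) + (1.0 - labels) * np.log(1.0 - p))

    # normalize by the number of positive anchors, at least 1
    normalizer = max(1.0, float(np.sum(anchor_state == 1)))
    return float(np.sum(focal_weight * bce) / normalizer)


def smooth_l1_np(y_true, y_pred, sigma=3.0):
    """NumPy sketch of the smooth L1 loss (hypothetical helper, for checking only)."""
    sigma_squared = sigma ** 2
    target = y_true[:, :, :-1]
    anchor_state = y_true[:, :, -1]

    # only positive anchors contribute to the regression loss
    positive = anchor_state == 1
    diff = np.abs(y_pred[positive] - target[positive])

    # quadratic below 1 / sigma^2, linear above
    loss = np.where(
        diff < 1.0 / sigma_squared,
        0.5 * sigma_squared * diff ** 2,
        diff - 0.5 / sigma_squared,
    )
    normalizer = max(1.0, float(np.sum(positive)))
    return float(np.sum(loss) / normalizer)


if __name__ == "__main__":
    # one image, three anchors, two classes: object of class 0, background, ignore
    cls_true = np.array([[[1, 0, 1], [0, 0, 0], [0, 0, -1]]], dtype=float)
    cls_pred = np.array([[[0.9, 0.1], [0.2, 0.1], [0.5, 0.5]]])
    print("focal loss:", focal_np(cls_true, cls_pred))

    # one image, two anchors: a positive one and a background one
    reg_true = np.array([[[0, 0, 0, 0, 1], [0, 0, 0, 0, 0]]], dtype=float)
    reg_pred = np.array([[[0.1, 1.0, 0.0, 0.0], [5.0, 5.0, 5.0, 5.0]]])
    print("smooth L1 loss:", smooth_l1_np(reg_true, reg_pred))
```

In the toy data, changing the prediction for the ignored anchor leaves the focal loss unchanged, and the background anchor's regression output has no effect on the smooth L1 loss, which matches the masking in the Keras code above.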
