Faster RCNN pytroch訓練問題：Warning: NaN or Inf found in input tensor.

阿新 • • 發佈：2020-12-19

problem

在自己的資料（voc格式）上訓練Faster RCNN（https://github.com/jwyang/faster-rcnn.pytorch）就出現了loss=nan的問題。
在Pascal voc和coco上訓練Faster RCNN都正常。

reason

可能是learning rate太大，調小learning rate。最有效的方法是learning rate設為0，看看是不是還有nan的問題。
大概率是自己的資料有問題（我的資料是voc格式），voc獲取左邊後是要減1的，如果你的資料的座標框本身就是從0開始的，那減1就會導致超出影象邊界。

solution

設定lr=0，如果不在出現loss=nan的問題，說明是learning rate太大，導致了梯度爆炸或梯度消失。可調整learning rate和weight decay。
如果lr=0後，依然存在loss=nan的問題，就修改pascal_voc.py中獲取座標框的程式碼：

原始碼
x1 = float(bbox.find('xmin').text) - 1
y1 = float(bbox.find('ymin').text) - 1
x2 = float(bbox.find('xmax').text) - 1
y2 = float(bbox.find('ymax').text) 
 - 1
修改後
x1 = float(bbox.find('xmin').text) 
y1 = float(bbox.find('ymin').text) 
x2 = float(bbox.find('xmax').text) 
y2 = float(bbox.find('ymax').text)

若設定了翻轉（cfg.TRAIN.USE_FLIPPED = True），則需要在imdb.py中的def append_flipped_images(self)方法：

原始碼
boxes[:, 0] = widths[i] - oldx2 - 1
boxes[:, 2] = widths[i] 
 - oldx1 - 1
修改後
boxes[:, 0] = widths[i] - oldx2
boxes[:, 2] = widths[i] - oldx1

總結（可能導致loss=nan的情況）[2]

Coordinates out of the image resolution------------> NaN Loss
xmin=xmax-----------> Results in NaN Loss
ymin==ymax-----------> Results in Nan Loss
The size of bounding box was very small-----------> Results in NaN Loss

For the 4th case, we put a condition that the difference of |xmax -xmin| >= 20 and similarly |ymax- ymin| >=20

[1]https://github.com/VisionLearningGroup/DA_Detection/issues/11
[2]https://github.com/jwyang/faster-rcnn.pytorch/issues/136

Faster RCNN pytroch訓練問題：Warning: NaN or Inf found in input tensor.

技術標籤：pythonpytorch深度學習pytorch problem 在自己的資料（voc格式）上訓練Faster RCNN（https://github.com/jwyang/faster-rcnn.pytorch）就出現了loss=nan的問題。在Pascal voc和coco上訓練Faster RCNN都

Error-React.js：Warning: Each child in an array or iterator should have a unique "key" prop.

ylbtech-Error-React.js：Warning: Each child in an array or iterator should have a unique \"key\" prop.

【深度學習：目標檢測】1.1 Faster RCNN理論合集

1. R-CNN簡介 2014年之前都是使用傳統方法進行目標檢測，準確率僅30%左右，R-CNN出現後提升了30%的準確率。

啟動時出現錯誤：*** Warning - bad CRC or NAND

燒寫NAND Flash時出現錯誤：*** Warning - bad CRC or NAND, using default environment在對NAND Flash燒寫了bootstrap和U-Boot之後，啟動目標板，發現有如下顯示的錯誤：U-Boot 2009.11-rc2 (Jun 15 2012 - 12:59:2

計算機網路：Faster RCNN

Faster RCNN=Fast RCNN+RPN Faster RCNN可以分為四個部分： 1）Conv Layer：特徵提取網路，通過一組conv+relu+pooling來提取影象的feature map，用於後續的RPN來提取proposal

Pytorch訓練過程出現nan的解決方式

今天使用shuffleNetV2+，使用自己的資料集，遇到了loss是nan的情況，而且top1精確率出現斷崖式上升，這顯示是不正常的。

RCNN + Fast RCNN + Faster RCNN

影象分類影象定位目標檢測和例項分割目標檢測的發展歷程（論文時間）圖片來自https://github.com/hoya012/deep_learning_object_detection#2014

藍橋杯演算法訓練：網路流裸題

題目描述：網路流裸題問題描述　　一個有向圖，求1到N的最大流輸入格式　　第一行N M，表示點數與邊數

藍橋杯演算法訓練：Yaroslav and Algorithm

題目描述：網路流裸題問題描述　　（這道題的資料和SPJ已完工，盡情來虐吧！）

矩陣及變換，以及矩陣在DirectX和OpenGL中的運用問題：左乘 or 右乘，儲存問題：行優先 or 列優先,

1.向量和矩陣的乘法的線性代數表示　　首先，無論Direct3D還是opengl，所表示的向量和矩陣都是依據線性代數中的標準定義的：“矩陣A與B的乘積為矩陣C，則C的第i行第j列的元素c(ij)等於A的第i行與B的第j列的對應

暴力解決：WARNING: You are using pip version 20.1.1; however, version 20.2.2 is available.

1、問題：安裝第三方協程模組 gevent時，提示pip版本過時，要升級為最新版本

快速解決VS Code報錯：Java 11 or more recent is required to run. Please download and install a recent JDK

VS Code確實不是最好的Java編譯器（好吧，它或許都不該算是個編譯器），在使用的過程完全依賴咱們自己寫一些配置或者使用一些外掛，但是因為它外觀好看，我還是比較喜歡用這個。哪怕遇到的問題比別的編譯器多得多。排

Eclipse整合Maven打包時報錯：[ERROR] Unknown lifecycle phase "mvn". You must specify a valid lifecycle phase or a goal in the format

1、Eclipse整合Maven打包時報錯：[ERROR] Unknown lifecycle phase \"mvn\". You must specify a valid lifecycle phase or a goal in the format。

Faster RCNN pytroch訓練問題：Warning: NaN or Inf found in input tensor.

problem

reason

solution

總結（可能導致loss=nan的情況）[2]

For the 4th case, we put a condition that the difference of |xmax -xmin| >= 20 and similarly |ymax- ymin| >=20

Faster RCNN pytroch訓練問題：Warning: NaN or Inf found in input tensor.

Error-React.js：Warning: Each child in an array or iterator should have a unique "key" prop.

【深度學習：目標檢測】1.1 Faster RCNN理論合集

啟動時出現錯誤：*** Warning - bad CRC or NAND

計算機網路：Faster RCNN

Pytorch訓練過程出現nan的解決方式

RCNN + Fast RCNN + Faster RCNN

藍橋杯演算法訓練：網路流裸題

藍橋杯演算法訓練：Yaroslav and Algorithm

矩陣及變換，以及矩陣在DirectX和OpenGL中的運用問題：左乘 or 右乘，儲存問題：行優先 or 列優先,

暴力解決：WARNING: You are using pip version 20.1.1; however, version 20.2.2 is available.

快速解決VS Code報錯：Java 11 or more recent is required to run. Please download and install a recent JDK

Eclipse整合Maven打包時報錯：[ERROR] Unknown lifecycle phase "mvn". You must specify a valid lifecycle phase or a goal in the format

faster rcnn圖片測試

Faster-RCNN實現遙感影象滑坡識別

umijs控制檯報錯：Warning: Cannot update during an existing state transition (such as within `render`).

AcWing 藍橋杯專題訓練：（一）遞迴與遞推

python pip安裝時報錯：WARNING: Retrying等錯誤這樣解決！！

Faster-RCNN原始碼分析——AnchorGenerator

盧偉冰徵集 Redmi K40 小尾巴：真旗艦 or 大魔王

Faster RCNN pytroch訓練問題：Warning: NaN or Inf found in input tensor.

problem

reason

solution

總結（可能導致loss=nan的情況）[2]

For the 4th case, we put a condition that the difference of |xmax -xmin| >= 20 and similarly |ymax- ymin| >=20

相關推薦