RuntimeError: Trying to backward through the graph a second time

阿新 • Published: 2022-12-07

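It started when I ported someone else's CLIP-based segmentation model into my own framework and hit this error. A Google search turned up several possible causes: calling backward on multiple losses without retain_graph (https://www.zhihu.com/question/414980879), feeding the previous step's output back into an RNN without detaching it, and a few odder ones (https://www.zhihu.com/search?type=content&q=RuntimeError%3A%20Trying%20to%20backward%20through%20the). None of them matched my case.

Then I came across a CSDN post (https://blog.csdn.net/qq_49030008/article/details/125440817). Its situation was not quite mine either, but its mention of pretrained models gave me an idea: what I was running was precisely a segmentation task built on CLIP! So I started changing things more or less blindly: detaching the text feature after embedding, storing it outside the loop and passing it in on every forward, and so on. None of it worked.

With no better option, I ran the official code to compare the two side by side. The official code uses PyTorch Lightning, though, which wraps up a lot of things, and nothing else in it looked unusual... Finally, during one debugging session, I printed the text feature's is_leaf and requires_grad attributes and found that both of them flipped after two iterations! A closer look showed that the first two iterations were not real training but a validation pass (see the num_sanity_val_steps argument of the Lightning framework). I guessed that running these two validation steps performed some peculiar initialization of the attributes of the model parameters, so I checked the framework source (https://github.com/Lightning-AI/lightning/blob/master/src/pytorch_lightning/trainer/trainer.py):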

with torch.no_grad():
    val_loop.run()
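For context on what torch.no_grad() changes, here is a minimal sketch of how the same computation reports different is_leaf and requires_grad values depending on whether autograd is recording. The tensor names (w, tracked, untracked) are made up for illustration and do not come from the lang-seg code.

```python
import torch

w = torch.randn(3, requires_grad=True)  # stands in for a model parameter

# Computed with autograd enabled: a non-leaf tensor attached to a graph.
tracked = w * 2
print(tracked.is_leaf, tracked.requires_grad)  # False True

# Computed under no_grad: no graph is recorded, so the result is a leaf
# that does not require grad.
with torch.no_grad():
    untracked = w * 2
print(untracked.is_leaf, untracked.requires_grad)  # True False
```

A tensor produced inside the no_grad block carries no graph, so nothing that reuses it later can trigger a second backward through old operations.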

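It turned out that the sanity check simply runs the validation loop under torch.no_grad(), so I added the following to my own code before training: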

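# Warm-up: one forward pass without autograd, mimicking Lightning's sanity check.
with torch.no_grad():
    for cur_step, (images, labels) in enumerate(train_loader):
        images = images.to(device, dtype=torch.float32)
        outputs = model(images, labelset='')
        break  # a single batch is enough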

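Sure enough, the error went away, though I still do not understand the underlying reason. I have opened an issue on GitHub and hope to see the author's answer: https://github.com/isl-org/lang-seg/issues/38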
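One mechanism that would be consistent with these symptoms is a feature that is computed on the first forward pass and then reused. This is only an assumption, not a confirmed diagnosis of lang-seg. The sketch below uses a hypothetical CachedTextModel with a hypothetical cached_text attribute: if the cache is filled during training, the second step backpropagates through a graph whose buffers were already freed. If it is filled under torch.no_grad(), the cached tensor has no graph at all.

```python
import torch
import torch.nn as nn


class CachedTextModel(nn.Module):
    """Hypothetical model that computes a text feature once and reuses it."""

    def __init__(self):
        super().__init__()
        self.text_encoder = nn.Linear(4, 4)
        self.head = nn.Linear(4, 1)
        self.cached_text = None  # filled on the first forward pass

    def forward(self, x):
        if self.cached_text is None:
            # Whatever mode the first call runs in decides whether this
            # tensor carries an autograd graph.
            self.cached_text = self.text_encoder(torch.ones(1, 4))
        return self.head(x * self.cached_text).sum()


def train_two_steps(model):
    for _ in range(2):
        model.zero_grad()
        model(torch.randn(2, 4)).backward()


# Case 1: the cache is filled during training, so step 2 reuses a graph
# whose saved buffers were freed by the first backward().
model = CachedTextModel()
try:
    train_two_steps(model)
    failed = False
except RuntimeError as err:
    failed = True
    print(err)

# Case 2: one forward pass under no_grad fills the cache with a graph-free leaf.
warm = CachedTextModel()
with torch.no_grad():
    warm(torch.randn(2, 4))
train_two_steps(warm)  # runs without error
print(warm.cached_text.is_leaf, warm.cached_text.requires_grad)  # True False
```

Note the side effect in the second case: the cached feature receives no gradient, so in this sketch text_encoder is effectively frozen. That is harmless only if the encoder is meant to stay fixed, as a pretrained one often is.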
