Failed to get convolution algorithm. This is probably because cuDNN failed to initialize

阿新 • • 發佈：2020-12-29

最近在學tf2，訓練模型的時候遇到以下錯誤：
在這裡插入圖片描述

解決方案如下（新增以下程式碼）：

physical_devices = tf.config.experimental.list_physical_devices('GPU')
assert len(physical_devices) > 0, "Not enough GPU hardware devices available"
tf.config.experimental.set_memory_growth(physical_devices[0], True)

全部程式碼：

import tensorflow as tf
import numpy as np

# 解決方法
physical_devices = tf.config.experimental.list_physical_devices('GPU')
assert len(physical_devices) > 0, "Not enough GPU hardware devices available"
tf.config.experimental.set_memory_growth(physical_devices[0], True)


class MNISTdataloader():
    def __init__(self):
        mnist = tf.keras.datasets.mnist
        (self.train_data, self.train_label), (self.test_data, self.test_label) = mnist.load_data()
        self.train_data = np.expand_dims(self.train_data.astype(np.float32)/255.0, axis=-1)
        self.test_data = np.expand_dims(self.test_data.astype(np.float32)/255.0, axis=-1)

        self.train_label = self.train_label.astype(np.int32)
        self.test_label = self.test_label.astype(np.int32)
        self.num_train_data, self.num_test_data = self.train_data.shape[0], self.test_data.shape[0]

    def get_batch(self, batch_size):
        index = np.random.randint(0, self.train_data.shape[0], batch_size)
        return self.train_data[index, :], self.train_label[index]


class CNN(tf.keras.Model):
    def __init__(self):
        super(CNN, self).__init__()
        self.conv1 = tf.keras.layers.Conv2D(filters=32, kernel_size=[5, 5], padding='same', activation=tf.nn.relu)
        self.pool1 = tf.keras.layers.MaxPool2D(pool_size=[2, 2], strides=2)
        self.conv2 = tf.keras.layers.Conv2D(filters=64, kernel_size=[5, 5], padding='same', activation=tf.nn.relu)
        self.pool2 = tf.keras.layers.MaxPool2D(pool_size=[2, 2], strides=2)

        self.flatten = tf.keras.layers.Reshape(target_shape=(7 * 7 * 64,))
        self.dense1 = tf.keras.layers.Dense(units=1024, activation=tf.nn.relu)
        self.dense2 = tf.keras.layers.Dense(units=10)

    def call(self, input):
        x = self.conv1(input)
        x = self.pool1(x)
        x = self.conv2(x)
        x = self.pool2(x)
        x = self.flatten(x)
        x = self.dense1(x)
        x = self.dense2(x)
        output = tf.nn.softmax(x)
        return output


if __name__ == '__main__':
    with tf.device('/gpu:0'):
        num_epochs = 1
        batch_size = 10
        learning_rate = 0.001
        model = CNN()
        data_loader = MNISTdataloader()
        optimizer = tf.keras.optimizers.Adam(learning_rate=learning_rate)
        num_batches = int(data_loader.num_train_data // batch_size * num_epochs)
        for batch_index in range(num_batches):
            x, y = data_loader.get_batch(batch_size)
            with tf.GradientTape() as tape:
                y_pred = model(x)
                loss = tf.keras.losses.sparse_categorical_crossentropy(y, y_pred)
                loss = tf.reduce_mean(loss)
                print('batch %d: loss %f' % (batch_index, loss.numpy()))
            grads = tape.gradient(loss, model.variables)
            optimizer.apply_gradients(grads_and_vars=zip(grads, model.variables))

    sparse_categorical_accuracy = tf.keras.metrics.SparseCategoricalAccuracy()
    num_batches = int(data_loader.num_test_data // batch_size)
    for batch_index in range(num_batches):
        start_index, end_index = batch_index * batch_size, (batch_index + 1) * batch_size
        y_pred = model.predict(data_loader.test_data[start_index:end_index])
        sparse_categorical_accuracy.update_state(
            y_true=data_loader.test_label[start_index:end_index], y_pred=y_pred
        )
    print('test accuracy:  %f' % sparse_categorical_accuracy.result())

參考：https://www.cnblogs.com/dxscode/p/11657197.html

Failed to get convolution algorithm. This is probably because cuDNN failed to initialize

技術標籤：深度學習tensorflow 最近在學tf2，訓練模型的時候遇到以下錯誤：解決方案如下（新增以下程式碼）：

tensorflow出現Failed to get convolution algorithm， cuDNN failed to initialize

網上大多的教程是說tensorflow的版本過高，或者說cuda和cudnn的版本不對，需要降級，但這樣會很麻煩！！！

The web application [] appears to have started a thread named [Abandoned connection cleanup thread] but has failed to stop it. This is very likely to create a memory leak

SSM整合小專案關閉時tomcat報錯： The web application [] appears to have started a thread named [Abandoned connection cleanup thread] but has failed to stop it.This is very likely to create a memory

MacOS svn:E230001 Can't use Subversion command line client: svn The path to the Subversion executable is probably wrong.

注意：本文僅針對於 MacOS 系統。錯誤資訊如下： Can\'tuseSubversioncommandlineclient:svnThepathtotheSubversionexecutableisprobablywrong.Fixit.

OEM報錯"Failed to connect to ASM instance. The connection is closed: The connection is closed"處理

OEM報錯\"Failed to connect to ASM instance. The connection is closed: The connection is closed\"處理

Error creating bean with name 'xxx': Lookup method resolution failed; nested exception is java.lang.IllegalStateException: Failed to introspect Class [xxx]

org.springframework.beans.factory.BeanCreationException: Error creating bean with name \'commonExceptionAdvice\': Lookup method resolution failed; nested exception is java.lang.IllegalStateException:

springboot啟動報錯：nested exception is java.lang.IllegalStateException: Failed to introspect annotated methods on class org.springframework.boot.web.servlet.support.SpringBootServletInitializer

問題：　　今天將一個springboot工程，由jar包形式改為war包，啟動一直報錯：nested exception is java.lang.IllegalStateException: Failed to introspect annotated methods on class org.springframework.boot.w

kubelet failed to get container info for "/system.slice/docker.service": unknown container "/system.slice/docker.service"錯誤

kubernetes版本1.18.6 描述：在檢視kubelet狀態或是在檢視日誌時有以下錯誤 Jun 28 14:05:08 cwztapp131 kubelet[775]: E0628 14:05:08.185793 775 summary_sys_containers.go:47] Failed to get system container

Factory method 'eurekaClient' threw exception; nested exception is java.lang.RuntimeException: Failed to initialize DiscoveryClient!

1Factory method \'eurekaClient\' threw exception; nested exception is java.lang.RuntimeException: Failed to initialize DiscoveryClient!

K8S-kubelet報錯： failed to get c ontainer info for "/system.slice/docker.service": unknown container "/system.slice/docker.service"

K8S版本：1.17.11 今天檢視kubelet日誌的時候，發信一堆報錯：檢視kubelet日誌：]# journalctl -f -u kubelet

This system is not registered to Red Hat Subscription Management. You can use subscription-manager to register.

目錄 redhat7.9 redhat8.5 redhat7.9 今天裝完後Redhat7.9忘記了yum的問題，在安裝命令時提示如下：

Slave is not configured or failed to initialize properly. You must at least set --server-id

一、如果版本不一樣請執行以下操作：MySQL 跨版本主從複製時報錯：ERROR 1794 (HY000): Slave is not configured or failed to initialize properly. 背景： zabbix 資料庫遷移，搭建主從，主是5.6.25，從是5.

K8s 還是 k3s？This is a question

本文來自：Rancher Labs 自k3s問世以來，社群裡有許多小夥伴都問過這樣的問題“除了中間的數字之外，k3s和K8s的區別在哪裡？”，“在兩者之間應該如何選擇？”。本文將簡單介紹它們兩者的區別。

redis.exceptions.ResponseError: MISCONF Redis is configured to save RDB snapshots, but is currently not able to persist on disk.

這個錯誤資訊是Redis客戶端工具在儲存資料時候丟擲的異常資訊。網上很多人的回答都是“config set stop-writes-on-bgsave-error no”。

window下 mysql5.7查詢報錯： ORDER BY clause is not in GROUP BY..this is incompatible with sql_mode=only_full_group_by

一、舊方法，修改mysql配置檔案，但是會導致資料丟失等不可預知的錯誤在用mysql執行如下查詢的時候：

Unable to get repr for <class 'django.db.models.query.QuerySet'>無法讀取出mysql中的資料

D:\\python_learn\\meiduo_project\\meiduo_mall\\meiduo_mall\\apps\\goods\\views.py:38: UnorderedObjectListWarning: Pagination may yield inconsistent results with an unordered object_list: <class \'

ERROR 1419 (HY000) at line 9: You do not have the SUPER privilege and binary logging is enabled (you might want to use the less safe log_bin_trust_function_creators variable)

報錯原因在將函式或觸發器匯入MySQL資料庫時，會出現以下錯誤：“您沒有SUPER特權，並且啟用了二進位制日誌記錄（您*可能*想要使用不太安全的log_bin_trust_function_creators變數）”。

Failed to get convolution algorithm. This is probably because cuDNN failed to initialize

解決方案如下（新增以下程式碼）：

全部程式碼：

參考：https://www.cnblogs.com/dxscode/p/11657197.html

Failed to get convolution algorithm. This is probably because cuDNN failed to initialize

tensorflow出現Failed to get convolution algorithm， cuDNN failed to initialize

The web application [] appears to have started a thread named [Abandoned connection cleanup thread] but has failed to stop it. This is very likely to create a memory leak

MacOS svn:E230001 Can't use Subversion command line client: svn The path to the Subversion executable is probably wrong.

OEM報錯"Failed to connect to ASM instance. The connection is closed: The connection is closed"處理

Error creating bean with name 'xxx': Lookup method resolution failed; nested exception is java.lang.IllegalStateException: Failed to introspect Class [xxx]

springboot啟動報錯：nested exception is java.lang.IllegalStateException: Failed to introspect annotated methods on class org.springframework.boot.web.servlet.support.SpringBootServletInitializer

kubelet failed to get container info for "/system.slice/docker.service": unknown container "/system.slice/docker.service"錯誤

Factory method 'eurekaClient' threw exception; nested exception is java.lang.RuntimeException: Failed to initialize DiscoveryClient!

K8S-kubelet報錯： failed to get c ontainer info for "/system.slice/docker.service": unknown container "/system.slice/docker.service"

This system is not registered to Red Hat Subscription Management. You can use subscription-manager to register.

Slave is not configured or failed to initialize properly. You must at least set --server-id

K8s 還是 k3s？This is a question

redis.exceptions.ResponseError: MISCONF Redis is configured to save RDB snapshots, but is currently not able to persist on disk.

window下 mysql5.7查詢報錯： ORDER BY clause is not in GROUP BY..this is incompatible with sql_mode=only_full_group_by

Unable to get repr for <class 'django.db.models.query.QuerySet'>無法讀取出mysql中的資料

ERROR 1419 (HY000) at line 9: You do not have the SUPER privilege and binary logging is enabled (you might want to use the less safe log_bin_trust_function_creators variable)

【論文閱讀筆記】How Robust is 3D Human Pose Estimation to Occlusion?

解決Error generating final archive: Unable to get debug signature key問題

MISCONF Redis is configured to save RDB snapshots, but is currently not able to persist on disk

Failed to get convolution algorithm. This is probably because cuDNN failed to initialize

解決方案如下（新增以下程式碼）：

全部程式碼：

參考：https://www.cnblogs.com/dxscode/p/11657197.html

相關推薦