【TensorFlow】InternalError: Failed copying input tensor

TensorFlow-GPU 执行模型训练时报错：

InternalError: Failed copying input tensor from /job:localhost/replica:0/task:0/device:CPU:0 to /job:localhost/replica:0/task:0/device:GPU:0 in order to run _EagerConst: Dst tensor is not initialized.

解决方案：『TensorFlow: Dst tensor is not initialized - Stack Overflow』

主要原因在于 batch_size 太大，内存无法负载，将 batch_size 适当调小即可正常运行。

【注】默认情况下，TF 会尽可能地多分配占用 GPU 内存，通过调整 GPUConfig 可以设置为按需分配内存，参考 TensorFlow 文档和 TensorFlow 代码。

另外，使用 Jupyter Notebook 进行长期模型训练时，可能由于 GPU 内存无法及时释放导致该报错。参考此答案可以解决此问题，定义如下函数：

from keras.backend import set_session

from keras.backend import clear_session

from keras.backend import get_session

import gc

# Reset Keras Session

def reset_keras():

    sess = get_session()

    clear_session()

    sess.close()

    sess = get_session()

    try:

        del classifier # this is from global space - change this as you need

    except:

        pass

    print(gc.collect()) # if it does something you should see a number as output

    # use the same config as you used to create the session

    config = tf.compat.v1.ConfigProto()

    config.gpu_options.per_process_gpu_memory_fraction = 1

    config.gpu_options.visible_device_list = "0"

    set_session(tf.compat.v1.Session(config=config))

需要清除 GPU 内存时，直接调用 reset_keras 函数即可。例如：

dense_layers = [0, 1, 2]

layer_sizes = [32, 64, 128]

conv_layers = [1, 2, 3]

for dense_layer in dense_layers:

    for layer_size in layer_sizes:

        for conv_layer in conv_layers:

            reset_keras()

            # training your model here

【TensorFlow】InternalError: Failed copying input tensor的更多相关文章

【Tensorflow】tf.nn.depthwise_conv2d如何实现深度卷积?
版权声明:本文为博主原创文章,遵循CC 4.0 BY-SA版权协议,转载请附上原文出处链接和本声明. 本文链接:https://blog.csdn.net/mao_xiao_feng/article/ ...
【Tensorflow】tf.nn.atrous_conv2d如何实现空洞卷积？膨胀卷积
介绍关于空洞卷积的理论可以查看以下链接,这里我们不详细讲理论: 1.Long J, Shelhamer E, Darrell T, et al. Fully convolutional network ...
【TensorFlow】tf.nn.max_pool实现池化操作
max pooling是CNN当中的最大值池化操作,其实用法和卷积很类似有些地方可以从卷积去参考[TensorFlow]tf.nn.conv2d是怎样实现卷积的? tf.nn.max_pool(va ...
【TensorFlow】自主实现包含全节点Cell的LSTM层 Cell
0x00 前言常用的LSTM,或是双向LSTM,输出的结果通常是以下两个:1) outputs,包括所有节点的hidden2) 末节点的state,包括末节点的hidden和cell大部分任务有这些 ...
【TensorFlow】tf.nn.softmax_cross_entropy_with_logits的用法
在计算loss的时候,最常见的一句话就是 tf.nn.softmax_cross_entropy_with_logits ,那么它到底是怎么做的呢? 首先明确一点,loss是代价值,也就是我们要最小化 ...
【TensorFlow】：解决TensorFlow的ImportError: DLL load failed: 动态链接库(DLL)初始化例程失败
[背景] 在scikit-learn基础上系统结合数学和编程的角度学习了机器学习后(我的github:https://github.com/wwcom614/machine-learning),意犹未 ...
【转载】【TensorFlow】static_rnn 和dynamic_rnn的区别
原文地址: https://blog.csdn.net/qq_20135597/article/details/88980975 ----------------------------------- ...
【TensorFlow】tf.nn.conv2d是怎样实现卷积的？
tf.nn.conv2d是TensorFlow里面实现卷积的函数,参考文档对它的介绍并不是很详细,实际上这是搭建卷积神经网络比较核心的一个方法,非常重要 tf.nn.conv2d(input, fil ...
【LeetCode】Two Sum II - Input array is sorted
[Description] Given an array of integers that is already sorted in ascending order, find two numbers ...
【tensorflow】1.安装Tensorflow开发环境，安装Python 的IDE--PyCharm
================================================== 安装Tensorflow开发环境,安装Python 的IDE--PyCharm 1.PyCharm ...

随机推荐

模拟浏览器与服务器交互(简易TomCat框架)
模拟浏览器发送请求到服务器获取资源的思想和代码实现浏览器发送请求到服务器获取资源的流程和概念日常我们使用的浏览器,底层都是帮我们做了很多事情,我们只需要用,比如输入www.baidu.com,就可 ...
CentOS7.6 添加系统自启脚本
一.编辑脚本 1.在自定义的脚本中添加 # chkconfig: 235 20 80 # chkconfig: 2345 20 80 其中2345是默认启动级别,全部0-6共有7个级别. 0表示:表示 ...
TouchableOpacity无效
错误代码如下: <TouchableOpacity onPress={this.handleConfirmPress} activeOpacity={0.6} > <Text sty ...
NLP知识栈
echarts来显示世界地图和全国地图，并且可以下钻层级
echarts来显示世界地图和全国地图,并且可以下钻层级使用echarts来显示世界地图和全国地图,并且可以下钻层级使用的技术现有的功能遇到的问题解决总结参考内容直接来源码,地球资源包我 ...
Python3 时间戳格式化和减法运算
import datetime import time # 获取当前时间(2023-02-16 16:41:36) now_date = datetime.datetime.now().strftim ...
Python 闭包,生成式,推导式
闭包概念闭包,又称闭包函数或者闭合函数,其实和前面讲的嵌套函数类似, 不同之处在于,闭包中外部函数返回的不是一个具体的值,而是一个函数.一般情况下,返回的函数会赋值给一个变量,这个变量可以在后面被继 ...
replace 常用积累
1.替换有,或者.为: obj.keyword.replace(/,|./g,';') 2.替换元素标签类似于<em>文字</em>这种 let name=item.name. ...
设计模式 > 单一职责原则
SOLID原则并非单纯的1个原则,而是由5个设计原则组成的,它们分别是单一职责原则,开闭原则,里氏替换原则,接口隔离原则和依赖反转原则. 单一职责原则(SRP) 定义:一个类或者模块只负责完成一个职责 ...
MNIST数据集output with shape [1, 28, 28] doesn't match the broadcast shape [3, 28, 28]
transform = transforms.Compose([ transforms.ToTensor(), transforms.Lambda(lambda x: x.repeat(3,1,1)) ...

【TensorFlow】InternalError: Failed copying input tensor

【TensorFlow】InternalError: Failed copying input tensor的更多相关文章

随机推荐

热门专题