使用多块GPU进行训练 1.slim.arg_scope(对于同等类型使用相同操作) 2.tf.name_scope(定义名字的范围) 3.tf.get_variable_scope().reuse_variable(参数的复用) 4.tf.py

1. slim.arg_scope(函数, 传参) # 对于同类的函数操作，都传入相同的参数

from tensorflow.contrib import slim as slim

import tensorflow as tf

@slim.add_arg_scope # 进行修饰操作

def fun1(a=0, b=0):

    return a + b

with slim.arg_scope([fun1], a=2):

    x = fun1(b=2)

    print(x)
# 4

2. tf.name_scope('tower') 对范围内的操作都添加上该操作名

import tensorflow as tf

with tf.name_scope('tower'):

    op = tf.constant(1)

print(op.name)

with tf.name_scope('tower_1'):

    op = tf.constant(1)

print(op.name)
# tower/Const:0
# tower_1/Const:0

3. tf.get_variable_scope.reuse_variable() 对于相同的操作传入使用同一个网络结构，不需要再创建新值

import tensorflow as tf

import vgg

inputs_1 = tf.random_normal([10, 224, 224, 3])

input_2 = tf.random_normal([10, 224, 224, 3])

with tf.variable_scope('vgg_16'):

    net, end_points = vgg.vgg_16(inputs_1, 100, False)

    tf.get_variable_scope().reuse_variables() # 对上述的参数进行重复使用，相当于是reuse=True

    net_, end_points_ = vgg.vgg_16(input_2, 100, False)

with tf.Session() as sess:

     print('no error')

4. tf.py_func(func, [x], [tf.int32]) 与lamda配套使用，相当于是将参数传入到func中

参数说明：func表示定义的函数, [x] 表示传入的参数，[tf.int32]表示返回的类型

import numpy as np

import tensorflow as tf

def func(x):

    return x+20

softmax_idx = np.arange(10)

softmax_idx = map(lambda x:tf.py_func(func, [x], [tf.int32]), softmax_idx)

sess = tf.Session()

for softmax_ides in softmax_idx:

    print(sess.run(softmax_ides))

使用多块GPU进行训练，主要是通过tf.device('/gpu' + str(i)) 来实现的,通过对参数

tf.variable_scope(tf.get_variable_scope) as var_scope：定义参数范围

reuse = False if i== 0 else True 来定义参数是否重复使用，对于第一次创建网络,reuse=False,后续的网络使用reuse=True，表示不需要重新构建网络

这里同样使用tf.get_variable_scope().reuse_variable()来进行参数的复用

第一步：

第一步:使用tf.data.Dataset.range() 来构造数据队列

第二步：使用softmax_data.map(lamda x:tf.pyfunc( _parse_function, [x], [tf.string, tf.int32])) # 将数据集进行传入

第三步：使用softmax_data.map(tensor_from_slice) # 将数据转换为slice格式进行输出

第四步：softmax_data.batch(single_size_batch*args.num_gpus) # 创建一个batch的数据集

第五步：softmax_data.make_initializable_iterator() # 构造迭代器

第六步： softmax_data_element = softmax_iterator.get_next_batch() 获得一个batch的数据

第七步：softmax_data_element[0].set_shape() 进行维度的设置

第八步：使用tf.spilt(softmax_data_element[0], args.num_gpus) 将数据进行切分，构造出batch_image_list 和 batch_label_list

第二步：

第一步：循环 args.gpu_num

第二步：使用with tf.device('/gpu' + str(i)) # 将训练放在gpu上

第三步：使用with tf.name_scope('tower_' + str(i)) # 定义名字的变量防止冲突

第四步：with slim.arg_scope([slim.mode_variable, slim.variable], '/cpu0') # 将变量放在CPU上进行训练

第五步：使用tf.variable_scope(tf.get_variable_scope) as var_scope 定义变量的范围

第六步：使用tf.get_variable_scope().reuse_scope() 进行变量的复用

第三步：

第一步：将添加到tower_loss的loss求出均值

第二步：求取梯度，建立训练train_op

import tensorflow as tf

from tensorflow import data as tf_data

import random

import numpy as np

from tensorflow.contrib import slim as slim

def _sample_people_softmax(x):

    global softmax_ind

    if softmax_ind >= dataset_size:

        softmax_ind = 0

        random.shuffle(indices)

    true_num_batch = min(single_batch_size*args.num_gpus, dataset_size - softmax_ind)

    sample_paths = image_list[softmax_ind:softmax_ind + true_num_batch]

    sample_labels = label_list[softmax_ind:softmax_ind+true_num_batch]

    softmax_ind += true_num_batch

    return np.array(sample_paths), np.array(sample_labels, dtype=np.int32)

def _parse_function(filename, label):

    file_content = tf.read_file(filename)

    image = tf.image.decode_image(file_content)

    print(image.shape)

    if args.random_crop:

        print('use random crop')

        image = tf.random_crop(image, [args.image_size, args.image_size, 3])

    else:

        print('Not use random crop')

        image = tf.image.resize(image, [args.image_height, args.image_width, 3])

    if args.random_flip:

        image = tf.image.random_flip_left_right(image)

    image.set_shape((args.image_height, args.image_width, 3))

    if debug:

        image = tf.cast(image, tf.float32)

    else:

        image = tf.cast(image, tf.float32)

        image = tf.subtract(image, 127.5)

        image = tf.div(image, 128)

    return image, label

learning_rate = 0.01

opt = tf.train.AdamOptimizer(learning_rate)

def _from_tensor_slices(tensor_x, tensor_y):

    return tf_data.Dataset.from_tensor_slices((tensor_x, tensor_y))

with tf.device('/cpu:0'):

    softmax_data = tf_data.Dataset.range(args.epoch_size*args.max_nrof_epochs*100)

    softmax_data = softmax_data.map(lambda x: tf.py_func(_sample_people_softmax, [x], [tf.string, tf.float32]))

    softmax_data = softmax_data.flat_map(_from_tensor_slices)

    softmax_data = softmax_data.map(_parse_function)

    softmax_data = softmax_data.batch(args.num_gpus*single_batch_size)

    softmax_iterator = softmax_data.make_initializable_iterator()

    softmax_data_element = softmax_iterator.get_next()

    softmax_data_element[0].set_shape((args.num_gpus*single_batch_size, args.image_height, args.image_width, 3))

    sotfmax_data_element[1].set_shape((args.num_gpus*single_batch_size))

    batch_image_split = tf.split(softmax_data_element[0], args.num_gpus)

    batch_label_split = tf.split(softmax_data_element[1], args.num_gpus)

tower_loss = []

for i in range(args.num_gpus):

    with tf.device('/gpu:' + str(i)):

        with tf.name_scope('tower_' + str(i)) as scope: # 对参数进行命名防止出现变量冲突

            with slim.arg_scope([slim.model_variable, slim.variable], device='/cpu:0'): # 将变量和全局变量放在cpu上

                with tf.variable_scope(tf.get_variable_scope()) as var_scope: # 设置参数变量的范围

                    reuse = False if i==0 else True

                    '''

                    '''

                    tower_loss.append(loss) # 将loss添加到tower_loss里面

                tf.get_variable_scope().reuse_variables()  # 将参数变量进行重复的使用，相当于是reuse = True

tower_loss = tf.reduce_mean(loss) # 求得损失值

# 进行损失值的train_op更新的构造

grads = opt.compute_gradients(tower_loss, tf.trainable_variables(), colocate_gradients_with_ops=True)

grad_opt = opt.apply_gradients(grads) # 计算梯度

update_op = tf.get_collection(tf.GraphKeys.UPDATE_OPS)

with tf.control_dependencies(update_op):

    train_op = tf.group(grad_opt)

sess = tf.Session()

sess.run(softmax_iterator.initializer)

使用多块GPU进行训练 1.slim.arg_scope(对于同等类型使用相同操作) 2.tf.name_scope(定义名字的范围) 3.tf.get_variable_scope().reuse_variable(参数的复用) 4.tf.py_func(构造函数)的更多相关文章

TensorFlow指定使用GPU 多块gpu
持续监控GPU使用情况命令: $ watch -n 10 nvidia-smi1一.指定使用某个显卡如果机器中有多块GPU,tensorflow会默认吃掉所有能用的显存, 如果实验室多人公用一台服务器 ...
tensorflow 13：多gpu 并行训练
多卡训练模式: 进行深度学习模型训练的时候,一般使用GPU来进行加速,当训练样本只有百万级别的时候,单卡GPU通常就能满足我们的需求,但是当训练样本量达到上千万,上亿级别之后,单卡训练耗时很长,这个时 ...
在GPU上训练数据
在GPU上训练数据模型搬到GPU上数据搬到GPU上损失函数计算搬到GPU上
tf.contrib.slim arg_scope
缘由最近一直在看深度学习的代码,又一次看到了slim.arg_scope()的嵌套使用,具体代码如下: with slim.arg_scope( [slim.conv2d, slim.separab ...
深度学习原理与框架-猫狗图像识别-卷积神经网络(代码) 1.cv2.resize(图片压缩) 2..get_shape()[1:4].num_elements(获得最后三维度之和) 3.saver.save(训练参数的保存) 4.tf.train.import_meta_graph(加载模型结构) 5.saver.restore(训练参数载入)
1.cv2.resize(image, (image_size, image_size), 0, 0, cv2.INTER_LINEAR) 参数说明:image表示输入图片,image_size表示变 ...
slim.arg_scope（）的使用
[https://blog.csdn.net/u013921430 转载] slim是一种轻量级的tensorflow库,可以使模型的构建,训练,测试都变得更加简单.在slim库中对很多常用的函数进行 ...
C++ 为什么拷贝构造函数参数必须为引用？赋值构造函数参数也必须为引用吗？
之前写拷贝构造函数的时候,以为参数为引用,不为值传递,仅仅是为了减少一次内存拷贝.然而今天看到一篇文章发现自己对拷贝构造的参数理解有误. 参数为引用,不为值传递是为了防止拷贝构造函数的无限递归,最终导 ...
slim.arg_scope中python技巧
slim.arg_scope函数说明如下: Stores the default arguments for the given set of list_ops. For usage, please ...
目标检测算法SSD在window环境下GPU配置训练自己的数据集
由于最近想试一下牛掰的目标检测算法SSD.于是乎,自己做了几千张数据(实际只有几百张,利用数据扩充算法比如镜像,噪声,切割,旋转等扩充到了几千张,其实还是很不够).于是在网上找了相关的介绍,自己处理数 ...

随机推荐

vue-复制功能插件-兼容性最好的插件
记录给自己用,不进同一次坑: https://github.com/Inndy/vue-clipboard2
第五篇 CSS入门明白三种嵌套形式，三种常用控制器
CSS入门 css是层叠式样式表 css的作用是什么呢?举个抽象的例子啊,HTML是人,CSS则是衣服... css给html穿上衣服,有三种形式: 内嵌.内联.外联. 这三种形式,优先级为 ...
dedecms sql 替换或删除
UPDATE dede_archives SET writer='你需要修改的作者' WHERE writer=''; UPDATE dede_archives SET source='你需要修改的来 ...
这打车App麻烦了！遭黑客勒索巨额比特币
6月17日下午,易到用车发布<客服电话故障公告>称,5月25日-26日,易到平台遭到网络黑客攻击,核心服务器被入侵,攻击导致易到核心数据被加密,服务器宕机,绝大部分服务功能受到波及,且攻击 ...
【CF 463F】Escape Through Leaf
题意给你一棵 $n$ 个点的树,每个节点有两个权值 $a_i,b_i$. 从一个点 $u$ 可以跳到以其为根的子树内的任意一点 $v$(不能跳到 $u$ 自己),代价是 \(a_ ...
格式化输出的三种方式,运算符及流程控制之if判断
''' 格式化输出的三种方式,运算符及流程控制之if判断 ''' # 格式化输出的三种方式 # 一.占位符程序中经常会有这样场景:要求用户输入信息,然后打印成固定的格式比如要求用户输入用户名和年龄 ...
shell知识点（一）
Shell1.概述Shell是一个命令行解释器,它接收应用程序/用户命令,然后调用操作系统内核还是一个功能相当强大的编程语言,易编写.易调试.灵活性强2.shell解析器查看linux提供的shell ...
Acwing-91-最短Hamilton路径(状压DP)
链接: https://www.acwing.com/problem/content/93/ 题意: 给定一张 n 个点的带权无向图,点从 0~n-1 标号,求起点 0 到终点 n-1 的最短Hami ...
【JZOJ5428】【NOIP2017提高A组集训10.27】查询
题目给出一个长度为n的序列a[] 给出q组询问,每组询问形如$<x,y>$,求a序列的所有区间中,数字x的出现次数与数字y的出现次数相同的区间有多少个. 分析我们可以维护一个前缀和 ...
js 多个三目运算符优先级
读JS代码遇到一段看不懂运算优先级的代码,如下 var BrowserSys = {}; var ua = navigator.userAgent.toLowerCase(); var s; (s = ...

使用多块GPU进行训练 1.slim.arg_scope(对于同等类型使用相同操作) 2.tf.name_scope(定义名字的范围) 3.tf.get_variable_scope().reuse_variable(参数的复用) 4.tf.py_func(构造函数)

使用多块GPU进行训练 1.slim.arg_scope(对于同等类型使用相同操作) 2.tf.name_scope(定义名字的范围) 3.tf.get_variable_scope().reuse_variable(参数的复用) 4.tf.py_func(构造函数)的更多相关文章

随机推荐

热门专题