Shared variables with tf.get_variable and namespaces with tf.variable_scope in TensorFlow

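TensorFlow has many situations that call for variable sharing. For example, when a network is trained on multiple GPUs, the network parameters and the training data need to be shared across the devices.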

tf.get_variable() creates a shared variable or retrieves an existing one. Its docstring states the purpose directly: 'Gets an existing variable with this name or create a new one'.

The counterpart of tf.get_variable is tf.Variable. The two differ as follows:

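tf.Variable detects naming conflicts and resolves them automatically. For example, if a variable named 'wg_1' already exists and tf.Variable is called again with the name 'wg_1', the second variable is automatically renamed to 'wg_1_1'. In effect two separate variables are created, so tf.Variable cannot create shared variables.
tf.get_variable does not resolve naming conflicts automatically. If a variable with the same name already exists and the enclosing scope has not been set to reuse variables, TensorFlow raises an error.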
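
The contrast between the two can be sketched without TensorFlow at all. What follows is a simplified pure-Python model, not TensorFlow's actual implementation: ToyGraph and its methods are hypothetical names invented for illustration, and the renaming rule assumes a numeric suffix is appended to a name that is already taken.

```python
class ToyGraph(object):
    """Simplified stand-in for a TF1 graph; NOT TensorFlow's real implementation."""

    def __init__(self):
        self._name_counts = {}  # requested name -> number of times it was used
        self._store = {}        # name -> variable, consulted by get_variable

    def variable(self, name, value):
        """Like tf.Variable: always creates a new variable, renaming on conflict."""
        count = self._name_counts.get(name, 0)
        self._name_counts[name] = count + 1
        unique = name if count == 0 else "%s_%d" % (name, count)
        return {"name": unique, "value": value}

    def get_variable(self, name, value=None, reuse=False):
        """Like tf.get_variable: creates when reuse is False, looks up when True."""
        if reuse:
            if name not in self._store:
                raise ValueError("Variable %s does not exist" % name)
            return self._store[name]
        if name in self._store:
            raise ValueError("Variable %s already exists" % name)
        var = {"name": name, "value": value}
        self._store[name] = var
        return var


g = ToyGraph()
a = g.variable("wg_1", 1.0)
b = g.variable("wg_1", 1.0)       # same requested name, silently renamed
print(a["name"])                  # wg_1
print(b["name"])                  # wg_1_1

w = g.get_variable("w", 0.5)
try:
    g.get_variable("w", 0.5)      # same name, reuse not requested
except ValueError as err:
    conflict_message = str(err)
print(conflict_message)           # Variable w already exists

shared = g.get_variable("w", reuse=True)
print(shared is w)                # True
```

The point of the sketch is that the first style never hands back an existing object, while the second either fails loudly or returns the very same object, which is what makes sharing possible.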

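Once variables can be shared, another problem remains: when a model is large and complex, the number of variables and operations is also large. To keep these variables manageable and the graph structure clear, TensorFlow provides a sharing mechanism built on variable scopes (namespaces, tf.variable_scope), which handle both sharing and organizing variables. For example, every layer of a CNN has a weights variable and a biases variable; giving each convolutional layer its own tf.variable_scope() prevents the variable names from colliding.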
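
As a rough illustration of the CNN example, the sketch below models per-layer scopes and reuse in plain Python. It is a toy under stated assumptions, not TensorFlow code: STORE, SCOPE, conv_layer, build_tower and the layer shapes are all hypothetical, and the functions variable_scope and get_variable here only imitate the behaviour of their TensorFlow namesakes.

```python
from contextlib import contextmanager

STORE = {}                              # full name -> parameter (toy variable store)
SCOPE = {"prefix": "", "reuse": False}  # state of the currently open scope


@contextmanager
def variable_scope(name, reuse=None):
    """Toy imitation of tf.variable_scope: extends the prefix, inherits reuse."""
    saved = dict(SCOPE)
    SCOPE["prefix"] = SCOPE["prefix"] + name + "/"
    if reuse:
        SCOPE["reuse"] = True           # nested scopes inherit reuse=True
    try:
        yield
    finally:
        SCOPE.update(saved)             # restore the enclosing scope on exit


def get_variable(name, shape=None):
    """Toy imitation of tf.get_variable keyed by the scope-qualified name."""
    full = SCOPE["prefix"] + name
    if SCOPE["reuse"]:
        if full not in STORE:
            raise ValueError("Variable %s does not exist" % full)
        return STORE[full]
    if full in STORE:
        raise ValueError("Variable %s already exists" % full)
    STORE[full] = {"name": full, "shape": shape}
    return STORE[full]


def conv_layer(scope_name, shape):
    """Every layer asks for 'weights' and 'biases'; the scope keeps them apart."""
    with variable_scope(scope_name):
        return get_variable("weights", shape), get_variable("biases", shape[-1:])


def build_tower():
    return [conv_layer("conv1", [5, 5, 3, 32]), conv_layer("conv2", [5, 5, 32, 64])]


with variable_scope("model"):
    tower0 = build_tower()              # creates the four parameters
with variable_scope("model", reuse=True):
    tower1 = build_tower()              # looks the same four parameters up again

print(sorted(STORE))
print(tower0[0][0] is tower1[0][0])     # True: both towers share conv1/weights
```

Building the same model function twice, once to create and once with reuse, is the usual pattern behind multi-GPU towers: each tower runs its own ops but reads the same parameters.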

The counterpart of tf.variable_scope is tf.name_scope. The two differ as follows:

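tf.name_scope is mainly used to manage the operations in a graph; it returns a context manager named after scope_name. A graph maintains a stack of name scopes, and each name scope can contain ops or nested name scopes. This gives a hierarchical, orderly organization and avoids naming conflicts between ops.
tf.variable_scope is generally used together with tf.name_scope(). It manages the names of the variables in a graph, avoids naming conflicts between variables, and allows variables to be shared within a variable scope. Names created by tf.get_variable are prefixed only by the enclosing variable scopes, not by tf.name_scope.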
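
The division of labour between the two kinds of scope can be pictured as two separate prefix stacks. The class below is a conceptual pure-Python sketch, not TensorFlow's implementation; ToyScopes and its method names are hypothetical, and it assumes that opening a variable scope also opens a name scope of the same name.

```python
from contextlib import contextmanager


class ToyScopes(object):
    """Two prefix stacks: one for op names, one for get_variable names."""

    def __init__(self):
        self.name_stack = []  # prefixes applied to op names
        self.var_stack = []   # prefixes applied to variable names

    @contextmanager
    def name_scope(self, name):
        # Affects op names only.
        self.name_stack.append(name)
        try:
            yield
        finally:
            self.name_stack.pop()

    @contextmanager
    def variable_scope(self, name):
        # Affects variable names and, as assumed above, op names too.
        self.name_stack.append(name)
        self.var_stack.append(name)
        try:
            yield
        finally:
            self.var_stack.pop()
            self.name_stack.pop()

    def op_name(self, name):
        return "/".join(self.name_stack + [name])

    def variable_name(self, name):
        return "/".join(self.var_stack + [name])


scopes = ToyScopes()
with scopes.variable_scope("foo1"):
    with scopes.name_scope("bar1"):
        v_name = scopes.variable_name("v")
        add_name = scopes.op_name("add")

print(v_name)    # foo1/v
print(add_name)  # foo1/bar1/add
```

The variable skips the bar1 prefix while the op keeps it, which matches the names asserted in the foo1/bar1 part of the listing in this article.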

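# coding: utf-8
# Targets the TensorFlow 1.x graph-mode API (tf.get_variable, tf.variable_scope)
import tensorflow as tf

# These two definitions are essentially equivalent
v1 = tf.get_variable("v", shape=[1], initializer=tf.constant_initializer(1.0))
v2 = tf.Variable(tf.constant(1.0, shape=[1]), name="v")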

# Variables defined in different variable scopes may use the same short name
with tf.variable_scope("abc"):
    v3 = tf.get_variable("v", [1], initializer=tf.constant_initializer(1.0))

with tf.variable_scope("xyz"):
    v4 = tf.get_variable("v", [1], initializer=tf.constant_initializer(1.0))

# Re-entering "xyz" with reuse=True returns the existing variable instead of creating one
with tf.variable_scope("xyz", reuse=True):
    v5 = tf.get_variable("v")
    v6 = tf.get_variable("v", [1])

with tf.variable_scope("foo"):
    v7 = tf.get_variable("v", [1])
    # tf.get_variable_scope().reuse_variables() makes the rest of this scope reuse variables;
    # without it, defining v8 would raise an error because the name is already taken
    tf.get_variable_scope().reuse_variables()
    v8 = tf.get_variable("v", [1])
assert v7 is v8

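# A captured scope object can be re-entered later, with or without reuse
with tf.variable_scope("foo_1") as foo_scope:
    v = tf.get_variable("v", [1])
with tf.variable_scope(foo_scope):
    w = tf.get_variable("w", [1])
with tf.variable_scope(foo_scope, reuse=True):
    # Distinct names so that v1 defined at the top is not overwritten
    v_reused = tf.get_variable("v", [1])
    w_reused = tf.get_variable("w", [1])
assert v_reused is v
assert w_reused is w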

# tf.name_scope prefixes op names but is ignored by tf.get_variable
with tf.variable_scope("foo1"):
    with tf.name_scope("bar1"):
        v_1 = tf.get_variable("v", [1])
        x_1 = 1.0 + v_1
assert v_1.name == "foo1/v:0"
assert x_1.op.name == "foo1/bar1/add"

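# Identity checks: "is" states the intent directly and does not depend on how == is defined
print(v1 is v2)  # False
print(v3 is v4)  # False: they live in different variable scopes
print(v3.name)   # abc/v:0
print(v4 is v5)  # True
print(v5 is v6)  # True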
