『TensorFlow』批处理类

基础知识

下面有莫凡的对于批处理的解释:

fc_mean,fc_var = tf.nn.moments(

    Wx_plus_b,

    axes=[0],

    # 想要 normalize 的维度, [0] 代表 batch 维度

    # 如果是图像数据, 可以传入 [0, 1, 2], 相当于求[batch, height, width] 的均值/方差, 注意不要加入 channel 维度

)

scale = tf.Variable(tf.ones([out_size]))

shift = tf.Variable(tf.zeros([out_size]))

epsilon = 0.001

Wx_plus_b = tf.nn.batch_normalization(Wx_plus_b,fc_mean,fc_var,shift,scale,epsilon)

# 上面那一步, 在做如下事情:

# Wx_plus_b = (Wx_plus_b - fc_mean) / tf.sqrt(fc_var + 0.001)

# Wx_plus_b = Wx_plus_b * scale + shift

tf.contrib.layers.batch_norm：封装好的批处理类

class batch_norm():

    '''batch normalization层'''

    def __init__(self, epsilon=1e-5,

                 momentum=0.9, name='batch_norm'):

        '''

        初始化

        :param epsilon:    防零极小值

        :param momentum:   滑动平均参数

        :param name:       节点名称

        '''

        with tf.variable_scope(name):

            self.epsilon = epsilon

            self.momentum = momentum

            self.name = name

    def __call__(self, x, train=True):

        # 一个封装了的会在内部调用batch_normalization进行正则化的高级接口

        return tf.contrib.layers.batch_norm(x,

                                            decay=self.momentum,        # 滑动平均参数

                                            updates_collections=None,

                                            epsilon=self.epsilon,

                                            scale=True,

                                            is_training=train,          # 影响滑动平均

                                            scope=self.name)

1.

Note: when training, the moving_mean and moving_variance need to be updated.
    By default the update ops are placed in `tf.GraphKeys.UPDATE_OPS`, so they
    need to be added as a dependency to the `train_op`. For example:

    ```python
      update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
      with tf.control_dependencies(update_ops):
        train_op = optimizer.minimize(loss)
    ```

    One can set updates_collections=None to force the updates in place, but that
    can have a speed penalty, especially in distributed settings.

2.

is_training: Whether or not the layer is in training mode. In training mode
        it would accumulate the statistics of the moments into `moving_mean` and
        `moving_variance` using an exponential moving average with the given
        `decay`. When it is not in training mode then it would use the values of
        the `moving_mean` and the `moving_variance`.

tf.nn.batch_normalization：原始接口封装使用

实际上tf.contrib.layers.batch_norm对于tf.nn.moments和tf.nn.batch_normalization进行了一次封装，这个类又进行了一次封装（主要是制订了一部分默认参数），实际操作时可以仅仅使用tf.contrib.layers.batch_norm函数，它已经足够方便了。

添加了滑动平均处理之后，也就是不使用封装，直接使用tf.nn.moments和tf.nn.batch_normalization实现的batch_norm函数：

def batch_norm(x,beta,gamma,phase_train,scope='bn',decay=0.9,eps=1e-5):

    with tf.variable_scope(scope):

        # beta = tf.get_variable(name='beta', shape=[n_out], initializer=tf.constant_initializer(0.0), trainable=True)

        # gamma = tf.get_variable(name='gamma', shape=[n_out],

        #                         initializer=tf.random_normal_initializer(1.0, stddev), trainable=True)

        batch_mean,batch_var = tf.nn.moments(x,[0,1,2],name='moments')

        ema = tf.train.ExponentialMovingAverage(decay=decay)

        def mean_var_with_update():

            ema_apply_op = ema.apply([batch_mean,batch_var])

            with tf.control_dependencies([ema_apply_op]):

                return tf.identity(batch_mean),tf.identity(batch_var)

                # identity之后会把Variable转换为Tensor并入图中，

                # 否则由于Variable是独立于Session的，不会被图控制control_dependencies限制

        mean,var = tf.cond(phase_train,

                           mean_var_with_update,

                           lambda: (ema.average(batch_mean),ema.average(batch_var)))

　　     normed = tf.nn.batch_normalization(x, mean, var, beta, gamma, eps)

    return normed

另一种将滑动平均展开了的方式，

def batch_norm(x, size, training, decay=0.999):

    beta = tf.Variable(tf.zeros([size]), name='beta')

    scale = tf.Variable(tf.ones([size]), name='scale')

    pop_mean = tf.Variable(tf.zeros([size]))

    pop_var = tf.Variable(tf.ones([size]))

    epsilon = 1e-3

    batch_mean, batch_var = tf.nn.moments(x, [0, 1, 2])

    train_mean = tf.assign(pop_mean, pop_mean * decay + batch_mean * (1 - decay))

    train_var = tf.assign(pop_var, pop_var * decay + batch_var * (1 - decay))

    def batch_statistics():

        with tf.control_dependencies([train_mean, train_var]):

            return tf.nn.batch_normalization(x, batch_mean, batch_var, beta, scale, epsilon, name='batch_norm')

    def population_statistics():

        return tf.nn.batch_normalization(x, pop_mean, pop_var, beta, scale, epsilon, name='batch_norm')

return tf.cond(training, batch_statistics, population_statistics)

注， tf.cond：流程控制，参数一True，则执行参数二的函数，否则执行参数三函数。

『TensorFlow』批处理类的更多相关文章

『TensorFlow』专题汇总
TensorFlow:官方文档 TensorFlow:项目地址本篇列出文章对于全零新手不太合适,可以尝试TensorFlow入门系列博客,搭配其他资料进行学习. Keras使用tf.Session训 ...
『TensorFlow』流程控制
『PyTorch』第六弹_最小二乘法对比PyTorch和TensorFlow TensorFlow 控制流程操作 TensorFlow 提供了几个操作和类,您可以使用它们来控制操作的执行并向图中添加条 ...
『TensorFlow』读书笔记_降噪自编码器
『TensorFlow』降噪自编码器设计之前学习过的代码,又敲了一遍,新的收获也还是有的,因为这次注释写的比较详尽,所以再次记录一下,具体的相关知识查阅之前写的文章即可(见上面链接). # Aut ...
『TensorFlow』梯度优化相关
tf.trainable_variables可以得到整个模型中所有trainable=True的Variable,也是自由处理梯度的基础基础梯度操作方法: tf.gradients 用来计算导数.该 ...
『TensorFlow』模型保存和载入方法汇总
『TensorFlow』第七弹_保存&载入会话_霸王回马一.TensorFlow常规模型加载方法保存模型 tf.train.Saver()类,.save(sess, ckpt文件目录)方法 ...
『TensorFlow』命令行参数解析
argparse很强大,但是我们未必需要使用这么繁杂的东西,TensorFlow自己封装了一个简化版本的解析方式,实际上是对argparse的封装脚本化调用tensorflow的标准范式: impo ...
『TensorFlow』SSD源码学习_其一：论文及开源项目文档介绍
一.论文介绍读论文系列:Object Detection ECCV2016 SSD 一句话概括:SSD就是关于类别的多尺度RPN网络基本思路: 基础网络后接多层feature map 多层feat ...
『TensorFlow』DCGAN生成动漫人物头像_下
『TensorFlow』以GAN为例的神经网络类范式『cs231n』通过代码理解gan网络&tensorflow共享变量机制_上『TensorFlow』通过代码理解gan网络_中一.计算 ...
『TensorFlow』滑动平均
滑动平均会为目标变量维护一个影子变量,影子变量不影响原变量的更新维护,但是在测试或者实际预测过程中(非训练时),使用影子变量代替原变量. 1.滑动平均求解对象初始化 ema = tf.train.Ex ...

随机推荐

C# 如何使用长度来切分字符串
参考网址:https://blog.csdn.net/yenange/article/details/39637211 using System; using System.Collections.G ...
RabbitMQ的消息确认机制
一:确认种类 RabbitMQ的消息确认有两种. 一种是消息发送确认.这种是用来确认生产者将消息发送给交换器,交换器传递给队列的过程中,消息是否成功投递.发送确认分为两步,一是确认是否到达交换器,二是 ...
sql server创建windows账户
--不要干坏事 sql server中使用xp_cmdshell --1.允许配置高级选项 GO RECONFIGURE GO --2.开启xp_cmdshell服务 RECONFIGURE GO - ...
利用django-crontab设定定时任务
Django中想要设定定时任务的方法有很多,如celery.apscheduler.crontab等等,本文用crontab来实现. 想用apscheduler实现请看本人另一篇博客:使用APSche ...
VoiceXML简介
简单来说,VoiceXML就是语音网络世界的HTML,一种用于语音应用的开放标准的标记语言.VoiceXML的问世使得为HTML发展起来的web体系也能够轻松地创建和使用语音应用. 发展历史: 199 ...
node获取windows pc 机器的标示
var exec = require('child_process').exec; if(process.platform != "win32"){ //window throw ...
WxWidgets笔记
关于环境变量的配置:解压wxwidgets的压缩包之后要新建名为 WXWIN 的环境变量,变量的值为解压得到的目录,不知为何要使用此环境变量编译时使用的命令:mingw32-make -j1 -f ...
Unity之获取资源包的路径
先从缓存中获取,如果获取不到,则从包中获取. 如下: public static string GetAssetBundlePath(string path) { // 先尝试从 persist 目录 ...
html5，dom操作1
<body> <script>function hwd(){ var bb=document.getElementById('bt');// alert(bb.innerHTM ...
python 定义带默认参数的函数