TensorFlow中权重的随机初始化

　　一开始没看懂stddev是什么参数，找了一下，在tensorflow/python/ops里有random_ops，其中是这么写的：

def random_normal(shape, mean=0.0, stddev=1.0, dtype=types.float32,

                  seed=None, name=None):

  """Outputs random values from a normal distribution.

  Args:

    shape: A 1-D integer Tensor or Python array. The shape of the output tensor.

    mean: A 0-D Tensor or Python value of type `dtype`. The mean of the normal

      distribution.

    stddev: A 0-D Tensor or Python value of type `dtype`. The standard deviation

      of the normal distribution.

    dtype: The type of the output.

    seed: A Python integer. Used to create a random seed for the distribution.

      See

      [`set_random_seed`](../../api_docs/python/constant_op.md#set_random_seed)

      for behavior.

    name: A name for the operation (optional).

  Returns:

    A tensor of the specified shape filled with random normal values.

  """

　　也就是按照正态分布初始化权重，mean是正态分布的平均值，stddev是正态分布的标准差（standard deviation），seed是作为分布的random seed（随机种子，我百度了一下，跟什么伪随机数发生器还有关，就是产生随机数的），在mnist/concolutional中seed赋值为66478，挺有意思，不知道是什么原理。

　　后面还有truncated_normal的定义：

def truncated_normal(shape, mean=0.0, stddev=1.0, dtype=types.float32,

                     seed=None, name=None):

  """Outputs random values from a truncated normal distribution.

  The generated values follow a normal distribution with specified mean and

  standard deviation, except that values whose magnitude is more than 2 standard

  deviations from the mean are dropped and re-picked.

  Args:

    shape: A 1-D integer Tensor or Python array. The shape of the output tensor.

    mean: A 0-D Tensor or Python value of type `dtype`. The mean of the

      truncated normal distribution.

    stddev: A 0-D Tensor or Python value of type `dtype`. The standard deviation

      of the truncated normal distribution.

    dtype: The type of the output.

    seed: A Python integer. Used to create a random seed for the distribution.

      See

      [`set_random_seed`](../../api_docs/python/constant_op.md#set_random_seed)

      for behavior.

    name: A name for the operation (optional).

  Returns:

    A tensor of the specified shape filled with random truncated normal values.

  """

　　截断正态分布，以前都没听说过。

　　TensorFlow还提供了平均分布等。

参考：

1.https://tensorflow.googlesource.com/tensorflow/+/refs/heads/master/tensorflow/g3doc/api_docs/python

2.随机种子：http://baike.baidu.com/link?url=bjDp9u9pkEg2oWOffMep1RW6B1U-0AX2FNmykTtCAa8L_7xzA0ygq6AyLBf8pv7XW8b4gwUKlvMWiCsp32Nu8K

TensorFlow中权重的随机初始化的更多相关文章

tensorflow中的参数初始化方法
1. 初始化为常量 tf中使用tf.constant_initializer(value)类生成一个初始值为常量value的tensor对象. constant_initializer类的构造函数定义 ...
第二十二节，TensorFlow中的图片分类模型库slim的使用、数据集处理
Google在TensorFlow1.0,之后推出了一个叫slim的库,TF-slim是TensorFlow的一个新的轻量级的高级API接口.这个模块是在16年新推出的,其主要目的是来做所谓的“代码瘦 ...
第二十二节，TensorFlow中RNN实现一些其它知识补充
一初始化RNN 上一节中介绍了通过cell类构建RNN的函数,其中有一个参数initial_state,即cell初始状态参数,TensorFlow中封装了对其初始化的方法. 1.初始化为0 对于 ...
Tensorflow 中的优化器解析
Tensorflow:1.6.0 优化器(reference:https://blog.csdn.net/weixin_40170902/article/details/80092628) I: t ...
第十八节，TensorFlow中使用批量归一化(BN)
在深度学习章节里,已经介绍了批量归一化的概念,详情请点击这里:第九节,改善深层神经网络:超参数调试.正则化以优化(下) 神经网络在进行训练时,主要是用来学习数据的分布规律,如果数据的训练部分和测试部分 ...
TensorFlow中数据读取之tfrecords
关于Tensorflow读取数据,官网给出了三种方法: 供给数据(Feeding): 在TensorFlow程序运行的每一步, 让Python代码来供给数据. 从文件读取数据: 在TensorFlow ...
ML（5）——神经网络3（随机初始化与梯度检验）
随机初始化在线性回归和逻辑回归中,使用梯度下降法之前,将θ设置为0向量,有时会习惯性的将神经网络中的权重全部初始化为0,然而这在神经网络中并不适用. 以简单的三层神经网络为例,将全部权重都设置为0, ...
tensorflow中slim模块api介绍
tensorflow中slim模块api介绍翻译 2017年08月29日 20:13:35 http://blog.csdn.net/guvcolie/article/details/77686 ...
Tensorflow中使用CNN实现Mnist手写体识别
本文参考Yann LeCun的LeNet5经典架构,稍加ps得到下面适用于本手写识别的cnn结构,构造一个两层卷积神经网络,神经网络的结构如下图所示: 输入-卷积-pooling-卷积-pooling ...

随机推荐

传智播客JavaWeb day10-jdbc操作mysql、连接数据库六大步骤
第十天主要讲了jdbc操作mysql数据库,包括连接数据库六大步骤(注册数据库驱动.获得连接对象connetion.生成传输器stament.执行查询获得ResultSet.遍历结果集.关闭资源).介 ...
鼠标焦点变化引起mouseout事件
做了个小手术,渐渐回归网络啦! 问题: 在自制的提示离鼠标太近时,会引起无法提示的功能. 自制提示离图片太近时,提示图片一直一闪一闪的,截图截不出来,就只放改善后的图片(不闪). 原因: 为什么呢?书 ...
Java - 处理某些图片泛红
参考博文: http://blog.csdn.net/kobejayandy/article/details/44346809 http://blog.csdn.net/shixing_11/arti ...
CSS3 transition效果 360度旋转旋转放大放大移动
效果一:360°旋转修改rotate(旋转度数) * { transition:All 0.4s ease-in-out; -webkit-transition:All 0.4s ease-in-o ...
ionic cordova 热更新的一些问题
因为项目需要用到更新这一块的东西,所以就查了下cordova 的热更新,然后遇到了一些问题,记录下来备忘. 项目用的是ionic 下载cordova的内容就直接跳过了. 首先是下载cordova的插 ...
Java实验三
20145113 20145102实验三实验步骤编码标准编程标准包含:具有说明性的名字.清晰的表达式.直截了当的控制流.可读的代码和注释,以及在追求这些内容时一致地使用某些规则和惯用法的重要性 ...
解决导入myeclipse的项目注释和中文是乱码
1.先说真正解决我所遇到的问题的办法. 用记事本打开——另存为——格式改为UTF-8——保存后在myeclipse就正常显示了. 2.以下是网上找到的办法,试了一些并没有解决问题,但或许是中间必须的步 ...
css的初步了解
学习了很多知识在这里,今天三月二十一日,老师讲了css的基础对css有了初步的了解. 主要学习了以下几点: 一.css的选择器 1.派生选择器 2.类选择器 3.id选择器 4.属性选择器二.cs ...
PHP实例开发（3）PHP中MVC学习之ThinkPHP
PHP中MVC学习之ThinkPHP 1.什么是MVC MVC本来是存在于Desktop程序中的,M是指数据模型,V是指用户界面,C则是控制器.使用MVC的目的是将M和V的实现代码分离 MVC是一个设 ...
API测试
API(Application Programming Interface)包含: 单元测试(Unit Testing).模块测试(Module Testing).组件测试(Component Tes ...

TensorFlow中权重的随机初始化

TensorFlow中权重的随机初始化的更多相关文章

随机推荐

热门专题