TensorFlow Source Code Analysis: BasicLSTMCell

BasicLSTMCell is the simplest LSTM cell. Its source is in /tensorflow/contrib/rnn/python/ops/core_rnn_cell_impl.py.
BasicLSTMCell inherits from RNNCell, whose source is in /tensorflow/python/ops/rnn_cell_impl.py.

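Notes:
1. The input_size parameter is deprecated and has no effect; the size of the cell is set through num_units.

2. The official recommendation is to set state_is_tuple to True. The states passed in and returned are then 2-tuples of c (the cell state) and h (the output).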

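3. The output h and the cell state c both have shape batch_size * num_units. This walkthrough also assumes the input has num_units columns, although the cell itself accepts any input width, because _linear concatenates the input with h before multiplying.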

# num_units is specified here.
cell = tf.contrib.rnn.BasicLSTMCell(num_units, forget_bias=0.0, state_is_tuple=True)

# batch_size is specified here; c and h are both initialized to zeros,
# each with shape batch_size * num_units.
_initial_state = cell.zero_state(batch_size, tf.float32)
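
To make the state layout from notes 2 and 3 concrete, here is a minimal NumPy sketch (not TensorFlow code) of what an all-zero state looks like in both modes. The helper `zero_state`, the local `LSTMStateTuple` stand-in, and the sizes 4 and 3 are made up for illustration; only the shapes are meant to mirror what `state_size` reports.

```python
from collections import namedtuple

import numpy as np

# Stand-in for TensorFlow's LSTMStateTuple; fields follow the (c, h) order.
LSTMStateTuple = namedtuple("LSTMStateTuple", ("c", "h"))


def zero_state(batch_size, num_units, state_is_tuple=True):
    """Hypothetical helper: build an all-zero LSTM state as NumPy arrays."""
    if state_is_tuple:
        # Two separate arrays, each of shape (batch_size, num_units).
        return LSTMStateTuple(
            c=np.zeros((batch_size, num_units), dtype=np.float32),
            h=np.zeros((batch_size, num_units), dtype=np.float32),
        )
    # One array holding c and h side by side along the column axis.
    return np.zeros((batch_size, 2 * num_units), dtype=np.float32)


tuple_state = zero_state(batch_size=4, num_units=3)
concat_state = zero_state(batch_size=4, num_units=3, state_is_tuple=False)
print(tuple_state.c.shape, tuple_state.h.shape)  # (4, 3) (4, 3)
print(concat_state.shape)                        # (4, 6)
```

The tuple form keeps c and h apart, so no split is needed at every step, which is consistent with the warning in the source that the concatenated state is slower.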

4. The BasicLSTMCell source, with comments:

class BasicLSTMCell(RNNCell):
  """Basic LSTM recurrent network cell.

  The implementation is based on: http://arxiv.org/abs/1409.2329.

  We add forget_bias (default: 1) to the biases of the forget gate in order to
  reduce the scale of forgetting in the beginning of the training.

  It does not allow cell clipping, a projection layer, and does not
  use peep-hole connections: it is the basic baseline.

  For advanced models, please use the full LSTMCell that follows.
  """

  def __init__(self, num_units, forget_bias=1.0, input_size=None,
               state_is_tuple=True, activation=tanh):
    """Initialize the basic LSTM cell.

    Args:
      num_units: int, The number of units in the LSTM cell.
      forget_bias: float, The bias added to forget gates (see above).
      input_size: Deprecated and unused.
      state_is_tuple: If True, accepted and returned states are 2-tuples of
        the `c_state` and `m_state`.  If False, they are concatenated
        along the column axis.  The latter behavior will soon be deprecated.
      activation: Activation function of the inner states.
    """
    if not state_is_tuple:
      logging.warn("%s: Using a concatenated state is slower and will soon be "
                   "deprecated.  Use state_is_tuple=True.", self)
    if input_size is not None:
      logging.warn("%s: The input_size parameter is deprecated.", self)
    self._num_units = num_units
    self._forget_bias = forget_bias
    self._state_is_tuple = state_is_tuple
    self._activation = activation

  @property
  def state_size(self):
    return (LSTMStateTuple(self._num_units, self._num_units)
            if self._state_is_tuple else 2 * self._num_units)

  @property
  def output_size(self):
    return self._num_units

  def __call__(self, inputs, state, scope=None):
    """Long short-term memory cell (LSTM)."""
    with vs.variable_scope(scope or "basic_lstm_cell"):
      # Parameters of gates are concatenated into one multiply for efficiency.
      if self._state_is_tuple:
        c, h = state
      else:
        c, h = array_ops.split(value=state, num_or_size_splits=2, axis=1)

      # Linear map: concat = [inputs, h] W + b.
      # _linear allocates W and b. W has shape
      # (input_dim + num_units, 4 * num_units), which is
      # (2 * num_units, 4 * num_units) when the input width equals num_units;
      # b has shape (4 * num_units,). Together they hold four sets of
      # parameters, one each for i, j, f and o.
      # concat has shape (batch_size, 4 * num_units).
      # Note: inputs and h are concatenated along axis 1 before the multiply,
      # so a single W serves both and the input width need not equal num_units.
      concat = _linear([inputs, h], 4 * self._num_units, True, scope=scope)

      # i = input_gate, j = new_input, f = forget_gate, o = output_gate
      i, j, f, o = array_ops.split(value=concat, num_or_size_splits=4, axis=1)

      new_c = (c * sigmoid(f + self._forget_bias) + sigmoid(i) *
               self._activation(j))
      new_h = self._activation(new_c) * sigmoid(o)

      if self._state_is_tuple:
        new_state = LSTMStateTuple(new_c, new_h)
      else:
        new_state = array_ops.concat([new_c, new_h], 1)
      return new_h, new_state
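
The arithmetic inside `__call__` can be replayed outside TensorFlow. The following is a rough NumPy sketch of a single step, not the library implementation: `lstm_step` is a hypothetical name, the weights are random, and W is assumed to have shape (input_dim + num_units, 4 * num_units) because the input and h are concatenated before the multiply. The input width is deliberately chosen different from num_units.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_step(inputs, c, h, W, b, forget_bias=1.0):
    """One BasicLSTMCell-style step on NumPy arrays (illustrative only)."""
    # [inputs, h] W + b: a single matmul yields all four pre-activations.
    concat = np.concatenate([inputs, h], axis=1) @ W + b
    # i = input_gate, j = new_input, f = forget_gate, o = output_gate
    i, j, f, o = np.split(concat, 4, axis=1)
    new_c = c * sigmoid(f + forget_bias) + sigmoid(i) * np.tanh(j)
    new_h = np.tanh(new_c) * sigmoid(o)
    return new_h, (new_c, new_h)


batch_size, input_dim, num_units = 2, 5, 3  # input_dim != num_units on purpose
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(input_dim + num_units, 4 * num_units))
b = np.zeros(4 * num_units)
x = rng.normal(size=(batch_size, input_dim))
c0 = np.zeros((batch_size, num_units))
h0 = np.zeros((batch_size, num_units))

out, (c1, h1) = lstm_step(x, c0, h0, W, b)
print(out.shape, c1.shape, h1.shape)  # (2, 3) (2, 3) (2, 3)
```

Whatever the input width, the returned output and both parts of the state come out as (batch_size, num_units), and the output is the same array as the h part of the new state.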

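5. The LSTM layer: the computation for each batch

# Assumed setup, not shown in the original snippet.
outputs = []
state = _initial_state

with tf.variable_scope("RNN"):
    for time_step in range(num_steps):
        # Share the cell's variables across all time steps after the first.
        if time_step > 0: tf.get_variable_scope().reuse_variables()
        (cell_output, state) = cell(inputs[:, time_step, :], state)
        outputs.append(cell_output)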
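
The same unrolling pattern can be tried without a TensorFlow graph. This is a NumPy sketch under assumptions: `lstm_step` is a hypothetical stand-in for the cell, the weights are random, and the sizes are arbitrary. The point it illustrates is that one W and one b are shared by every time step while the state is threaded through the loop.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_step(x_t, state, W, b, forget_bias=0.0):
    """Illustrative LSTM step; state is a (c, h) pair."""
    c, h = state
    concat = np.concatenate([x_t, h], axis=1) @ W + b
    i, j, f, o = np.split(concat, 4, axis=1)
    new_c = c * sigmoid(f + forget_bias) + sigmoid(i) * np.tanh(j)
    new_h = np.tanh(new_c) * sigmoid(o)
    return new_h, (new_c, new_h)


batch_size, num_steps, input_dim, num_units = 2, 4, 3, 3
rng = np.random.default_rng(0)
inputs = rng.normal(size=(batch_size, num_steps, input_dim))
# Created once and shared by every time step, like the reused variables.
W = rng.normal(scale=0.1, size=(input_dim + num_units, 4 * num_units))
b = np.zeros(4 * num_units)

state = (np.zeros((batch_size, num_units)), np.zeros((batch_size, num_units)))
outputs = []
for time_step in range(num_steps):
    cell_output, state = lstm_step(inputs[:, time_step, :], state, W, b)
    outputs.append(cell_output)

# Stack the per-step outputs into (batch_size, num_steps, num_units).
stacked = np.stack(outputs, axis=1)
print(stacked.shape)  # (2, 4, 3)
```

After the loop, `state` holds the final (c, h) pair, which is what would be carried into the next batch if the state is kept across batches.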

6. Each epoch

7. The full computation
