Reshape

对于的张量x，x.shape=(a, b, c, d)的情况

若调用keras.layer.Reshape(target_shape=(-1, c, d))，

处理后的张量形状为(?, ?, c, d)

若调用tf.reshape(x, shape=[-1, c, d])

处理后的张量形状为(a*b, c, d)

为了在keras代码中实现tf.reshape的效果，用lambda层做，

调用Lambda(lambda x: tf.reshape(x, shape=[-1, c, d]))(x)

nice and cool.

输出Attention的打分

这里，我们希望attention层能够输出attention的score，而不只是计算weighted sum。

在使用时

score = Attention()(x)

weighted_sum = MyMerge()([score, x])

class Attention(Layer):

    def __init__(self, **kwargs):

        super(Attention, self).__init__(**kwargs)

    def build(self, input_shape):

        assert len(input_shape) == 3

        self.w = self.add_weight(name="attention_weight",

                                   shape=(input_shape[-1],

                                          input_shape[-1]),

                                   initializer='uniform',

                                   trainable=True

                                   )

        self.b = self.add_weight(name="attention_bias",

                                   shape=(input_shape[-1],),

                                   initializer='uniform',

                                   trainable=True

                                   )

        self.v = self.add_weight(name="attention_v",

                                 shape=(input_shape[-1], 1),

                                 initializer='uniform',

                                 trainable=True

                                 )

        super(Attention, self).build(input_shape)

    def call(self, inputs):

        x = inputs

        att = K.tanh(K.dot(x, self.w) + self.b)

        att = K.softmax(K.dot(att, self.v))

        print(att.shape)

        return att

    def compute_output_shape(self, input_shape):

        return input_shape[0], input_shape[1], 1

class MyMerge(Layer):

    def __init__(self, **kwargs):

        super(MyMerge, self).__init__(**kwargs)

    def call(self, inputs):

        att = inputs[0]

        x = inputs[1]

        att = tf.tile(att, [1, 1, x.shape[-1]])

        outputs = tf.multiply(att, x)

        outputs = K.sum(outputs, axis=1)

        return outputs

    def compute_output_shape(self, input_shape):

        return input_shape[1][0], input_shape[1][2]

keras中Model的嵌套

这边是转载自https://github.com/uhauha2929/examples/blob/master/Hierarchical%20Attention%20Networks%20.ipynb

可以看到，sentEncoder是Model类型，在后面的时候通过TimeDistributed(sentEncoder)，当成一个层那样被调用。

embedding_layer = Embedding(len(word_index) + 1,

                            EMBEDDING_DIM,

                            input_length=MAX_SENT_LENGTH)

sentence_input = Input(shape=(MAX_SENT_LENGTH,), dtype='int32')

embedded_sequences = embedding_layer(sentence_input)

l_lstm = Bidirectional(LSTM(100))(embedded_sequences)

sentEncoder = Model(sentence_input, l_lstm)

review_input = Input(shape=(MAX_SENTS,MAX_SENT_LENGTH), dtype='int32')

review_encoder = TimeDistributed(sentEncoder)(review_input)

l_lstm_sent = Bidirectional(LSTM(100))(review_encoder)

preds = Dense(2, activation='softmax')(l_lstm_sent)

model = Model(review_input, preds)

Keras实现Hierarchical Attention Network时的一些坑的更多相关文章

Hierarchical Attention Based Semi-supervised Network Representation Learning
Hierarchical Attention Based Semi-supervised Network Representation Learning 1. 任务给定:节点信息网络目标:为每个节 ...
Dual Attention Network for Scene Segmentation
Dual Attention Network for Scene Segmentation 原始文档 https://www.yuque.com/lart/papers/onk4sn 在本文中,我们通 ...
语义分割之Dual Attention Network for Scene Segmentation
Dual Attention Network for Scene Segmentation 在本文中,我们通过基于自我约束机制捕获丰富的上下文依赖关系来解决场景分割任务. 与之前通过多尺 ...
Paper | Residual Attention Network for Image Classification
目录 1. 相关工作 2. Residual Attention Network 2.1 Attention残差学习 2.2 自上而下和自下而上 2.3 正则化Attention 最近看了些关于att ...
A Survey of Model Compression and Acceleration for Deep Neural Network时s
A Survey of Model Compression and Acceleration for Deep Neural Network时s 本文全面概述了深度神经网络的压缩方法,主要可分为参数修 ...
注意力机制---Attention、local Attention、self Attention、Hierarchical attention
一.编码-解码架构目的:解决语音识别.机器翻译.知识问答等输出输入序列长度不相等的任务. C是输入的一个表达(representation),包含了输入序列的有效信息. 它可能是一个向量,也可能是一 ...
Residual Attention Network for Image Classification（CVPR 2017）详解
一.Residual Attention Network 简介这是CVPR2017的一篇paper,是商汤.清华.香港中文和北邮合作的文章.它在图像分类问题上,首次成功将极深卷积神经网络与人类视觉注 ...
论文解读（FedGAT）《Federated Graph Attention Network for Rumor Detection》
论文信息论文标题:Federated Graph Attention Network for Rumor Detection论文作者:Huidong Wang, Chuanzheng Bai, Ji ...
关于pyinstaller打包程序时设置icon时的一个坑
关于pyinstaller打包程序时设置icon时的一个坑之前在用pyinstaller打包程序的时候遇到了关于设置图标的一点小问题,无论在后面加--icon 或是-i都出现报错.查了下st ...

随机推荐

Qt编写自定义控件71-圆弧进度条
一.前言现在web形式的图表框架非常流行,国产代表就是echart,本人用过几次,三个字屌爆了来形容,非常强大,而且易用性也非常棒,还是开源免费的,使用起来不要太爽,内置的各种图表和仪表盘等非常丰富 ...
安装私有docker仓库
简介: 虽然国内已经有了很多docker加速镜像,以前用的daocloud,最近又找到了阿里云. 但是私有网络部署kubernetes,用不了加速镜像,还是自己部署一个比较好. 一:安装docker ...
登录另一台linux主机并且执行相应的命令
[root@bogon ~]# cat a.sh #!/bin/bash ssh root@192.168.0.98 'ls /root'
css代码陷阱
1.选择器优先级 <!DOCTYPE html> <html lang="en"> <head> <meta charset=" ...
windows2008R2下iis7.5中的url重写(urlrewrite)
以前在windows2003里,使用的是iis6.0,那时常使用的URL重写组件是iisrewrite,当服务器升级到windows2008R2时,IIS成了64位的7.5,结果iisreite组件是 ...
SET IDENTITY_INSERT的用法，具体去体验一下
如果将值插入到表的标识列中,需要启用 SET IDENTITY_INSERT. 举例如下: 创建表Orders.Products,Orders表与Products表分别有标识列OrderID与Prod ...
16、vue引入echarts，划中国地图
vue引入echarts npm install echarts --save main.js引入 import echarts from 'echarts' Vue.prototype.$echar ...
SPSS数据分析基础考题
选择题 1. SPSS发行版本的说法,正确的是: B A. 两年发行一个新版本 B.一年发行一个新版本 C.没有任何规律 D.三年发行三个新版本 2.哪些是SPSS统计分析软件的基本窗口: A A.结 ...
【Chrome插件】Session Buddy--搁置标签页
写在前面:看文章前请先看文章写作时间,避免浪费时间.2019-09-10 使用场景 Chrome打开许多网页,临时有事需要把当前的一些标签页一键保存,等待事后继续处理. 操作演示原片地址:https ...
idea安装svn
idea不像eclipse那样是用插件,idea是直接指向已经安装好的svn.exe.

Keras实现Hierarchical Attention Network时的一些坑

Reshape

输出Attention的打分

keras中Model的嵌套

Keras实现Hierarchical Attention Network时的一些坑的更多相关文章

随机推荐

热门专题