论文信息

论文标题：Self-Attention Graph Pooling
论文作者：Junhyun Lee, Inyeop Lee, Jaewoo Kang
论文来源：2019, ICML
论文地址：download
论文代码：download

1 Introduction

　　图池化三种类型：

- Topology based pooling；
- Hierarchical pooling；（使用所有从 GNN 获得的节点表示）
- Hierarchical pooling；

　　关于 Hierarchical pooling 聚类分配矩阵：

　　　　$\begin{array}{j}S^{(l)}=\operatorname{softmax}\left(\mathrm{GNN}_{l}\left(A^{(l)}, X^{(l)}\right)\right) \\A^{(l+1)}=S^{(l) \top} A^{(l)} S^{(l)}\end{array} \quad\quad\quad\quad(1)$

　　gPool 取得了与 DiffPool 相当的性能，gPool 需要的存储复杂度为 $\mathcal{O}(|V|+|E|)$，而 DiffPool 需要 $\mathcal{O}\left(k|V|^{2}\right)$，其中 $V$、$E$ 和 $k$ 分别表示顶点、边和池化率。gPool 使用一个可学习的向量 $p$ 来计算投影分数，然后使用这些分数来选择排名靠前的节点。投影得分由 $p$ 与所有节点的特征之间的点积得到。这些分数表示可以保留的节点的信息量。下面的公式大致描述了 gPool 中的池化过程：

　　　　$\begin{array}{l} y=X^{(l)} \mathbf{p}^{(l)} /\left\|\mathbf{p}^{(l)}\right\|\\ \mathrm{idx}=\operatorname{top}-\operatorname{rank}(y,\lceil k N\rceil)\\A^{(l+1)}=A_{\mathrm{idx}, \mathrm{idx}}^{(l)}\end{array} \quad\quad\quad\quad(2)$

2 Method

　　框架如下：

2.1. Self-Attention Graph Pooling

Self-attention mask

　　本文使用图卷积来获得自注意分数：

　　　　$Z=\sigma\left(\tilde{D}^{-\frac{1}{2}} \tilde{A} \tilde{D}^{-\frac{1}{2}} X \Theta_{a t t}\right) \quad\quad\quad\quad(3)$

　　其中，自注意得分 $Z \in \mathbb{R}^{N \times 1}$、邻接矩阵 $\tilde{A} \in \mathbb{R}^{N \times N}$、注意力参数矩阵 $\Theta_{a t t} \in \mathbb{R}^{F \times 1}$、特征矩阵 $X \in \mathbb{R}^{N \times F}$、度矩阵 $\tilde{D} \in \mathbb{R}^{N \times N}$。

　　这里考虑节点选择方法，即使输入不同大小和结构的图，也会保留输入图的部分节点。

　　　　$\begin{array}{l} \mathrm{idx}=\operatorname{top}-\operatorname{rank}(Z,\lceil k N\rceil)\\Z_{\text {mask }}=Z_{\mathrm{idx}}\end{array} \quad\quad\quad\quad(4)$

　　基于自注意得分 $Z$ ，选择保留前 $ \lceil k N\rceil$ 个节点，其中 $k \in(0,1]$ 代表着池化率（pooling ratio），$Z_{\text{mask}}$ 是 feature attention mask。。

Graph pooling

　　接着获得新特征矩阵和邻接矩阵：

　　　　 $\begin{array}{l} X^{\prime}=X_{\mathrm{idx},:}\\X_{\text {out }}=X^{\prime} \odot Z_{\text {mask }}\\A_{\text {out }}=A_{\mathrm{idx}, \mathrm{idx}}\end{array} \quad\quad\quad\quad(5)$

　　其中，$\odot$ is the broadcasted elementwise product。

Variation of SAGPool

　　利用图特征矩阵 $X$ 和拓扑结构 $A$ ，计算注意力得分矩阵 $Z$ 的通用形式：

　　　　$Z=\sigma(\operatorname{GNN}(X, A)) \quad\quad\quad\quad(6)$

　　比如 $\text { SAGPool }_{\text {augmentation }}$，加入二跳邻居信息：

　　　　$Z=\sigma\left(\operatorname{GNN}\left(X, A+A^{2}\right)\right) \quad\quad\quad\quad(7)$

　　比如 $\text { SAGPool }_{\text {serial }}$，堆叠多层 GNN：

　　　　$Z=\sigma\left(\mathrm{GNN}_{2}\left(\sigma\left(\mathrm{GNN}_{1}(X, A)\right), A\right)\right) \quad\quad\quad\quad(8)$

　　比如 $\text { SAGPool }_{\text {parallel }}$，平均多重注意力分数。$M$ 个 GNN 的平均注意得分如下：

　　　　$Z=\frac{1}{M} \sum_{m} \sigma\left(\mathrm{GNN}_{m}(X, A)\right) \quad\quad\quad\quad(9)$

2.2 Model Architecture

　　本节用来验证模块的有效性。

Convolution layer

　　图卷积 GCN：

　　　　$h^{(l+1)}=\sigma\left(\tilde{D}^{-\frac{1}{2}} \tilde{A} \tilde{D}^{-\frac{1}{2}} h^{(l)} \Theta\right) \quad\quad\quad\quad(10)$

　　与 $\text{Eq.3}$ 不同的是，$\Theta \in \mathbb{R}^{F \times F^{\prime}}$ 。

Readout layer

　　根据 JK-net architecture 的思想：

　　　　$s=\frac{1}{N} \sum_{i=1}^{N} x_{i} \| \max _{i=1}^{N} x_{i} \quad\quad\quad\quad(11)$

　　其中：

- $N$ 代表着节点的个数；
- $x_{i}$ 代表着第 $i$ 个节点的特征向量；

Global pooling architecture & Hierarchical pooling architecture

　　对比如下：

3 Experiments

数据集

基线实验

SAGPool 的变体

4 Conclusion

　　本文提出了一种基于自注意的SAGPool图池化方法。我们的方法具有以下特征：分层池、同时考虑节点特征和图拓扑、合理的复杂度和端到端表示学习。SAGPool使用一致数量的参数，而不管输入图的大小如何。我们工作的扩展可能包括使用可学习的池化比率来获得每个图的最优聚类大小，并研究每个池化层中多个注意掩模的影响，其中最终的表示可以通过聚合不同的层次表示来获得。

论文解读（SAGPool）《Self-Attention Graph Pooling》的更多相关文章

论文解读《Deep Attention-guided Graph Clustering with Dual Self-supervision》
论文信息论文标题:Deep Attention-guided Graph Clustering with Dual Self-supervision论文作者:Zhihao Peng, Hui Liu ...
论文解读GALA《Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning》
论文信息 Title:<Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learn ...
论文解读（ChebyGIN）《Understanding Attention and Generalization in Graph Neural Networks》
论文信息论文标题:Understanding Attention and Generalization in Graph Neural Networks论文作者:Boris Knyazev, Gra ...
论文解读（GraphMAE）《GraphMAE: Self-Supervised Masked Graph Autoencoders》
论文信息论文标题:GraphMAE: Self-Supervised Masked Graph Autoencoders论文作者:Zhenyu Hou, Xiao Liu, Yukuo Cen, Y ...
论文解读（KP-GNN）《How Powerful are K-hop Message Passing Graph Neural Networks》
论文信息论文标题:How Powerful are K-hop Message Passing Graph Neural Networks论文作者:Jiarui Feng, Yixin Chen, ...
论文解读（SR-GNN）《Shift-Robust GNNs: Overcoming the Limitations of Localized Graph Training Data》
论文信息论文标题:Shift-Robust GNNs: Overcoming the Limitations of Localized Graph Training Data论文作者:Qi Zhu, ...
论文解读（LG2AR）《Learning Graph Augmentations to Learn Graph Representations》
论文信息论文标题:Learning Graph Augmentations to Learn Graph Representations论文作者:Kaveh Hassani, Amir Hosein ...
论文解读（GCC）《Efficient Graph Convolution for Joint Node RepresentationLearning and Clustering》
论文信息论文标题:Efficient Graph Convolution for Joint Node RepresentationLearning and Clustering论文作者:Chaki ...
论文解读（AGC）《Attributed Graph Clustering via Adaptive Graph Convolution》
论文信息论文标题:Attributed Graph Clustering via Adaptive Graph Convolution论文作者:Xiaotong Zhang, Han Liu, Qi ...

随机推荐

遇到MyBatis-Plus的错误之“Table 'mybatis_plus.user' doesn't exist”
一.问题 Table 'mybatis_plus.user' doesn't exist 二.原因表中没有user表三.解决方案生成user表既可四.结果图运行后显示查询出来的数据五.总结 ...
SQL函数对应的数据库(案例)
ROS环境变量的设置
一.前言(大神可以直接跳过) 本博客主要就是为了介绍ROS中环境变量的设置过程,还不是很了解ROS的可以去看一下我的博客,ROS简介-从零开始讲解ROS(适合超零基础阅读) ROS为什么需要设置环境变 ...
前端网络安全——前端XSS
XSS攻击:Cross Site Scripting(跨站脚本攻击) XSS攻击原理:程序+数据=结果,如果数据中包含了一部分程序,那么结果就会执行不属于站点的程序. XSS攻击能干什么?能注入Scr ...
vue配置请求转发解决跨域问题
通过nodejs的请求转发到后台,前端地址:http://localhost:8080 后端地址:http://localhost:8081 vue.config.js内容如下: let prox ...
java生成多级菜单树
使用java实现一个多级菜单树结构先上数据库 ps_pid字段很重要,是父级菜单的id Menu类 Menu类要新增一个字段,用来存放子菜单 /** * 子菜单列表 */ private List& ...
Redis 缓存击穿（失效）、缓存穿透、缓存雪崩怎么解决？
原始数据存储在 DB 中(如 MySQL.Hbase 等),但 DB 的读写性能低.延迟高. 比如 MySQL 在 4 核 8G 上的 TPS = 5000,QPS = 10000 左右,读写平均耗时 ...
linux mysql授权远程连接，创建用户等
1.进入mysql 2.此命令是为密码为 root .IP(%)任意的 root 用户授权.(*.* 表示数据库.表,to后为root用户:%:模糊查询,所有 IP 都可以,可指定其他主机 IP:by ...
Python学习报告及后续学习计划
第一次有学习Python的想法是源于寒假在家的时候,高中同学问我是否学了Python(用于深度学习),当时就到b站收藏了黑马最新的教学视频,但是"收藏过等于我看了",后续就是过完年 ...
HCIE笔记-第三节-数据链路层与MAC地址
如果数据进行封装时,基于E2或者802.3标准,此时我们称之为是一个以太网数据帧. E2和802.3作用:定义帧头和帧尾的格式. 以太网是现在局域网组网的唯一标准. 数据:对于下层的每个层级而言,上层 ...

论文解读（SAGPool）《Self-Attention Graph Pooling》