Paper Information

Paper title: Towards Robust False Information Detection on Social Networks with Contrastive Learning
Authors: Chunyuan Yuan, Qianwen Ma, Wei Zhou, Jizhong Han, Songlin Hu
Venue: CIKM, 2022
Paper link: download
Code: download

1 Introduction

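　　Problem: slight perturbations of the conversation graph cause the predictions of existing models to collapse.

　　The paper studies two broad categories of data augmentation strategies that corrupt the conversation graph: node-based perturbations and topology-based perturbations.

　　Contributions:

　　(1) Proposes the RDCL framework, which gives robust results for false information detection. The framework uses contrastive learning to improve the model's awareness of perturbation signals from multiple perspectives.

　　(2) Shows that hard positive sample pairs improve the effectiveness of contrastive learning.

　　(3) Proposes HPG, an effective hard sample pair generation method that strengthens contrastive learning and lets the model learn more robust representations.

　　(4) Demonstrates the effectiveness of the model through comparative experiments and ablation studies across different GNNs and two datasets.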

2 Methodology

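　　Problem definition: predict the label of an undirected conversation graph.

　　The overall framework consists of data perturbation (2.1), contrastive perturbation learning (2.2), perturbation sample pair generation (2.3), and a joint training objective (2.4).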

2.1 Data Perturbations

node-based data perturbation

Comments contain noise (CN)

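　　Among the non-root nodes, sample nodes at rate $\rho $. Build a noise matrix whose rows are drawn from a Gaussian distribution for the sampled nodes and filled with 0 for the unsampled nodes, then add it to the node features:

　　　　$X_{CN}^{-r}=X^{-r}+X_{Gaussian}^{-r}$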
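　　A minimal sketch of the CN perturbation, not the authors' implementation. It assumes node features are stored as a list of rows with the root at index 0, that "sampling rate" means drawing a fixed fraction $\rho$ of the non-root nodes, and that the noise has zero mean. The function name `comments_contain_noise` and the parameters `sigma` and `seed` are hypothetical.

```python
import random

def comments_contain_noise(x, rho, sigma=1.0, seed=None):
    """Add Gaussian noise to a rho-fraction of the non-root rows of x.

    Assumption: x is a list of feature vectors and row 0 is the root post.
    """
    rng = random.Random(seed)
    non_root = list(range(1, len(x)))
    sampled = set(rng.sample(non_root, round(rho * len(non_root))))
    out = []
    for idx, row in enumerate(x):
        if idx in sampled:
            # Sampled comment: its noise row is drawn from N(0, sigma^2)
            out.append([v + rng.gauss(0.0, sigma) for v in row])
        else:
            # Root and unsampled comments: their noise row is all zeros
            out.append(list(row))
    return out, sampled

x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 10.0]]
x_cn, sampled = comments_contain_noise(x, rho=0.5, seed=0)
```

　　With four comments and $\rho = 0.5$, two comment rows receive noise while the root row and the other two rows are copied unchanged.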

Comments are deleted (CD)

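　　Among the non-root nodes, sample nodes at rate $\rho $ and set their feature vectors to 0, where $D^{-r}$ is a 0/1 mask whose rows are 0 for the sampled nodes: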

　　　　$X_{C D}^{-r}=X^{-r} \odot D^{-r}$
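　　The masking step can be sketched as follows. This is an illustration under assumptions: features are a list of rows with the root at index 0, and a fixed fraction $\rho$ of non-root nodes is dropped. The name `comments_deleted` is hypothetical.

```python
import random

def comments_deleted(x, rho, seed=None):
    """Zero out the feature rows of a rho-fraction of non-root nodes."""
    rng = random.Random(seed)
    non_root = list(range(1, len(x)))
    dropped = set(rng.sample(non_root, round(rho * len(non_root))))
    # Mask D: a row of zeros for dropped nodes, a row of ones otherwise
    d = [[0.0 if i in dropped else 1.0] * len(row) for i, row in enumerate(x)]
    # Element-wise (Hadamard) product X * D
    x_cd = [[v * m for v, m in zip(row, mask)] for row, mask in zip(x, d)]
    return x_cd, dropped

x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 10.0]]
x_cd, dropped = comments_deleted(x, rho=0.5, seed=0)
```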

Comments are exchangeable (CE)

　　Among the non-root nodes, sample nodes at rate $\rho $ and exchange their feature vectors.
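　　The notes do not say how the sampled feature vectors are exchanged. The sketch below assumes a cyclic rotation among the sampled nodes, which guarantees that every sampled node receives another sampled node's features whenever at least two nodes are sampled. The name `comments_exchanged` is hypothetical.

```python
import random

def comments_exchanged(x, rho, seed=None):
    """Exchange feature rows among a rho-fraction of non-root nodes."""
    rng = random.Random(seed)
    non_root = list(range(1, len(x)))
    sampled = rng.sample(non_root, round(rho * len(non_root)))
    out = [list(row) for row in x]
    # Assumption: rotate the sampled rows by one position
    for k, node in enumerate(sampled):
        src = sampled[(k + 1) % len(sampled)]
        out[node] = list(x[src])
    return out, sampled

x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 10.0]]
x_ce, sampled = comments_exchanged(x, rho=0.5, seed=0)
```

　　The set of feature rows is unchanged; only their assignment to nodes differs, so the text content stays the same while the graph structure it attaches to is perturbed.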

topology-based data perturbation

Propagation sub-structure is removed (PR)

　　Among the non-root nodes, randomly select a subset of nodes and delete the subgraph they form.
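　　One possible reading of PR, sketched under assumptions: the conversation graph is a tree given as `(parent, child)` edges with the root at node 0, and "the subgraph they form" is taken to mean each selected node together with all of its descendants. The name `remove_substructure` is hypothetical.

```python
import random

def remove_substructure(edges, num_nodes, rho, seed=None):
    """Remove the subtrees rooted at a rho-fraction of non-root nodes."""
    rng = random.Random(seed)
    children = {i: [] for i in range(num_nodes)}
    for parent, child in edges:
        children[parent].append(child)
    non_root = list(range(1, num_nodes))
    picked = rng.sample(non_root, round(rho * len(non_root)))
    removed = set()
    stack = list(picked)
    while stack:
        # Depth-first walk that marks each picked node and its descendants
        node = stack.pop()
        if node not in removed:
            removed.add(node)
            stack.extend(children[node])
    kept_edges = [(p, c) for p, c in edges
                  if p not in removed and c not in removed]
    return kept_edges, removed

edges = [(0, 1), (0, 2), (1, 3), (1, 4), (3, 5)]
kept_edges, removed = remove_substructure(edges, num_nodes=6, rho=0.2, seed=0)
```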

Propagation structure is uncertain (PU)

　　Sample edges at rate $\rho $ and delete them:

　　　　$A_{P U}=A-A_{\text {drop }}$
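　　A small sketch of the edge-dropping step on a dense adjacency matrix. It assumes an undirected graph stored as a symmetric 0/1 nested list and a fixed fraction $\rho$ of edges being dropped. The name `drop_edges` is hypothetical.

```python
import random

def drop_edges(adj, rho, seed=None):
    """Return A - A_drop, where A_drop marks a rho-fraction of the edges."""
    rng = random.Random(seed)
    n = len(adj)
    # Each undirected edge is listed once, using the upper triangle
    edges = [(i, j) for i in range(n) for j in range(i + 1, n) if adj[i][j]]
    dropped = rng.sample(edges, round(rho * len(edges)))
    a_drop = [[0] * n for _ in range(n)]
    for i, j in dropped:
        a_drop[i][j] = a_drop[j][i] = 1
    a_pu = [[adj[i][j] - a_drop[i][j] for j in range(n)] for i in range(n)]
    return a_pu, dropped

n = 5
adj = [[0] * n for _ in range(n)]
for i, j in [(0, 1), (0, 2), (1, 3), (1, 4)]:
    adj[i][j] = adj[j][i] = 1
a_pu, dropped = drop_edges(adj, rho=0.5, seed=0)
```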

Propagation structure is incorrect (PI)

　　Randomly select two nodes $C_i$ and $C_j$. For node $C_i$, delete the edge between it and its parent, then add an edge between $C_j$ and $C_i$.
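　　The rewiring can be sketched on a parent array. Two restrictions here are assumptions added for the sketch and are not stated in the notes: $C_j$ is not chosen from the subtree of $C_i$ (so the result stays a tree), and $C_j$ differs from the current parent of $C_i$ (so the structure actually changes). The names `rewire_parent` and `reaches_root` are hypothetical.

```python
import random

def reaches_root(parent, node):
    """Return True if following parent pointers from node ends at the root."""
    for _ in range(len(parent)):
        if parent[node] is None:
            return True
        node = parent[node]
    return False

def rewire_parent(parent, seed=None):
    """Detach a random non-root node C_i and attach it under a new node C_j.

    parent[c] is the parent of node c; parent[0] is None for the root.
    """
    rng = random.Random(seed)
    c_i = rng.randrange(1, len(parent))

    def in_subtree(node):
        # Walk upward; the node is in C_i's subtree if the walk passes C_i
        while node is not None:
            if node == c_i:
                return True
            node = parent[node]
        return False

    candidates = [c for c in range(len(parent))
                  if not in_subtree(c) and c != parent[c_i]]
    if not candidates:
        return list(parent), None
    c_j = rng.choice(candidates)
    new_parent = list(parent)
    new_parent[c_i] = c_j  # drop edge (old parent, C_i), add edge (C_j, C_i)
    return new_parent, (c_i, c_j)

parent = [None, 0, 0, 1, 1, 3]
new_parent, move = rewire_parent(parent, seed=0)
```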

2.2 Contrastive Perturbation Learning

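　　For each graph, apply different data augmentation strategies to obtain two augmented graphs and their graph-level representations, then use the NT-Xent loss as the self-supervised loss: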

　　　　${\large \mathcal{L}_{\mathrm{SSL}}=-\log \frac{\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right)}{\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right)+\sum\limits _{N e g} \exp \left(z_{m}^{i} \cdot z_{n e g} / \tau\right)}} $

　　Note: every feature vector $z_{m}^{i}, z_{m}^{j},z_{\text {neg }}$ must be $l_2$-normalized.
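　　The loss above, for a single positive pair, can be sketched in plain Python. The function names and the temperature value `tau=0.5` are illustrative assumptions; a real implementation would operate on batched tensors.

```python
import math

def l2_normalize(v):
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def nt_xent(z_i, z_j, negatives, tau=0.5):
    """NT-Xent loss for one positive pair (z_i, z_j) and a list of negatives."""
    z_i, z_j = l2_normalize(z_i), l2_normalize(z_j)
    negatives = [l2_normalize(z) for z in negatives]
    pos = math.exp(dot(z_i, z_j) / tau)
    neg = sum(math.exp(dot(z_i, z) / tau) for z in negatives)
    return -math.log(pos / (pos + neg))

# Aligned positive pair, one orthogonal negative
loss_aligned = nt_xent([1.0, 0.0], [2.0, 0.0], [[0.0, 3.0]])
# Positive pair at 90 degrees, negative aligned with the anchor
loss_misaligned = nt_xent([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
```

　　Because of the normalization, only the directions of the vectors matter: the aligned pair gives a small loss and the misaligned pair a larger one.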

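　　Assumption: graphs with the same label are treated as positive pairs. Each batch contains $P$ graphs; together with the $2P$ graphs generated by data augmentation, there are $3P$ graphs in total. The supervised contrastive loss is: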

　　　　${\large \mathcal{L}_{S C L}=-\frac{1}{3 P} \log \frac{\sum\limits _{Y_{s}=Y_{m}} \exp \left(z_{m} \cdot z_{s} / \tau\right)}{\sum\limits_{Y_{s}=Y_{m}} \exp \left(z_{m} \cdot z_{s} / \tau\right)+\sum\limits_{Y_{d} \neq Y_{m}} \exp \left(z_{m} \cdot z_{d} / \tau\right)}} $

　　[Contrastive loss between the anchor and the augmented graphs]
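　　A minimal sketch of the label-based loss, averaged over all graphs in the batch. It assumes the embeddings are already $l_2$-normalized, that the anchor itself is excluded from its positives, and that every label occurs at least twice (which holds when each graph comes with its two augmented views). The name `supervised_contrastive_loss` is hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def supervised_contrastive_loss(z, labels, tau=0.5):
    """Average over anchors of -log(pos / (pos + neg)), grouped by label."""
    n = len(z)
    total = 0.0
    for m in range(n):
        # Same-label graphs are positives (assumption: anchor excluded)
        pos = sum(math.exp(dot(z[m], z[s]) / tau)
                  for s in range(n) if s != m and labels[s] == labels[m])
        # Different-label graphs are negatives
        neg = sum(math.exp(dot(z[m], z[d]) / tau)
                  for d in range(n) if labels[d] != labels[m])
        total += -math.log(pos / (pos + neg))
    return total / n

z = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
loss_separated = supervised_contrastive_loss(z, labels=[1, 1, 0, 0])
loss_mixed = supervised_contrastive_loss(z, labels=[1, 0, 1, 0])
```

　　When embeddings cluster by label the loss is small; when labels cut across the clusters it is much larger.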

2.3 Perturbation Sample Pairs Generation

　　Self-supervised loss:

　　　　$\begin{aligned}\mathcal{L}_{\mathrm{SSL}}=&-z_{m}^{i} \cdot z_{m}^{j} / \tau +\log \left(\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right)+\sum\limits_{\mathrm{Neg}} \exp \left(z_{m}^{i} \cdot z_{n e g} / \tau\right)\right)\end{aligned}$

　　[Contrastive loss between the augmented graphs]

　　The gradient of the above $\mathcal{L}_{\text{SSL}}$ with respect to $z_{m}^{i}$ is:

　　　　$\begin{aligned}\frac{\partial \mathcal{L}_{SSL}}{\partial z_{m}^{i}} &=-\frac{1}{\tau}\left(z_{m}^{j}-\frac{\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right) z_{m}^{j}+\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right) z_{neg}}{\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right)+\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right)}\right) \\&=-\frac{\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right)\left[\left(z_{m}^{j}-z_{m}^{i}\right)-\left(z_{neg}-z_{m}^{i}\right)\right]}{\tau\left(\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right)+\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right)\right)} \\&=-\frac{1}{C_{1} \tau}\left(\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right)\left(z_{m}^{j}-z_{m}^{i}\right)+C_{2}\right)\end{aligned}$

　　where:

　　　　$C_{1}=\exp \left(z_{m}^{i} \cdot z_{m}^{j} / \tau\right)+\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right)$

　　　　$C_{2}=-\sum\limits_{Neg} \exp \left(z_{m}^{i} \cdot z_{neg} / \tau\right)\left(z_{neg}-z_{m}^{i}\right)$
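　　Collecting terms in the first line of the gradient gives the compact form $-\frac{1}{C_{1} \tau} \sum_{Neg} \exp (z_{m}^{i} \cdot z_{neg} / \tau)(z_{m}^{j}-z_{neg})$. The sketch below checks this closed form against a central finite difference of $\mathcal{L}_{SSL}$. It treats $z_{m}^{i}$ as an unconstrained vector (the normalization step is ignored), and all function names and numeric values are illustrative.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def ssl_loss(z_i, z_j, negs, tau):
    pos = dot(z_i, z_j) / tau
    denom = math.exp(pos) + sum(math.exp(dot(z_i, z) / tau) for z in negs)
    return -pos + math.log(denom)

def ssl_grad(z_i, z_j, negs, tau):
    """Closed form: -(1 / (C1 * tau)) * sum_neg w_neg * (z_j - z_neg)."""
    weights = [math.exp(dot(z_i, z) / tau) for z in negs]
    c1 = math.exp(dot(z_i, z_j) / tau) + sum(weights)
    return [-sum(w * (z_j[k] - z[k]) for w, z in zip(weights, negs)) / (c1 * tau)
            for k in range(len(z_i))]

def numeric_grad(z_i, z_j, negs, tau, eps=1e-6):
    grad = []
    for k in range(len(z_i)):
        plus, minus = list(z_i), list(z_i)
        plus[k] += eps
        minus[k] -= eps
        diff = ssl_loss(plus, z_j, negs, tau) - ssl_loss(minus, z_j, negs, tau)
        grad.append(diff / (2 * eps))
    return grad

z_i, z_j = [0.6, 0.8], [0.8, 0.6]
negs = [[-1.0, 0.0], [0.0, -1.0], [0.6, -0.8]]
analytic = ssl_grad(z_i, z_j, negs, tau=0.5)
numeric = numeric_grad(z_i, z_j, negs, tau=0.5)
```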

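　　In the numerator of $\text{Eq.7}$, the gradient contribution comes mainly from ($z_{m}^{j}-z_{m}^{i}$). Therefore, increasing the distance between the two samples of a pair in the graph-level space yields a larger gradient signal, which makes the learning task harder for the model and improves the quality of contrastive learning. This motivates the paper's contrastive view generation method, HPG.

　　Figure 5 shows that the augmented graphs generated by HPG are less similar to each other than those produced by other augmentation methods, so the SSL loss penalizes the model more strongly, which improves the quality of contrastive learning.

　　Although the perturbations make learning harder, they retain enough information to preserve consistency between the views.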
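　　The effect of a harder positive pair on the loss can be illustrated with a toy example. This is not HPG itself, whose generation procedure is not reproduced in these notes; it only shows that, with the negatives held fixed, the SSL loss grows as the positive pair becomes less similar. The embeddings and the name `ssl_loss_for_angle` are made up for the illustration.

```python
import math

def ssl_loss_for_angle(theta, tau=0.5):
    """SSL loss when the positive view z_j is rotated by theta away from z_i."""
    z_i = [1.0, 0.0]
    z_j = [math.cos(theta), math.sin(theta)]
    negatives = [[-1.0, 0.0], [0.0, -1.0]]  # fixed, unit-norm negatives
    pos = math.exp((z_i[0] * z_j[0] + z_i[1] * z_j[1]) / tau)
    neg = sum(math.exp((z_i[0] * n[0] + z_i[1] * n[1]) / tau) for n in negatives)
    return -math.log(pos / (pos + neg))

angles = [0.0, math.pi / 6, math.pi / 3]
losses = [ssl_loss_for_angle(t) for t in angles]
```

　　The losses increase with the angle, that is, as the cosine similarity of the positive pair drops.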

2.4 Training Objective

　　Graph classification loss:

　　　　$\mathcal{L}_{C E}=-y \log \left(\hat{y}_{1}\right)-(1-y) \log \left(1-\hat{y}_{0}\right)$

　　Total loss:

　　　　$\mathcal{L}_{\text {joint }}(\theta)=\mathcal{L}_{C E}+\alpha \mathcal{L}_{S S L}+\beta \mathcal{L}_{S C L}$
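　　How the three terms combine can be sketched as below. The cross-entropy is written in the standard binary form with `p1` as the predicted probability of the positive class, which is an assumption about how the notes' $\hat{y}$ terms are to be read. The weights `alpha` and `beta` and all numeric values are placeholders, not the paper's settings.

```python
import math

def binary_cross_entropy(y, p1, eps=1e-12):
    """Standard binary cross-entropy; p1 is the predicted probability of y = 1."""
    p1 = min(max(p1, eps), 1 - eps)  # clamp to avoid log(0)
    return -y * math.log(p1) - (1 - y) * math.log(1 - p1)

def joint_loss(l_ce, l_ssl, l_scl, alpha, beta):
    """L_joint = L_CE + alpha * L_SSL + beta * L_SCL."""
    return l_ce + alpha * l_ssl + beta * l_scl

l_ce = binary_cross_entropy(y=1, p1=0.5)
total = joint_loss(l_ce=1.0, l_ssl=2.0, l_scl=3.0, alpha=0.5, beta=0.1)
```

　　Setting `alpha` or `beta` to 0 switches off the corresponding contrastive term, which is a convenient way to set up component ablations.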

3 Experiment

3.1 Datasets

3.2 Performance Comparison

3.3 Robustness Studies

　　Using the six data augmentation strategies of this paper, GACL is compared with the proposed method.

3.4 The robustness on different perturbation scenarios

　　Comparative experiments with combinations of complex data augmentation strategies.

3.5 Ablation Studies

　　Compares the following six data augmentation strategies: Node Mask, Edge Drop, Mixed, Node-based, Topology-based, and the proposed method HPG.

Ablation studies on model components

3.6 Graph-level Representation Studies

3.7 The Impact of Perturbation Probability $\rho$

　　Comparative experiments with different perturbation rates and different encoders.
