[ Wechat：Y466551 | 付费咨询，非诚勿扰 ]

论文信息

论文标题：Dynamic Weighted Learning for Unsupervised Domain Adaptation

论文作者：Jihong Ouyang、Zhengjie Zhang、Qingyi Meng
论文来源：2023 aRxiv
论文地址：download
论文代码：download
视屏讲解：click

1 介绍

2 方法

2.1 出发点

　　反应的问题：随着域对齐的实现，判别性在下降；

2.2 模型框架

2.3 Sample Weighting

　　为避免由于源域和目标域样本数量差距过大导致模型产生倾向性，本文对每个域的样本进行加权：

　　　　$\begin{array}{l}\hat{x}_{i}^{s}=a\left(1+\frac{n_{t}}{n_{s}}\right) x_{i}^{s} \quad, \quad i=1,2, \ldots, n_{s} \\\hat{x}_{j}^{t}=a\left(1+\frac{n_{s}}{n_{t}}\right) x_{j}^{t} \quad, \quad j=1,2, \ldots, n_{t}\end{array} $

　　其中，$a \in(0,1]$ 是一个控制样本加权程度的超参数。

2.4 Domain Alignment Learning and Class Discrimination Learning

　　域对齐(对抗性学习)：

　　　　$\begin{array}{r} \underset{\theta_{g}}{\text{min}} \; \underset{\theta_{d}}{\text{max}} \; \mathcal{L}_{d a}\left(\theta_{g}, \theta_{d}\right)=\mathbb{E}_{x_{i}^{s} \sim \mathcal{D}_{s}} \log \left[D\left(G\left(\hat{x}_{i}^{s}\right)\right)\right] +\mathbb{E}_{x_{j}^{t} \sim \mathcal{D}_{t}} \log \left[1-D\left(G\left(\hat{x}_{j}^{t}\right)\right)\right]\end{array}$

　　鉴别性特征学习：

　　　　$\begin{aligned} \underset{\theta_{g}, \theta_{c}}{\text{min}} \; \underset{\theta_{c_{1}}, \theta_{c_{2}}}{\text{max}} \; \mathcal{L}_{c d} & \left(\theta_{g}, \theta_{c}, \theta_{c_{1}}, \theta_{c_{1}}\right) \\= & \mathbb{E}_{x_{j}^{t} \sim \mathcal{D}_{t}}\left\|C_{1}\left(G\left(\hat{x}_{j}^{t}\right)\right)-C_{2}\left(G\left(\hat{x}_{j}^{t}\right)\right)\right\|_{1} \\& +\left\|C\left(G\left(\hat{x}_{j}^{t}\right)\right)-C_{1}\left(G\left(\hat{x}_{j}^{t}\right)\right)\right\|_{1} \\& +\left\|C\left(G\left(\hat{x}_{j}^{t}\right)\right)-C_{2}\left(G\left(\hat{x}_{j}^{t}\right)\right)\right\|_{1}\end{aligned}$

　　Note：$C$、$C_{1}$、$C_{2}$ 是使用源域数据预训练得到的分类器。首先，固定 $G$ 和 $C$ 最大化 $C_1$ 和 $C_2$ 的差异。然后，固定 $C_{1}$ 和 $C_{2}$ 训练 $G$ 和 $C$。

2.5 Dynamic Weighted Learning

　　域对齐度量 [ MMD ]：

　　　　$\operatorname{MMD}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)=\left\|\mathbb{E}_{x_{i}^{s} \sim \mathcal{D}_{s}} G\left(\hat{x}_{i}^{s}\right)-\mathbb{E}_{x_{j}^{t} \sim \mathcal{D}_{t}} G\left(\hat{x}_{j}^{t}\right)\right\|^{2}$

　　鉴别性度量 [ LDA ]：

　　　　$\underset{\mathbf{W}}{\text{max}} \; J(\mathbf{W})=\frac{\operatorname{tr}\left(\mathbf{W}^{\top} \mathbf{S}_{\mathbf{b}} \mathbf{W}\right)}{\operatorname{tr}\left(\mathbf{W}^{\top} \mathbf{S}_{\mathbf{w}} \mathbf{W}\right)}$

　　其中，$\mathbf{S}_{\mathrm{b}}$ 为类间散射矩阵，$\mathbf{S}_{\mathbf{w}}$ 为类内散射矩阵。

　　注意：$J(\mathbf{W})$ 越大，具有更好的辨别性。

　　由于上述两个评价标准不在一个数量级上，本文对其进行了归一化处理：

　　　　$\begin{array}{l}\operatorname{\text{M}} \tilde{\text{M}} \text{D}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)=\frac{\operatorname{MMD}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)-\operatorname{MMD}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)_{\min }}{\operatorname{MMD}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)_{\max }-\operatorname{MMD}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)_{\min }} \end{array}$

　　　　$\tilde{J}(\mathbf{W})=\frac{J(\mathbf{W})-J(\mathbf{W})_{\min }}{J(\mathbf{W})_{\max }-J(\mathbf{W})_{\min }}$

　　构造一个动态平衡因子：

　　　　$\tau=\frac{\operatorname{M} \tilde{\mathbf{M}}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)}{\operatorname{M} \tilde{\mathbf{M}}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)+(1-\tilde{J}(\mathbf{W}))}$

　　注意：$\text{M} \tilde{\text{M}} \text{D}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)$ 越小代表这域对齐效果越好，$1-\tilde{J}(\mathbf{W})$ 越小代表这鉴别性特征越好。

当域对齐的程度远优于类的可辨别性时，$\text{M} \tilde{\text{M}} \text{D}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)$ 接近 $0$，$1-\tilde{J}(\mathbf{W}) $ 接近 $1$ ，$\tau$ 接近 $0$ ；
当域对齐程度远低于类别识别程度时，$\text{M} \tilde{\text{M}} \text{D}\left(\mathcal{D}_{s}, \mathcal{D}_{t}\right)$ 接近 $1$，$1-\tilde{J}(\mathbf{W}) $ 接近 $0$ ，$\tau$ 接近 $1$ ；

　　基于 $\tau$ 的良好特性，采用 $\tau$ 作为域对齐损失的权重，$1−\tau $ 作为类鉴别损失的权重。因此，得到的域对齐和类鉴别的动态加权模型如下：

　　　　$\begin{array}{l} \underset{\theta_{g}, \theta_{c}}{\text{min}} \;\; \underset{\theta_{\theta_{d}, \theta_{c_{1}}, \theta_{c_{2}}}}{\text{max}} \tau \cdot \mathcal{L}_{d a}\left(\theta_{g}, \theta_{d}\right)+ (1-\tau) \cdot \mathcal{L}_{c d}\left(\theta_{g}, \theta_{c}, \theta_{c_{1}}, \theta_{c_{2}}\right)\end{array}$

当领域对齐学习的有效性远远低于类辨别学习时，模型增加了域对齐学习的权重；
当鉴别学习的学习效果远低于域对齐学习时，模型增加鉴别学习的权重；

　　在这种动态加权学习机制下，可以保持域对齐学习与类辨别学习之间的一致性，从而避免过度的域对齐或类可辨别性。

2.6 Overall Training Objective

　　总体训练目标整合了样本加权、领域对齐学习、类判别学习和动态加权学习。此外，还需要最小化标记源样本的期望源误差。最终的极大极小目标：

　　　　$\begin{array}{l}\underset{\theta_{g}, \theta_{c}}{\text{min}} \;\;\underset{\theta_{d}, \theta_{c_{1}}, \theta_{c_{2}}}{\text{max}}\sum_{i=1}^{t_{s}} \mathcal{L}_{c e}\left(C\left(G\left(x_{i}^{s} ; \theta_{g}\right) ; \theta_{c}\right), y_{i}^{s}\right) +\tau \cdot \mathcal{L}_{d a}\left(\theta_{g}, \theta_{d}\right)+(1-\tau) \cdot \mathcal{L}_{c d}\left(\theta_{g}, \theta_{c}, \theta_{c_{1}}, \theta_{c_{2}}\right)\end{array}$

3 实验

分类结果

收敛性分析

　　对于每个子图，红色曲线的左轴表示分类误差，蓝色曲线的右轴表示平衡因子 $\tau$ 的值。可以发现，随着迭代，它们两者都逐渐收敛到一个平坦的值。这意味着随着 $\tau$ 的减少，使得类的可鉴别性被强调，使得分类误差也减小。

　　在迭代过程中，当 $\tau$ 的变化相对明显时，识别精度的提高也相对明显。我们将 $\tau$ 的初始值设为 $0.5$，可以发现 $\tau$ 在第一个时期急剧下降到 $0.5$ 以下，说明该模型的对齐性相对较好，但可辨别性相对较差。

混淆矩阵可视化

对齐度和可鉴别性度的分析

消融实验

论文解读（DWL）《Dynamic Weighted Learning for Unsupervised Domain Adaptation》的更多相关文章

论文解读（CDCL）《Cross-domain Contrastive Learning for Unsupervised Domain Adaptation》
论文信息论文标题:Cross-domain Contrastive Learning for Unsupervised Domain Adaptation论文作者:Rui Wang, Zuxuan ...
迁移学习（DCCL）《Domain Confused Contrastive Learning for Unsupervised Domain Adaptation》
论文信息论文标题:Domain Confused Contrastive Learning for Unsupervised Domain Adaptation论文作者:Quanyu Long, T ...
论文解读（CDTrans）《CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation》
论文信息论文标题:CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation论文作者:Tongkun Xu, Weihu ...
论文解读（ToAlign）《ToAlign: Task-oriented Alignment for Unsupervised Domain Adaptation》
论文信息论文标题:ToAlign: Task-oriented Alignment for Unsupervised Domain Adaptation论文作者:Guoqiang Wei, Cuil ...
迁移学习（CLDA）《CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation》
论文信息论文标题:CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation论文作者:Ankit Singh论文来源:NeurI ...
论文解读（CAN）《Contrastive Adaptation Network for Unsupervised Domain Adaptation》
论文信息论文标题:Contrastive Adaptation Network for Unsupervised Domain Adaptation论文作者:Guoliang Kang, Lu Ji ...
迁移学习《Asymmetric Tri-training for Unsupervised Domain Adaptation》
论文信息论文标题:Asymmetric Tri-training for Unsupervised Domain Adaptation论文作者:Kuniaki Saito, Y. Ushiku, T ...
虚假新闻检测（CADM）《Unsupervised Domain Adaptation for COVID-19 Information Service with Contrastive Adversarial Domain Mixup》
论文信息论文标题:Unsupervised Domain Adaptation for COVID-19 Information Service with Contrastive Adversari ...
迁移学习（IIMT）——《Improve Unsupervised Domain Adaptation with Mixup Training》
论文信息论文标题:Improve Unsupervised Domain Adaptation with Mixup Training论文作者:Shen Yan, Huan Song, Nanxia ...
迁移学习（TSRP）《Improving Pseudo Labels With Intra-Class Similarity for Unsupervised Domain Adaptation》
论文信息论文标题:Improving Pseudo Labels With Intra-Class Similarity for Unsupervised Domain Adaptation论文作者 ...

随机推荐

SSH客户端常用工具SecureCRT操作
目录 1.1 SecureCRT工具介绍 1.2 SecureCRT工具安装 1.3配置SecureCRT连接Linux主机 1.4调整SecureCRT终端显示和回滚缓冲区大小 1.5调整字体及光标 ...
Git&GitHub简介与入手（三）
五.eclipse中的git使用 1.eclipse中本地库操作 (检测显示创建成功): (设置本地库范围签名): (一些git里的图标,保存着不同内容): https://github.com/gi ...
2020-10-23：go中channel的创建流程是什么？
福哥答案2020-10-23:1.元素大小是否小于2的16次方,否则throw.2.对齐检查,否则throw.3.元素大小和容量的乘积不能超出范围,否则panic.4.生成*hchan,设置buf.4 ...
2022-04-14：小美有一个长度为n的数组，为了使得这个数组的和尽量大，她向会魔法的小团进行求助。小团可以选择数组中至多两个不相交的子数组，并将区间里的数全都变为原来的10倍。小团想知道他
2022-04-14:小美有一个长度为n的数组, 为了使得这个数组的和尽量大,她向会魔法的小团进行求助. 小团可以选择数组中至多两个不相交的子数组, 并将区间里的数全都变为原来的10倍. 小团想知道他 ...
2021-11-27：给定一个数组arr，长度为N，做出一个结构，可以高效的做如下的查询: 1) int querySum(L,R) : 查询arr[L...R]上的累加和； 2) int query
2021-11-27:给定一个数组arr,长度为N,做出一个结构,可以高效的做如下的查询: int querySum(L,R) : 查询arr[L-R]上的累加和; int queryAim(L,R) ...
＜form＞表单中的action和method使用方法
<form action="" method="post"> form是表单里面的内容是要提交出去的. action 是链接点击浏览选择 ...
prefetch_related() 一对多、多对多查询优化，反向查询
prefetch_related() 一对多.多对多查询优化,反向查询 Student.objects.filter(age__lt=30).prefetch_related('course') # ...
从前后端的角度分析options预检请求
摘要:options预检请求是干嘛的?options请求一定会在post请求之前发送吗?前端或者后端开发需要手动干预这个预检请求吗?不用文档定义堆砌名词,从前后端角度单独分析,大白话带你了解! 本文分 ...
回溯理论基础及leetcode例题
学习参考回溯与递归相辅相成;回溯是递归的副产品,只要有递归就会有回溯. 回溯函数也就是递归函数,指的都是一个函数. 回溯搜索法纯暴力搜索解决的问题组合问题:N个数里面按一定规则找出k个数的集 ...
ent M2M模型在pxc集群中的一个大坑
ent M2M模型在pxc集群中的一个大坑事故简要分析 PXC集群3个节点,在插入数据时,如果使用数据库自己生成的主键,一般顺序为1,4,7,10- 这里就是坑的源头,在ent底层代码中,在做M2M ...

论文解读（DWL）《Dynamic Weighted Learning for Unsupervised Domain Adaptation》