Paper notes: "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks"

Code: https://github.com/junyanz/CycleGAN

Abstract

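The abstract introduces the notion of image-to-image translation (grayscale to color, image to semantic labels, edge map to photograph) and states the motivation of the work, which is to learn image style translation without paired training images: "We present an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples." The authors want to learn a mapping G that translates images from domain X into domain Y and, conversely, a mapping F that translates images from domain Y back into domain X, where the training images of the two domains are not paired. Each direction has its own discriminator D, trained adversarially: the generator tries to fool it while the discriminator tries to tell real from generated, so that a generated image y' = G(x) becomes indistinguishable from a real image y of that domain. Coupling the two directions (and vice versa) extends the original GAN structure into a cycle.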

Introduction

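This may be the most poetic introduction in any computer science paper. The authors spend some space on the observation that a human can imagine how any real scene would look as a Monet-style painting, even a scene Monet never painted, and ask whether a computer can do the same. Doing so would sidestep the high cost of collecting, producing, and annotating paired training sets. The introduction then explains why the GAN is extended with a cycle: an image from domain X can be mapped to many different outputs that all match the distribution of domain Y, so the adversarial objective alone leaves the mapping under-constrained. Adding a reverse mapping that closes the loop tightens the constraint and also helps avoid mode collapse, a common problem in GANs. The authors call this property cycle consistency.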

Related Work

The related work the authors draw on covers GANs, Image-to-Image Translation, Unpaired Image-to-Image Translation, Neural Style Transfer, and Cycle Consistency.

Model

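The model's loss has two parts:

(1) Adversarial Loss:

For the mapping G: X -> Y with discriminator D_Y:

$$\mathcal{L}_{GAN}(G, D_Y, X, Y) = \mathbb{E}_{y \sim p_{data}(y)}[\log D_Y(y)] + \mathbb{E}_{x \sim p_{data}(x)}[\log(1 - D_Y(G(x)))]$$

The mapping F: Y -> X has an analogous adversarial loss, $\mathcal{L}_{GAN}(F, D_X, Y, X)$.

(2) Cycle Consistency Loss:

$$\mathcal{L}_{cyc}(G, F) = \mathbb{E}_{x \sim p_{data}(x)}[\|F(G(x)) - x\|_1] + \mathbb{E}_{y \sim p_{data}(y)}[\|G(F(y)) - y\|_1]$$

Full objective:

$$\mathcal{L}(G, F, D_X, D_Y) = \mathcal{L}_{GAN}(G, D_Y, X, Y) + \mathcal{L}_{GAN}(F, D_X, Y, X) + \lambda \mathcal{L}_{cyc}(G, F)$$

$$G^*, F^* = \arg\min_{G, F} \max_{D_X, D_Y} \mathcal{L}(G, F, D_X, D_Y)$$

Here $\lambda$ weights the cycle consistency term relative to the two adversarial terms.

The experiments later in the paper show the effect of each of these losses visually and indicate that none of them can be dropped.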
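To make the cycle consistency term concrete, here is a minimal sketch in plain Python. It is not the authors' code: images are stood in for by flat lists of floats, the generators are toy functions, the L1 norm is averaged per pixel (a normalization choice of this sketch), and the names `l1_distance`, `cycle_consistency_loss`, and `full_objective` as well as the default `lam=10.0` are assumptions made for illustration.

```python
from typing import Callable, List, Sequence

Image = List[float]  # hypothetical stand-in: an image flattened to pixel values
Generator = Callable[[Image], Image]


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two flattened images."""
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)


def cycle_consistency_loss(G: Generator, F: Generator,
                           xs: Sequence[Image], ys: Sequence[Image]) -> float:
    """Average of ||F(G(x)) - x||_1 over xs plus ||G(F(y)) - y||_1 over ys."""
    forward = sum(l1_distance(F(G(x)), x) for x in xs) / len(xs)
    backward = sum(l1_distance(G(F(y)), y) for y in ys) / len(ys)
    return forward + backward


def full_objective(gan_g: float, gan_f: float, cyc: float,
                   lam: float = 10.0) -> float:
    """Two adversarial terms plus the weighted cycle term (lam is assumed)."""
    return gan_g + gan_f + lam * cyc


# Toy generators: G shifts pixels up by 1, F shifts them back down by 1.
G = lambda x: [v + 1.0 for v in x]
F = lambda y: [v - 1.0 for v in y]
F_bad = lambda y: list(y)  # does not undo G, so the cycle is broken

xs = [[0.0, 0.5], [0.25, 0.75]]
ys = [[1.0, 1.5], [1.25, 1.75]]

loss_good = cycle_consistency_loss(G, F, xs, ys)
loss_bad = cycle_consistency_loss(G, F_bad, xs, ys)
```

When F exactly inverts G the loss is zero; when F ignores what G did, every pixel is off by one in both directions and the loss grows accordingly. This is the pressure that keeps the two mappings consistent with each other.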

Implementation

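The architecture follows [3], which performs well on both style transfer and super-resolution, and uses instance normalization. For the discriminators D, the authors use 70×70 PatchGANs, which classify whether 70×70 image patches are real or fake; such a patch-level discriminator has fewer parameters than a full-image discriminator [4,5,6].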
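The "70×70" figure is the receptive field of one discriminator output unit. The short calculation below shows how such a number can arise. The layer configuration is an assumption of this sketch (five 4×4 convolutions with strides 2, 2, 2, 1, 1 and padding 1, as commonly seen in public PatchGAN implementations); the notes above do not spell it out, and the helper names are hypothetical.

```python
from typing import List, Tuple

Layer = Tuple[int, int]  # (kernel_size, stride)

# Assumed PatchGAN stack: five 4x4 convolutions, strides 2, 2, 2, 1, 1.
PATCHGAN_LAYERS: List[Layer] = [(4, 2), (4, 2), (4, 2), (4, 1), (4, 1)]


def receptive_field(layers: List[Layer]) -> int:
    """Side length of the input patch seen by a single output unit."""
    rf, jump = 1, 1
    for kernel, stride in layers:
        rf += (kernel - 1) * jump  # each layer widens the view by (k - 1) input steps
        jump *= stride             # striding spreads neighbouring outputs further apart
    return rf


def output_size(input_size: int, layers: List[Layer], padding: int = 1) -> int:
    """Spatial size of the output map for a square input (padding is assumed)."""
    size = input_size
    for kernel, stride in layers:
        size = (size + 2 * padding - kernel) // stride + 1
    return size


patch = receptive_field(PATCHGAN_LAYERS)
grid = output_size(256, PATCHGAN_LAYERS)
print(patch, grid)
```

Under these assumptions each output unit sees a 70×70 patch, and a 256×256 input yields a 30×30 grid of real/fake scores rather than a single score for the whole image.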

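In practice, the authors replace the original GAN objective with a least-squares loss [2], which trains more stably and produces higher-quality results. For the pair G and D_Y (and analogously for F and D_X), G is trained to minimize

$$\mathbb{E}_{x \sim p_{data}(x)}[(D_Y(G(x)) - 1)^2]$$

and D_Y is trained to minimize

$$\mathbb{E}_{y \sim p_{data}(y)}[(D_Y(y) - 1)^2] + \mathbb{E}_{x \sim p_{data}(x)}[D_Y(G(x))^2]$$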
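As a rough illustration of the least-squares objective, the sketch below computes both sides of it from lists of discriminator scores. It is plain Python rather than a training loop, and the function names are hypothetical; real scores are pushed toward 1 and generated scores toward 0, which is the usual least-squares GAN convention.

```python
from typing import Sequence


def lsgan_discriminator_loss(d_real: Sequence[float],
                             d_fake: Sequence[float]) -> float:
    """Mean of (D(y) - 1)^2 on real samples plus mean of D(G(x))^2 on fakes."""
    real_term = sum((d - 1.0) ** 2 for d in d_real) / len(d_real)
    fake_term = sum(d ** 2 for d in d_fake) / len(d_fake)
    return real_term + fake_term


def lsgan_generator_loss(d_fake: Sequence[float]) -> float:
    """Mean of (D(G(x)) - 1)^2: the generator wants fakes scored as real."""
    return sum((d - 1.0) ** 2 for d in d_fake) / len(d_fake)


# A discriminator that separates real from fake perfectly.
d_loss_perfect = lsgan_discriminator_loss([1.0, 1.0], [0.0, 0.0])
g_loss_caught = lsgan_generator_loss([0.0, 0.0])

# A discriminator that is undecided about the generated samples.
g_loss_undecided = lsgan_generator_loss([0.5, 0.5])
```

Unlike the log loss, the penalty grows quadratically with the distance from the target score, so samples that are scored confidently but wrongly still produce a useful gradient.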

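In addition, to reduce model oscillation [1], the discriminators D_X and D_Y are updated with a lag: they are trained on a buffer of about 50 previously generated images rather than only on the images the generators have just produced.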
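One way such a history buffer could work is sketched below. The class name `ImageHistoryBuffer`, the 50/50 swap rule, and the injectable `rng` are assumptions of this sketch, not details stated in the notes: once the buffer is full, each new generated image is either passed through to the discriminator as-is, or swapped with a randomly chosen older image that is returned in its place.

```python
import random
from typing import Any, List, Optional


class ImageHistoryBuffer:
    """Keeps up to `capacity` generated images and mixes old ones back in."""

    def __init__(self, capacity: int = 50,
                 rng: Optional[random.Random] = None) -> None:
        self.capacity = capacity
        self.images: List[Any] = []
        self.rng = rng or random.Random()

    def query(self, image: Any) -> Any:
        """Return the image the discriminator should train on this step."""
        if len(self.images) < self.capacity:
            # Still filling up: store the new image and use it directly.
            self.images.append(image)
            return image
        if self.rng.random() < 0.5:
            # Swap: hand back an older image and keep the new one for later.
            idx = self.rng.randrange(self.capacity)
            old = self.images[idx]
            self.images[idx] = image
            return old
        # Otherwise use the fresh image and leave the buffer unchanged.
        return image


# Integers stand in for generated images so the behaviour is easy to follow.
buffer = ImageHistoryBuffer(capacity=3, rng=random.Random(0))
warmup = [buffer.query(i) for i in range(3)]
later = [buffer.query(i) for i in range(3, 20)]
```

Because the discriminator keeps seeing samples from earlier generator states, it cannot simply chase the generator's most recent output, which is the lagging effect described above.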

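Experimental results (omitted)

Limitations

CycleGAN's successes on unpaired image sets are concentrated in translations that change color and texture; translations that require geometric changes mostly fail (e.g., cat -> dog). In addition, its results still fall short of those obtained by training on paired datasets.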

1. Y. Taigman, A. Polyak, and L. Wolf. Unsupervised cross-domain image generation. arXiv preprint arXiv:1611.02200, 2016.

2. Multiclass generative adversarial networks with the l2 loss function.

3. J. Johnson, A. Alahi, and L. Fei-Fei. Perceptual losses for real-time style transfer and super-resolution. In ECCV, pages 694–711. Springer, 2016.

4. P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros. Image-to-image translation with conditional adversarial networks. arXiv preprint arXiv:1611.07004, 2016.

5. C. Ledig, L. Theis, F. Huszár, J. Caballero, A. Cunningham, A. Acosta, A. Aitken, A. Tejani, J. Totz, Z. Wang, et al. Photo-realistic single image super-resolution using a generative adversarial network. arXiv preprint arXiv:1609.04802, 2016.

6. C. Li and M. Wand. Precomputed real-time texture synthesis with Markovian generative adversarial networks. ECCV, 2016.
