《Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks》论文笔记

Code Address：https://github.com/junyanz/CycleGAN.

Abstract

引出Image Translating的概念（greyscale to color, image to semantic labels, edge-map to photograph.），并申明了本作的动机，不使用 image pairs来训练图片的风格转换：We present an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples.作者希望能学习一个映射maping G，将域A中的图片转换到域B的图片中，，反之，也建立一个映射F，将域B中的图片转成域A中的图片，两个域的训练集图片并不是成对出现。转换后的图片需要分别定义自己的D来做训练，达到欺骗和识别的对抗训练，使得生成在本域的图片y'和实际属于本域的图片y不可被分辨，这样在训练时，可以将原有的GAN结构扩展为cycle的形式（and vice versa）.

Introduction

可能是计算机paper里最富诗情画意的introduction：，随后作者用一定篇幅剖析了人类可以将任何现实中看到的场景映射成莫奈风格的画作，哪怕莫奈从没画过这些场景，那么计算机是否也可以做到这一点呢？这样得以解决现实中成批出现的训练集需要耗费极高的采集、制作、标注成本的难题。接着进一步阐述了为什么要用循环的方式来扩展GAN，因为从A到B域映射出来的图片可能有非常多的可能，并且都满足B域的分布，加入一个反向映射的循环，可以加强转换的约束性，同时还能避免GAN中常见的mode collapse的问题，作者称其为cycle consistent。

Relate Work

作者借鉴的RelatedWork包括： GAN、Image-to-Image Translation、Unpaired Image-to-Image Translation、Neural Style Transfer、Cycle Consistency

Model

模型的Loss方面分为两个部分：

（1）Adversarial Loss：

　　　　对于G:X->Y的映射有

　　　　　对于F：Y->X的映射也有类似的一个对抗损失

（2）Cycle Consistency Loss：

最终目标函数：

在后面的实验中，将这几个loss的作用都进行了直观的展示，表明缺一不可。

实现

模型架构基于[3],在风格转换和超分辨率上都表现不错，使用了instance normalization。并且对D，使用了70*70的PatchGANs，判别70*70的像素的真伪，相对于全像素判别的D减少了参数[4,5,6]。

具体实现中，作者使用了更稳定，生成质量更高的最小二乘GAN的Loss来替换原始GAN（least square loss）[2]：

并且为了避免模式震荡（mode oscillation）[1]，作者对Dx和Dy做了一个滞后更新，用之前生成的50张左右图片来训练D而不是实时用G生成的图片来生成

实验结果（略）

不足

CycleGAN对非成对图片集的转换成功主要集中在色彩和贴图转换上，在几何形态上的转换大多以失败告终（猫->狗）。此外，与成对数据集的训练结果相比，依然存在不足。

1.Y. Taigman, A. Polyak, and L. Wolf. Unsupervised cross-domain image generation. arXiv preprint arXiv:1611.02200, 2016

2.Multiclass generative adversarial networks with the l2 loss function.

3.J. Johnson, A. Alahi, and L. Fei-Fei. Perceptual losses for real-time style transfer and super-resolution. In ECCV, pages 694–711. Springer, 2016.

4.P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros. Imageto-image translation with conditional adversarial networks. arXiv preprint arXiv:1611.07004, 2016

5. C. Ledig, L. Theis, F. Husz´ar, J. Caballero, A. Cunningham,A. Acosta, A. Aitken, A. Tejani, J. Totz, Z. Wang, et al. Photo-realistic single image superresolution using a generative adversarial network. arXiv preprint arXiv:1609.04802, 2016. 5
6.C. Li and M. Wand. Precomputed real-time texture synthesis with markovian generative adversarial networks. ECCV, 2016. 5

《Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks》论文笔记的更多相关文章

《Vision Permutator: A Permutable MLP-Like ArchItecture For Visual Recognition》论文笔记
论文题目:<Vision Permutator: A Permutable MLP-Like ArchItecture For Visual Recognition> 论文作者:Qibin ...
[place recognition]NetVLAD: CNN architecture for weakly supervised place recognition 论文翻译及解析（转）
https://blog.csdn.net/qq_32417287/article/details/80102466 abstract introduction method overview Dee ...
论文笔记系列-Auto-DeepLab:Hierarchical Neural Architecture Search for Semantic Image Segmentation
Pytorch实现代码:https://github.com/MenghaoGuo/AutoDeeplab 创新点 cell-level and network-level search 以往的NAS ...
论文笔记——Rethinking the Inception Architecture for Computer Vision
1. 论文思想 factorized convolutions and aggressive regularization. 本文给出了一些网络设计的技巧. 2. 结果用5G的计算量和25M的参数. ...
论文笔记：Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells 2019-04- ...
论文笔记：ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware 2019-03-19 16:13:18 Pape ...
论文笔记：DARTS: Differentiable Architecture Search
DARTS: Differentiable Architecture Search 2019-03-19 10:04:26accepted by ICLR 2019 Paper:https://arx ...
论文笔记：Progressive Neural Architecture Search
Progressive Neural Architecture Search 2019-03-18 20:28:13 Paper:http://openaccess.thecvf.com/conten ...
论文笔记：Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation
Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation2019-03-18 14:4 ...
论文笔记系列-DARTS: Differentiable Architecture Search
Summary 我的理解就是原本节点和节点之间操作是离散的,因为就是从若干个操作中选择某一个,而作者试图使用softmax和relaxation(松弛化)将操作连续化,所以模型结构搜索的任务就转变成了 ...

随机推荐

bzoj3142[Hnoi2013]数列组合
Description 小 T最近在学着买股票,他得到内部消息:F公司的股票将会疯涨.股票每天的价格已知是正整数,并且由于客观上的原因,最多只能为N.在疯涨的K天中小T观察到:除第一天外每天的股价都 ...
Hihocoder #1067 : 最近公共祖先·二
时间限制:10000ms 单点时限:1000ms 内存限制:256MB 描述上上回说到,小Hi和小Ho用非常拙劣——或者说粗糙的手段山寨出了一个神奇的网站,这个网站可以计算出某两个人的所有共同祖先中 ...
PHP 基础复习 2018-06-21
(1)PHP Zip File 函数 $zip = zip_open("test.zip"); if ($zip) { while ($zip_entry = zip_read($ ...
Codeforces 658C Bear and Forgotten Tree 3【构造】
题目链接: http://codeforces.com/contest/658/problem/C 题意: 给定结点数,树的直径(两点的最长距离),树的高度(1号结点距离其他结点的最长距离),写出树边 ...
CodeForces 592A PawnChess
简单暴力模拟. #include<cstdio> #include<cstring> #include<cmath> #include<algorithm&g ...
webstorm 添加markdown支持
第一步:file---setting---plugins---搜索markdown support 安装第二步:file---settind---file types---增加*.md处理
[Poj3744]Scout YYF I （概率dp + 矩阵乘法）
Scout YYF I Time Limit: 1000MS Memory Limit: 65536K Total Submissions: 9552 Accepted: 2793 Descr ...
开源项目SwipeBackLayout的问题处理
在安卓系统4.4会出现滑动时底层没有之前的activity界面?解决:在主界面设置如下: <item name="android:windowIsTranslucent"&g ...
DTrace scripts for Mac OS X
http://www.cnblogs.com/Proteas/p/3727297.html http://dtrace.org/blogs/brendan/2011/10/10/top-10-dtra ...
sklearn特征工程总结
转自: http://www.cnblogs.com/jasonfreak/p/5448385.html https://www.zhihu.com/question/28641663/answer/ ...

《Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks》论文笔记

《Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks》论文笔记的更多相关文章

随机推荐

热门专题