neural_transfer风格迁移

ContentLoss

首先是要定义一个内容差异损失函数，这里直接调用functional.mse_loss(input,self.target)就可以计算出其内容差异损失。

注意这里一般是定义一个网络模型，输入和输出一直，这样才在后面方便直接求出 loss

class ContentLoss(nn.Module):

    def __init__(self, target,):

        super(ContentLoss, self).__init__()

        # we 'detach' the target content from the tree used

        # to dynamically compute the gradient: this is a stated value,

        # not a variable. Otherwise the forward method of the criterion

        # will throw an error.

        self.target = target.detach()

    def forward(self, input):

        self.loss = F.mse_loss(input, self.target)

        return input

detach是使他不能求梯度

StyleLoss

这里是定义风格差异损失函数，首先就先定一个一个gram函数来求 c*c c为层数

def gram_matrix(input):

    a, b, c, d = input.size()  # a=batch size(=1)

    # b=number of feature maps

    # (c,d)=dimensions of a f. map (N=c*d)

    features = input.view(a * b, c * d)  # resise F_XL into \hat F_XL

    G = torch.mm(features, features.t())  # compute the gram product

    # we 'normalize' the values of the gram matrix

    # by dividing by the number of element in each feature maps.

    return G.div(a * b * c * d)

然后利用gram函数来求loss

class StyleLoss(nn.Module):

    def __init__(self, target_feature):

        super(StyleLoss, self).__init__()

        self.target = gram_matrix(target_feature).detach()

    def forward(self, input):

        G = gram_matrix(input)

        self.loss = F.mse_loss(G, self.target)

        return input

新定义网络

然后将 ContentLoss和 StyleLoss 加入新定义的迁移网络（normalization见下面）

cnn = models.vgg19(pretrained=True).features.to(device).eval()

content_layers_default = ['conv_4']

style_layers_default = ['conv_1', 'conv_2', 'conv_3', 'conv_4', 'conv_5']

def get_style_model_and_losses(cnn, normalization_mean, normalization_std,

                               style_img, content_img,

                               content_layers=content_layers_default,

                               style_layers=style_layers_default):

    cnn = copy.deepcopy(cnn)

    # normalization module

    normalization = Normalization(normalization_mean, normalization_std).to(device)

    # just in order to have an iterable access to or list of content/syle

    # losses

    content_losses = []

    style_losses = []

    # assuming that cnn is a nn.Sequential, so we make a new nn.Sequential

    # to put in modules that are supposed to be activated sequentially

    model = nn.Sequential(normalization)

   # model = nn.Sequential()

    i = 0  # increment every time we see a conv

    for layer in cnn.children():

        if isinstance(layer, nn.Conv2d):

            i += 1

            name = 'conv_{}'.format(i)

        elif isinstance(layer, nn.ReLU):

            name = 'relu_{}'.format(i)

            # The in-place version doesn't play very nicely with the ContentLoss

            # and StyleLoss we insert below. So we replace with out-of-place

            # ones here.

            #********注意**************不能掉 vgg默认inplace=ture

            layer = nn.ReLU(inplace=False)  

        elif isinstance(layer, nn.MaxPool2d):

            name = 'pool_{}'.format(i)

        elif isinstance(layer, nn.BatchNorm2d):

            name = 'bn_{}'.format(i)

        else:

            raise RuntimeError('Unrecognized layer: {}'.format(layer.__class__.__name__))

        model.add_module(name, layer)

        if name in content_layers:

            # add content loss:

            target = model(content_img).detach()

            content_loss = ContentLoss(target)

            model.add_module("content_loss_{}".format(i), content_loss)

            content_losses.append(content_loss)

        if name in style_layers:

            # add style loss:

            target_feature = model(style_img).detach()

            style_loss = StyleLoss(target_feature)

            model.add_module("style_loss_{}".format(i), style_loss)

            style_losses.append(style_loss)

    # now we trim off the layers after the last content and style losses

    for i in range(len(model) - 1, -1, -1):

        if isinstance(model[i], ContentLoss) or isinstance(model[i], StyleLoss):

            break

    model = model[:(i + 1)]

    return model, style_losses, content_losses

normalization模型

cnn_normalization_mean = torch.tensor([0.485, 0.456, 0.406]).to(device)

cnn_normalization_std = torch.tensor([0.229, 0.224, 0.225]).to(device)

# create a module to normalize input image so we can easily put it in a

# nn.Sequential

class Normalization(nn.Module):

    def __init__(self, mean, std):

        super(Normalization, self).__init__()

        # .view the mean and std to make them [C x 1 x 1] so that they can

        # directly work with image Tensor of shape [B x C x H x W].

        # B is batch size. C is number of channels. H is height and W is width.

        self.mean = torch.tensor(mean).view(-1, 1, 1)

        self.std = torch.tensor(std).view(-1, 1, 1)

    def forward(self, img):

        # normalize img

        return (img - self.mean) / self.std

输入参数定义

input_img = content_img.clone()

optimizer = optim.LBFGS([input_img.requires_grad_()])

两种方式

#input_param = nn.Parameter(input_img.data)

#optimizer = optim.LBFGS([input_param])

更新input_img

注意梯度共轭和LBFGS 方法更新梯度的时候要用闭包的形式

def run_style_transfer(cnn, normalization_mean, normalization_std,

                       content_img, style_img, input_img, num_steps=300,

                       style_weight=500000, content_weight=1):

    """Run the style transfer."""

    print('Building the style transfer model..')

    model, style_losses, content_losses = get_style_model_and_losses(cnn,

        normalization_mean, normalization_std, style_img, content_img)

   # optimizer = get_input_optimizer(input_img)

    print('Optimizing..')

    run = [0]

    while run[0] <= num_steps:

        def closure():

            # correct the values of updated input image

            input_img.data.clamp_(0, 1)

            optimizer.zero_grad()

            model(input_img)

            style_score = 0

            content_score = 0

            for sl in style_losses:

                style_score += sl.loss

            for cl in content_losses:

                content_score += cl.loss

            style_score *= style_weight

            content_score *= content_weight

            loss = style_score + content_score

            loss.backward()

            run[0] += 1

            if run[0] % 50 == 0:

                print("run {}:".format(run))

                print('Style Loss : {:4f} Content Loss: {:4f}'.format(

                    style_score.item(), content_score.item()))

                print()

            return style_score + content_score

        optimizer.step(closure)

    # a last correction...

    input_img.data.clamp_(0, 1)

    return input_img

其余代码参考 https://pytorch.org/tutorials/advanced/neural_style_tutorial.html#sphx-glr-advanced-neural-style-tutorial-py

neural_transfer风格迁移的更多相关文章

图像风格迁移（Pytorch）
图像风格迁移最后要生成的图片是怎样的是难以想象的,所以朴素的监督学习方法可能不会生效, Content Loss 根据输入图片和输出图片的像素差别可以比较损失 \(l_{content} = \fr ...
keras图像风格迁移
风格迁移: 在内容上尽量与基准图像保持一致,在风格上尽量与风格图像保持一致. 1. 使用预训练的VGG19网络提取特征 2. 损失函数之一是"内容损失"(content loss) ...
Gram格拉姆矩阵在风格迁移中的应用
Gram定义 n维欧式空间中任意k个向量之间两两的内积所组成的矩阵,称为这k个向量的格拉姆矩阵(Gram matrix) 根据定义可以看到,每个Gram矩阵背后都有一组向量,Gram矩阵就是由这一组向 ...
『cs231n』通过代码理解风格迁移
『cs231n』卷积神经网络的可视化应用文件目录 vgg16.py import os import numpy as np import tensorflow as tf from downloa ...
Keras实现风格迁移
风格迁移风格迁移算法经历多次定义和更新,现在应用在许多智能手机APP上. 风格迁移在保留目标图片内容的基础上,将图片风格引用在目标图片上. 风格本质上是指在各种空间尺度上图像中的纹理,颜色和视觉图案 ...
fast neural style transfer图像风格迁移基于tensorflow实现
引自:深度学习实践:使用Tensorflow实现快速风格迁移一.风格迁移简介风格迁移(Style Transfer)是深度学习众多应用中非常有趣的一种,如图,我们可以使用这种方法把一张图片的风格“ ...
Distill详述「可微图像参数化」：神经网络可视化和风格迁移利器！
近日,期刊平台 Distill 发布了谷歌研究人员的一篇文章,介绍一个适用于神经网络可视化和风格迁移的强大工具:可微图像参数化.这篇文章从多个方面介绍了该工具. 图像分类神经网络拥有卓越的图像生成能力 ...
ng-深度学习-课程笔记-14: 人脸识别和风格迁移(Week4)
1 什么是人脸识别( what is face recognition ) 在相关文献中经常会提到人脸验证(verification)和人脸识别(recognition). verification就 ...
[DeeplearningAI笔记]卷积神经网络4.6-4.10神经网络风格迁移
4.4特殊应用:人脸识别和神经网络风格转换觉得有用的话,欢迎一起讨论相互学习~Follow Me 4.6什么是神经网络风格转换neural style transfer 将原图片作为内容图片Cont ...

随机推荐

NOIP2018普及T2暨洛谷P5016 龙虎斗
题目链接:https://www.luogu.org/problemnew/show/P5016 分析: 这是一道模拟题.看到题目,我们首先要把它细致的读明白,模拟题特别考察细节,往往会有想不到的坑点 ...
个人永久性免费-Excel催化剂功能第97波-快递单号批量查询物流信息
电商时代,快递已进千万家,做电商零售行业的,快递信息的再挖掘,也显得更有意义,是数据精细化运营中必不可少的一环.一般站在系统的角度,数据用于业务流转的增删改查使用,而对于分析需求来说,这些业务系统里集 ...
java高并发系列 - 第16天：JUC中等待多线程完成的工具类CountDownLatch，必备技能
这是java高并发系列第16篇文章. 本篇内容介绍CountDownLatch及使用场景提供几个示例介绍CountDownLatch的使用手写一个并行处理任务的工具类假如有这样一个需求,当我们 ...
Shrio多Realm认证及认证策略
在大型的系统中,安全数据可能会存放在多个数据库中,而且每个数据的加密方式也是不一样的,那么单一的Realm就无法完成. 这时,就需要用到多Realm认证了,多Realm又涉及到认证策略,及在多个Rea ...
C#8.0 中使用默认接口成员更新接口
连载目录 [已更新最新开发文章,点击查看详细] 从 .NET Core 3.0 上的 C# 8.0 开始,可以在声明接口成员时定义实现. 最常见的方案是安全地将成员添加到已经由无数客户端发布并使 ...
[PTA] 数据结构与算法题目集 6-7 在一个数组中实现两个堆栈
//如果堆栈已满,Push函数必须输出"Stack Full"并且返回false:如果某堆栈是空的,则Pop函数必须输出"Stack Tag Empty"(其中 ...
Intellij IDEA 打开文件tab数量限制的调整
最近在阅读一个开源项目,发现IDEA对同时打开的文件TAB有限制,最多只能打开10个文件Tab,再多开就会把原来的Tab关闭了. 找了半天,在菜单中找到了调整方法,以Mac为例,调整路径为 “Inte ...
10w数组去重，排序，找最多出现次数（精华）
package cn.tedu.javaweb.test; import java.util.*; /* * @author XueWeiWei * @date 2019/6/11 8:19 */@S ...
Docker 容器基本操作[Docker 系列-2]
Docker 入门及安装[Docker 系列-1] 镜像就像是一个安装程序,而容器则是程序运行时的一个状态. 查看容器查看容器启动 docker 后,使用 docker ps 命令可以查看当前正 ...
ES 23 - 检索和过滤的区别 (query vs. filter)
目录 1 filter与query示例 1.1 准备测试数据 1.2 搜索测试 2 filter与query的区别 2.1 相关度处理上的不同 2.2 性能上的对比 2.3 对比结论本文以 ES 6 ...