Saliency Maps

This section looks at what goes on inside a CNN, following the paper Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps.

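When we backpropagate through a CNN we can also obtain the gradient with respect to the input image (the image gradient), but because the parameters the network learns are the weights W, this gradient normally goes unused. The paper visualizes the image gradient, calls the result a saliency map, and finds that it reflects how much weight the network places on the pixel values at different positions. The resulting maps can even assist with segmentation.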

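Put simply, given an image X, we want to know which parts of the image determine its final classification. Backpropagation gives us the matrix of partial derivatives of the loss function with respect to X; this matrix is the image gradient, and from it we compute the class saliency map (csm). Section 3.1 of Karen Simonyan's paper gives the recipe: for a grayscale image, the csm is the absolute value of the image gradient; for an RGB image, the csm takes, at each pixel, the largest absolute value of the gradient across the 3 channels. The magnitude of each csm entry indicates how strongly the corresponding pixel influences the final classification.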
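To make the per-pixel rule concrete, here is a minimal NumPy sketch on a made-up gradient for a single 2x2 RGB image. The helper name `saliency_from_gradient` and the numbers are illustrative assumptions, not part of the assignment code.

```python
import numpy as np

def saliency_from_gradient(dX):
    """Collapse an image gradient of shape (N, C, H, W) into maps of shape (N, H, W)."""
    # Absolute value first, then the maximum over the channel axis.
    return np.abs(dX).max(axis=1)

# Toy gradient for one 2x2 RGB image (values made up for illustration).
dX = np.array([[
    [[ 0.1, -0.9], [ 0.0,  0.2]],   # R channel
    [[-0.5,  0.3], [ 0.4, -0.1]],   # G channel
    [[ 0.2,  0.1], [-0.7,  0.0]],   # B channel
]])

saliency = saliency_from_gradient(dX)
print(saliency.shape)  # (1, 2, 2)
print(saliency[0])
```

Note that the top-right pixel gets its value from the R channel, whose gradient is strongly negative. Taking the maximum without the absolute value would pick the much smaller positive G-channel value instead.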

import numpy as np
from cs231n.layers import softmax_loss

def compute_saliency_maps(X, y, model):
  """
  Compute a class saliency map using the model for images X and labels y.

  Input:
  - X: Input images, of shape (N, 3, H, W)
  - y: Labels for X, of shape (N,)
  - model: A PretrainedCNN that will be used to compute the saliency map.

  Returns:
  - saliency: An array of shape (N, H, W) giving the saliency maps for the input
    images.
  """
  saliency = None
  ##############################################################################
  # TODO: Implement this function. You should use the forward and backward     #
  # methods of the PretrainedCNN class, and compute gradients with respect to  #
  # the unnormalized class score of the ground-truth classes in y.             #
  ##############################################################################
  # Forward pass to get the class scores.
  scores, cache = model.forward(X)
  # Gradient of the softmax loss with respect to the scores.
  loss, dscores = softmax_loss(scores, y)
  # Backpropagate to the input to get the image gradient dX, of shape (N, 3, H, W).
  dX, grads = model.backward(dscores, cache)
  # Per pixel, take the largest absolute gradient across the 3 color channels.
  saliency = np.abs(dX).max(axis=1)
  return saliency
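The TODO asks for gradients with respect to the unnormalized class score, while the implementation above backpropagates the softmax loss instead. As a rough sketch of the score-based variant, the example below uses a hypothetical one-layer linear model in place of PretrainedCNN (the `forward` and `backward` helpers here are stand-ins, not the cs231n API) and seeds backpropagation with a one-hot upstream gradient that selects each image's ground-truth score.

```python
import numpy as np

rng = np.random.default_rng(0)
N, C, H, W, num_classes = 2, 3, 4, 4, 5

# Hypothetical stand-in for a CNN: one linear layer on the flattened image.
weights = rng.standard_normal((C * H * W, num_classes))
X = rng.standard_normal((N, C, H, W))
y = np.array([1, 3])

def forward(X):
    """Class scores of shape (N, num_classes)."""
    return X.reshape(X.shape[0], -1) @ weights

def backward(dscores, X_shape):
    """Gradient with respect to the input, given the upstream gradient on the scores."""
    return (dscores @ weights.T).reshape(X_shape)

scores = forward(X)

# One-hot upstream gradient: d(score of the correct class) / d(scores).
dscores = np.zeros_like(scores)
dscores[np.arange(N), y] = 1.0

dX = backward(dscores, X.shape)
saliency = np.abs(dX).max(axis=1)
print(saliency.shape)  # (2, 4, 4)
```

For this linear toy model the image gradient is just the weight column of the correct class, which makes the result easy to check by hand. With the real network, the analogous step would presumably be to pass such a one-hot `dscores` to `model.backward`.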

Fooling Images

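Given a class label, what kind of input image does the CNN want to see for it? We can treat the image as the variable, hold the model's weights fixed, and optimize the objective

$$X^* = \arg\max_X s_y(X),$$

where $s_y(X)$ is the score the model assigns to image X for the given class label y.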
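The implementation in this section does not maximize the raw score directly; it takes descent steps on the softmax loss for the target label. A short derivation suggests why the two pull in the same direction. The notation is standard softmax notation and an assumption of this note rather than something from the original post: $s_k$ are the class scores and $p_k$ the softmax probabilities.

```latex
L = -\log p_y, \qquad
p_k = \frac{e^{s_k}}{\sum_j e^{s_j}}, \qquad
\frac{\partial L}{\partial s_k} = p_k - \mathbb{1}[k = y]
```

Since $p_y - 1 \le 0$ and $p_k \ge 0$ for every $k \ne y$, the negative gradient of $L$ with respect to the scores raises the target score and lowers every other score, each in proportion to its softmax probability. Backpropagating that signal to the image and stepping against it therefore moves the image toward the target class.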

def make_fooling_image(X, target_y, model):
  """
  Generate a fooling image that is close to X, but that the model classifies
  as target_y.

  Inputs:
  - X: Input image, of shape (1, 3, 64, 64)
  - target_y: An integer in the range [0, 100)
  - model: A PretrainedCNN

  Returns:
  - X_fooling: An image that is close to X, but that is classified as target_y
    by the model.
  """
  X_fooling = X.copy()
  ##############################################################################
  # TODO: Generate a fooling image X_fooling that the model will classify as   #
  # the class target_y. Use gradient ascent on the target class score, using   #
  # the model.forward method to compute scores and the model.backward method   #
  # to compute image gradients.                                                #
  #                                                                            #
  # HINT: For most examples, you should be able to generate a fooling image    #
  # in fewer than 100 iterations of gradient ascent.                           #
  ##############################################################################
  i = 0
  while True:
    print(i)  # iteration counter, to monitor progress
    scores, cache = model.forward(X_fooling, mode='test')
    # Stop as soon as the model predicts the target class.
    if scores[0].argmax() == target_y:
      break
    # Softmax loss against the target class gives the gradient at the score layer.
    loss, dscores = softmax_loss(scores, target_y)
    # Backpropagate to get the image gradient.
    dX, grads = model.backward(dscores, cache)
    # Update the image; the learning rate is deliberately huge for the sake of fooling.
    X_fooling -= dX * 1000
    i += 1
  return X_fooling
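The same loop can be tried without the pretrained network. The sketch below swaps in a hypothetical linear classifier for PretrainedCNN and a local re-implementation of the softmax loss (not the cs231n one), uses a much smaller step size than the 1000 above, and caps the number of iterations. It then reports how far the fooling image has moved from the original. All names, shapes, and constants here are assumptions chosen for illustration.

```python
import numpy as np

def softmax_loss(scores, y):
    """Softmax cross-entropy loss and its gradient with respect to the scores."""
    shifted = scores - scores.max(axis=1, keepdims=True)  # for numerical stability
    probs = np.exp(shifted)
    probs /= probs.sum(axis=1, keepdims=True)
    n = scores.shape[0]
    loss = -np.log(probs[np.arange(n), y]).mean()
    dscores = probs.copy()
    dscores[np.arange(n), y] -= 1.0
    return loss, dscores / n

rng = np.random.default_rng(0)
weights = rng.standard_normal((3 * 8 * 8, 10))  # hypothetical linear "model"
X = rng.standard_normal((1, 3, 8, 8))

original_class = int((X.reshape(1, -1) @ weights).argmax())
target_y = (original_class + 1) % 10            # any class other than the original

X_fooling = X.copy()
for i in range(1000):                            # cap instead of looping forever
    scores = X_fooling.reshape(1, -1) @ weights
    if scores[0].argmax() == target_y:
        break
    loss, dscores = softmax_loss(scores, np.array([target_y]))
    dX = (dscores @ weights.T).reshape(X.shape)  # image gradient
    X_fooling -= 0.005 * dX                      # descent on the target-class loss

relative_change = np.linalg.norm(X_fooling - X) / np.linalg.norm(X)
print("iterations:", i, "relative change:", relative_change)
```

Capping the iterations is a design choice made for safety in this sketch; the `while True` loop above only terminates once the model actually predicts the target class.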

