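Understanding the voc-fcn-alexnet network structure

I. Preface

FCN (fully convolutional network) was the first approach to use a CNN for semantic segmentation. Paper: Fully Convolutional Networks for Semantic Segmentation

Implementation: https://github.com/shelhamer/fcn.berkeleyvision.org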

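A fully convolutional network relies on three main techniques:

1. Convolutionalization (Convolutional)

2. Upsampling (Upsample)

3. Skip architecture (Skip Layer)

To keep things easy to follow, I use the simplest variant, voc-fcn-alexnet, as the example. It uses only the first two techniques and has no skip architecture.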
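Convolutionalization can be checked numerically. The sketch below is only an illustration with made-up toy sizes (`C`, `K`, `N` and the helper names are hypothetical, not taken from the FCN code): it reshapes the weights of a fully connected layer into convolution kernels, confirms that both give the same result on an input of the original size, and then applies the kernels to a larger input to obtain a spatial map of outputs.

```python
import numpy as np

def dense(x, weights):
    """Fully connected layer: flatten a (C, k, k) input and multiply by (N, C*k*k) weights."""
    return weights @ x.reshape(-1)

def conv_valid(x, kernels):
    """Stride-1, unpadded convolution in the Caffe sense (cross-correlation).
    x: (C, H, W); kernels: (N, C, k, k); returns (N, H-k+1, W-k+1)."""
    n, _, k, _ = kernels.shape
    _, h, w = x.shape
    out = np.zeros((n, h - k + 1, w - k + 1))
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            patch = x[:, i:i + k, j:j + k]
            out[:, i, j] = np.tensordot(kernels, patch, axes=([1, 2, 3], [0, 1, 2]))
    return out

rng = np.random.default_rng(0)
C, K, N = 4, 3, 5                      # toy sizes, chosen only for illustration
fc_w = rng.standard_normal((N, C * K * K))
kernels = fc_w.reshape(N, C, K, K)     # the same numbers, viewed as N kernels

small = rng.standard_normal((C, K, K))          # the size the FC layer expects
large = rng.standard_normal((C, K + 2, K + 4))  # a larger input

same_on_small = np.allclose(dense(small, fc_w), conv_valid(small, kernels)[:, 0, 0])
heatmap_shape = conv_valid(large, kernels).shape
print(same_on_small, heatmap_shape)
```

On the small input the two layers agree; on the larger input the convolution simply slides the former fully connected layer over every position, which is what turns a classifier into a dense predictor.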

II. The train.prototxt file of voc-fcn-alexnet

layer {
  name: "data"
  type: "Python"
  top: "data"
  top: "label"
  python_param {
    module: "voc_layers"
    layer: "SBDDSegDataLayer"
    param_str: "{\'sbdd_dir\': \'../data/sbdd/dataset\', \'seed\': 1337, \'split\': \'train\', \'mean\': (104.00699, 116.66877, 122.67892)}"
  }
}
layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}
layer {
  name: "pool1"
  type: "Pooling"
  bottom: "conv1"
  top: "pool1"
  pooling_param {
    pool: MAX
    kernel_size:
    stride:
  }
}
layer {
  name: "norm1"
  type: "LRN"
  bottom: "pool1"
  top: "norm1"
  lrn_param {
    local_size:
    alpha: 0.0001
    beta: 0.75
  }
}
layer {
  name: "conv2"
  type: "Convolution"
  bottom: "norm1"
  top: "conv2"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu2"
  type: "ReLU"
  bottom: "conv2"
  top: "conv2"
}
layer {
  name: "pool2"
  type: "Pooling"
  bottom: "conv2"
  top: "pool2"
  pooling_param {
    pool: MAX
    kernel_size:
    stride:
  }
}
layer {
  name: "norm2"
  type: "LRN"
  bottom: "pool2"
  top: "norm2"
  lrn_param {
    local_size:
    alpha: 0.0001
    beta: 0.75
  }
}
layer {
  name: "conv3"
  type: "Convolution"
  bottom: "norm2"
  top: "conv3"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu3"
  type: "ReLU"
  bottom: "conv3"
  top: "conv3"
}
layer {
  name: "conv4"
  type: "Convolution"
  bottom: "conv3"
  top: "conv4"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu4"
  type: "ReLU"
  bottom: "conv4"
  top: "conv4"
}
layer {
  name: "conv5"
  type: "Convolution"
  bottom: "conv4"
  top: "conv5"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu5"
  type: "ReLU"
  bottom: "conv5"
  top: "conv5"
}
layer {
  name: "pool5"
  type: "Pooling"
  bottom: "conv5"
  top: "pool5"
  pooling_param {
    pool: MAX
    kernel_size:
    stride:
  }
}
layer {
  name: "fc6"
  type: "Convolution"
  bottom: "pool5"
  top: "fc6"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu6"
  type: "ReLU"
  bottom: "fc6"
  top: "fc6"
}
layer {
  name: "drop6"
  type: "Dropout"
  bottom: "fc6"
  top: "fc6"
  dropout_param {
    dropout_ratio: 0.5
  }
}
layer {
  name: "fc7"
  type: "Convolution"
  bottom: "fc6"
  top: "fc7"
  convolution_param {
    num_output:
    pad:
    kernel_size:
    group:
    stride:
  }
}
layer {
  name: "relu7"
  type: "ReLU"
  bottom: "fc7"
  top: "fc7"
}
layer {
  name: "drop7"
  type: "Dropout"
  bottom: "fc7"
  top: "fc7"
  dropout_param {
    dropout_ratio: 0.5
  }
}
layer {
  name: "score_fr"
  type: "Convolution"
  bottom: "fc7"
  top: "score_fr"
  param {
    lr_mult:
    decay_mult:
  }
  param {
    lr_mult:
    decay_mult:
  }
  convolution_param {
    num_output:
    pad:
    kernel_size:
  }
}
layer {
  name: "upscore"
  type: "Deconvolution"
  bottom: "score_fr"
  top: "upscore"
  param {
    lr_mult:
  }
  convolution_param {
    num_output:
    bias_term: false
    kernel_size:
    stride:
  }
}
layer {
  name: "score"
  type: "Crop"
  bottom: "upscore"
  bottom: "data"
  top: "score"
  crop_param {
    axis:
    offset:
  }
}
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "score"
  bottom: "label"
  top: "loss"
  loss_param {
    ignore_label:
    normalize: true
  }
}

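III. Network structure

Assume the input image is 500x500.

The train.prototxt file gives the network structure. Besides the first five convolutional layers, the three layers that follow them (fc6, fc7 and score_fr in the listing) have also been turned into convolutional layers. score_fr is the last of these convolutional layers, and its output is also called the heatmap: the high-level feature map that matters most, holding one score per class at each spatial position. Once the heatmap is available, the last and most important step is to upsample it (with a deconvolution), enlarging it back to the size of the original image.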
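To make the sizes concrete, the sketch below traces the spatial size of each layer for a 500x500 input. The listing leaves the numeric hyperparameters blank, so every kernel size, stride and pad used here is an assumption: they are the values I believe the reference voc-fcn-alexnet definition uses (standard AlexNet layers, a pad of 100 on conv1, a 63x63 deconvolution with stride 32) and should be checked against the repository. The output-size formulas are the usual Caffe ones: convolution rounds down, pooling rounds up. The LRN, ReLU and Dropout layers do not change the size and are left out.

```python
import math

def conv_out(size, kernel, stride=1, pad=0):
    """Caffe convolution: floor((size + 2*pad - kernel) / stride) + 1."""
    return (size + 2 * pad - kernel) // stride + 1

def pool_out(size, kernel, stride):
    """Caffe pooling without padding rounds up: ceil((size - kernel) / stride) + 1."""
    return math.ceil((size - kernel) / stride) + 1

def deconv_out(size, kernel, stride):
    """Caffe deconvolution without padding: (size - 1) * stride + kernel."""
    return (size - 1) * stride + kernel

def trace_sizes(input_size):
    """Spatial size per layer. All hyperparameters below are ASSUMED values."""
    sizes = {"data": input_size}
    sizes["conv1"] = conv_out(sizes["data"], kernel=11, stride=4, pad=100)
    sizes["pool1"] = pool_out(sizes["conv1"], kernel=3, stride=2)
    sizes["conv2"] = conv_out(sizes["pool1"], kernel=5, pad=2)
    sizes["pool2"] = pool_out(sizes["conv2"], kernel=3, stride=2)
    sizes["conv3"] = conv_out(sizes["pool2"], kernel=3, pad=1)
    sizes["conv4"] = conv_out(sizes["conv3"], kernel=3, pad=1)
    sizes["conv5"] = conv_out(sizes["conv4"], kernel=3, pad=1)
    sizes["pool5"] = pool_out(sizes["conv5"], kernel=3, stride=2)
    sizes["fc6"] = conv_out(sizes["pool5"], kernel=6)
    sizes["fc7"] = conv_out(sizes["fc6"], kernel=1)
    sizes["score_fr"] = conv_out(sizes["fc7"], kernel=1)
    sizes["upscore"] = deconv_out(sizes["score_fr"], kernel=63, stride=32)
    sizes["score"] = sizes["data"]  # the Crop layer cuts upscore down to data's size
    return sizes

sizes = trace_sizes(500)
for name, size in sizes.items():
    print(f"{name:9s} {size} x {size}")
```

Under these assumptions the heatmap score_fr is 16x16, the deconvolution enlarges it to 543x543, and the Crop layer trims that to the 500x500 input so that every input pixel receives a score vector.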

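IV. Loss function

The loss function of this network is SoftmaxWithLoss. A softmax is computed first, giving for every pixel the probability that it belongs to each class. Because there are 21 classes in total, every pixel has 21 probability values (the output has 21 channels). The loss is then the negative of the average, over all pixels, of the log-probability of each pixel's true class.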
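Written out, with p_i(c) the softmax probability of class c at pixel i, y_i the true class of that pixel and N the number of pixels that are counted, the loss is L = -(1/N) * sum_i log p_i(y_i). The NumPy sketch below is a minimal illustration of that computation, not Caffe's implementation. The function name is hypothetical, and the `ignore_label` default of 255 is an assumption (the value is blank in the listing); pixels carrying that label are skipped, and the sum is divided by the number of remaining pixels, which is how I read `normalize: true`.

```python
import numpy as np

def softmax_with_loss(scores, labels, ignore_label=255):
    """Mean negative log-probability of the true class over non-ignored pixels.
    scores: (C, H, W) raw class scores; labels: (H, W) integer class indices.
    ignore_label=255 is an ASSUMED value, not one taken from the listing."""
    # Log-softmax over the channel axis, shifted by the max for numerical stability.
    shifted = scores - scores.max(axis=0, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=0, keepdims=True))

    valid = labels != ignore_label
    safe_labels = np.where(valid, labels, 0)  # any in-range index for ignored pixels
    picked = np.take_along_axis(log_probs, safe_labels[None, :, :], axis=0)[0]

    # Average over the pixels that count (at least 1 to avoid division by zero).
    return -picked[valid].sum() / max(int(valid.sum()), 1)

# With all-zero scores every class has probability 1/21, so the loss is log(21).
scores = np.zeros((21, 2, 2))
labels = np.array([[0, 5], [20, 255]])
loss = softmax_with_loss(scores, labels)
print(loss, np.log(21))
```

The all-zero input is a useful sanity check: an untrained 21-class predictor that outputs uniform probabilities should start at a loss of about log(21).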

end
