SSD训练网络参数计算

一个预测层的网络结构如下所示：

可以看到，是由三个分支组成的，分别是"PriorBox"层，以及conf、loc的预测层，其中，conf与loc的预测层的参数是由PriorBox的参数计算得到的，具体计算公式如下：

min_size与max_size分别对应一个尺度的预测框（有几个就对应几个预测框），in_size只管自己的预测，而max_size是与aspect_ratio联系在一起的；

filp参数是对应aspect_ratio的预测框*2，以几个max_size，再乘以几；最终得到结果为A

conf、loc的参数是在A的基础上再乘以类别数（加背景），以及4

如下，是需要预测两类的其中一个尺度的网络参数；

如上算出的是，每个格子需要预测的conf以及loc的个数；

每个预测层有H*W个格子，因此，总共预测的loc以及conf的个数是需要乘以H*W的；

如下是某一个层的例子（转自：http://www.360doc.com/content/17/1013/16/42392246_694639090.shtml）

注意最后这里的num_priorbox的值与前面的并不一样，这里是每个预测层所有的输出框的个数：

layer {

  name: "combined_2_EltwisePROD_relu"

  type: "ReLU"

  bottom: "combined_2_EltwisePROD"

  top: "combined_2_EltwisePROD_relu"

}

###########################################

###################################################################

layer {

  name: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  type: "Convolution"

  bottom: "combined_2_EltwisePROD_relu"

  top: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  param {

    lr_mult:

    decay_mult:

  }

  convolution_param {

    num_output:

    bias_term: false

    pad:

    kernel_size:

    stride:

    weight_filler {

      type: "gaussian"

      std: 0.01

    }

  }

}

layer {

  name: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter_bn"

  type: "BatchNorm"

  bottom: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  top: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  batch_norm_param {

    moving_average_fraction: 0.999

    eps: 0.001

  }

}

layer {

  name: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter_scale"

  type: "Scale"

  bottom: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  top: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  scale_param {

    filler {

      type: "constant"

      value: 1.0

    }

    bias_term: true

    bias_filler {

      type: "constant"

      value: 0.0

    }

  }

}

layer {

  name: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  type: "Convolution"

  bottom: "combined_2_EltwisePROD_relu"

  top: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  param {

    lr_mult:

    decay_mult:

  }

  convolution_param {

    num_output:

    bias_term: false

    pad:

    kernel_size:

    stride:

    weight_filler {

      type: "gaussian"

      std: 0.01

    }

  }

}

layer {

  name: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter_bn"

  type: "BatchNorm"

  bottom: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  top: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  batch_norm_param {

    moving_average_fraction: 0.999

    eps: 0.001

  }

}

layer {

  name: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter_scale"

  type: "Scale"

  bottom: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  top: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  scale_param {

    filler {

      type: "constant"

      value: 1.0

    }

    bias_term: true

    bias_filler {

      type: "constant"

      value: 0.0

    }

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_loc"

  type: "Convolution"

  bottom: "rescombined_2_EltwisePROD_relu_inter256_mbox_locnew_inter"

  top: "combined_2_EltwisePROD_relu_mbox_loc"

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  convolution_param {

    engine: CAFFE

    num_output:

    pad:

    kernel_size:

    stride:

    weight_filler {

      type: "xavier"

    }

    bias_filler {

      type: "constant"

      value:

    }

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_loc_perm"

  type: "Permute"

  bottom: "combined_2_EltwisePROD_relu_mbox_loc"

  top: "combined_2_EltwisePROD_relu_mbox_loc_perm"

  permute_param {

    order:

    order:

    order:

    order:

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_loc_flat"

  type: "Flatten"

  bottom: "combined_2_EltwisePROD_relu_mbox_loc_perm"

  top: "combined_2_EltwisePROD_relu_mbox_loc_flat"

  flatten_param {

    axis:

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_conf_new"

  type: "Convolution"

  bottom: "rescombined_2i_EltwisePROD_relu_inter256_mbox_locnew_inter"

  top: "combined_2_EltwisePROD_relu_mbox_conf_new"

  param {

    lr_mult:

    decay_mult:

  }

  param {

    lr_mult:

    decay_mult:

  }

  convolution_param {

    engine: CAFFE

    num_output:

    pad:

    kernel_size:

    stride:

    weight_filler {

      type: "xavier"

    }

    bias_filler {

      type: "constant"

      value:

    }

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_conf_new_perm"

  type: "Permute"

  bottom: "combined_2_EltwisePROD_relu_mbox_conf_new"

  top: "combined_2_EltwisePROD_relu_mbox_conf_new_perm"

  permute_param {

    order:

    order:

    order:

    order:

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_conf_new_flat"

  type: "Flatten"

  bottom: "combined_2_EltwisePROD_relu_mbox_conf_new_perm"

  top: "combined_2_EltwisePROD_relu_mbox_conf_new_flat"

  flatten_param {

    axis:

  }

}

layer {

  name: "combined_2_EltwisePROD_relu_mbox_priorbox"

  type: "PriorBox"

  bottom: "combined_2_EltwisePROD_relu"

  bottom: "data"

  top: "combined_2_EltwisePROD_relu_mbox_priorbox"

  prior_box_param {

    min_size: 12.0

    min_size: 6.0

    max_size: 30.0

    max_size: 20.0

    aspect_ratio:

    aspect_ratio: 2.5

    aspect_ratio:

    flip: true

    clip: false

    variance: 0.1

    variance: 0.1

    variance: 0.2

    variance: 0.2

    step:

    offset: 0.5

  }

}

SSD训练网络参数计算的更多相关文章

LeNet-5网络结构及训练参数计算
经典神经网络诞生记: 1.LeNet,1998年 2.AlexNet,2012年 3.ZF-net,2013年 4.GoogleNet,2014年 5.VGG,2014年 6.ResNet,201 ...
『计算机视觉』Mask-RCNN_训练网络其二：train网络结构&损失函数
Github地址:Mask_RCNN 『计算机视觉』Mask-RCNN_论文学习『计算机视觉』Mask-RCNN_项目文档翻译『计算机视觉』Mask-RCNN_推断网络其一:总览『计算机视觉』M ...
CNN网络参数
卷积神经网络 LeNet-5各层参数详解 LeNet论文阅读:LeNet结构以及参数个数计算 LeNet-5共有7层,不包含输入,每层都包含可训练参数:每个层有多个Feature Map,每个 ...
pytorch和tensorflow的爱恨情仇之定义可训练的参数
pytorch和tensorflow的爱恨情仇之基本数据类型 pytorch和tensorflow的爱恨情仇之张量 pytorch版本:1.6.0 tensorflow版本:1.15.0 之前我们就已 ...
『计算机视觉』Mask-RCNN_训练网络其三：训练Model
Github地址:Mask_RCNN 『计算机视觉』Mask-RCNN_论文学习『计算机视觉』Mask-RCNN_项目文档翻译『计算机视觉』Mask-RCNN_推断网络其一:总览『计算机视觉』M ...
『计算机视觉』Mask-RCNN_训练网络其一：数据集与Dataset类
Github地址:Mask_RCNN 『计算机视觉』Mask-RCNN_论文学习『计算机视觉』Mask-RCNN_项目文档翻译『计算机视觉』Mask-RCNN_推断网络其一:总览『计算机视觉』M ...
卷积神经网络(CNN)张量(图像)的尺寸和参数计算(深度学习)
分享一些公式计算张量(图像)的尺寸,以及卷积神经网络(CNN)中层参数的计算. 以AlexNet网络为例,以下是该网络的参数结构图. AlexNet网络的层结构如下: 1.Input: 图 ...
关于LeNet-5卷积神经网络 S2层与C3层连接的参数计算的思考？？？
https://blog.csdn.net/saw009/article/details/80590245 关于LeNet-5卷积神经网络 S2层与C3层连接的参数计算的思考??? 首先图1是LeNe ...
caffe 网络参数设置
weight_decay防止过拟合的参数,使用方式: 样本越多,该值越小模型参数越多,该值越大一般建议值: weight_decay: 0.0005 lr_mult, decay_mult 关于偏 ...

随机推荐

Python 生成随机数函数和加密函数（MD5）
内容来自debugtalk import hashlib import random import string def gen_random_string(str_len): '''生成指定长度的随 ...
ansible的错误
错误 [root@bogon ansible]# ansible test -m ping 192.168.16.155 | FAILED! => { "msg": &quo ...
2159 -- Ancient Cipher
Ancient Cipher Time Limit: 1000MS Memory Limit: 65536K Total Submissions: 36074 Accepted: 11765 ...
linux下如何找出交叉编译器的某个库路径?
答: 使用选项-print-file-name=<lib_name> 如列出libstdc++.so.6的库路径:aarch64-linux-gnu-gcc -print-file-nam ...
[面试] Java高级软件工程师面试考纲(转)
如果要应聘高级开发工程师职务,仅仅懂得Java的基础知识是远远不够的,还必须懂得常用数据结构.算法.网络.操作系统等知识.因此本文不会讲解具体的技术,笔者综合自己应聘各大公司的经历,整理了一份大公司对 ...
Oracle常用CURD
-------------------------------------------------------------------------------------通用函数和条件判断函数使用N ...
为什么HashMap继承了AbstractMap还要实现Map?
前言之前看源码一直忽略了这个现象,按理说HashMap的父类AbstractMap已经实现了Map,它为什么还要实现一次呢?遂上网查了一下,背后原因让人大跌眼镜. 原因这是类库设计者的拼写错误,其 ...
ssm整合的springmvc.xml的配置
<?xml version="1.0" encoding="UTF-8"?><beans xmlns="http://www.spr ...
java+web上传文件夹内的所有文件
javaweb上传文件上传文件的jsp中的部分上传文件同样可以使用form表单向后端发请求,也可以使用 ajax向后端发请求 1.通过form表单向后端发送请求 <form id=" ...
docker启动cavisor监控
docker启动cavisor监控 docker run --volume=/:/rootfs:ro --volume=/var/run:/var/run:rw --volume=/sys:/sys: ...

SSD训练网络参数计算

SSD训练网络参数计算的更多相关文章

随机推荐

热门专题