目标检测系列 --- RCNN: Rich feature hierarchies for accurate object detection and semantic segmentation Tech report

1. Architecture:

Region proposals: 使用selective search获取region proposals，对于每一幅图像获取约2000个region proposals，并将每一个proposal wrap到需要的size，论文中为224*224，之所以要wrap到同一size，是因为Feature extraction网络的输入需要固定的size
Feature extraction: 使用AlexNet获取每一个region proposal的feature vector（4096维）。
SVM: 对于每一个类别，使用SVM分类器对feature map进行打分，以判断当前region proposal是所对应的物体类别还是background。图像中的每个region都会给出对应的score，并对这些region使用贪心的非极大值抑制（non-maximum suppression）来获取具有较高IOU的region，如保留IOU大于threshold=0.3的region。

在Inference阶段，作者指出了两个关键属性：

对于每种类别，所有的CNN参数共享
和其他的方法相比，此模型的feature vector的维度相对较低

2. Training:

pre-training: 在ILSVRC 2012的数据集上进行分类训练，feature map的输出为4096维，全连接层的输出为1000（1000类）。
fine-tuning: 主要是对Architecture中的model进行稍微的改动。训练数据集为PASCAL VOC 2007，将最后一层全连接层的输出换成21（20类+背景）。并通过selective search得到的region proposal作为网络的输入。
物体类别分类器：对于一个二值的分类器，需要有包含正负标签的样本，本模型的方法是当region proposal的IOU大于threshold（比如：0.5）时，将其标记为positive，其余的为negative，以此来对每个类别训练一个linear SVM。

Reference:

https://arxiv.org/abs/1311.2524

目标检测系列 --- RCNN: Rich feature hierarchies for accurate object detection and semantic segmentation Tech report的更多相关文章

论文阅读笔记二十四：Rich feature hierarchies for accurate object detection and semantic segmentation Tech report(R-CNN CVPR2014)
论文源址:http://www.cs.berkeley.edu/~rbg/#girshick2014rcnn 摘要在PASCAL VOC数据集上,最好的方法的思路是将低级信息与较高层次的上下文信息进 ...
目标检测--Rich feature hierarchies for accurate object detection and semantic segmentation(CVPR 2014)
Rich feature hierarchies for accurate object detection and semantic segmentation 作者: Ross Girshick J ...
论文笔记：Rich feature hierarchies for accurate object detection and semantic segmentation
在上计算机视觉这门课的时候,老师曾经留过一个作业:识别一张 A4 纸上的手写数字.按照传统的做法,这种手写体或者验证码识别的项目,都是按照定位+分割+识别的套路.但凡上网搜一下,就能找到一堆识别的教程 ...
深度学习论文翻译解析（八）：Rich feature hierarchies for accurate object detection and semantic segmentation
论文标题:Rich feature hierarchies for accurate object detection and semantic segmentation 标题翻译:丰富的特征层次结构 ...
2 - Rich feature hierarchies for accurate object detection and semantic segmentation（阅读翻译）
Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick Jeff ...
目标检测论文解读1——Rich feature hierarchies for accurate object detection and semantic segmentation
背景在2012 Imagenet LSVRC比赛中,Alexnet以15.3%的top-5 错误率轻松拔得头筹(第二名top-5错误率为26.2%).由此,ConvNet的潜力受到广泛认可,一炮而红 ...
R-CNN（Rich feature hierarchies for accurate object detection and semantic segmentation）论文理解
论文地址:https://arxiv.org/pdf/1311.2524.pdf 翻译请移步: https://www.cnblogs.com/xiaotongtt/p/6691103.html ht ...
Rich feature hierarchies for accurate object detection and semantic segmentation（理解）
0 - 背景该论文是2014年CVPR的经典论文,其提出的模型称为R-CNN(Regions with Convolutional Neural Network Features),曾经是物体检测领 ...
论文笔记（一）---翻译 Rich feature hierarchies for accurate object detection and semantic segmentation
论文网址: https://arxiv.org/abs/1311.2524 RCNN利用深度学习进行目标检测. 摘要可以将ImageNet上的进全图像分类而训练好的大型卷积神经网络用到PASCAL的 ...

随机推荐

【SSH之旅】一步步学习Struts1框架（二）：Struts实例
从上篇博客能够看到,事实上Struts1框架就是封装了一些页面的转向.数据类型的转换,去除冗余的if else推断.除了这些,事实上还封装了一些我们寻经常使用的JSTL标签库,文件上传等等. 以下看怎 ...
nodejs 中使用shell脚本
虽然nodejs的require('child_process').execSync可以使用shell,但其实问题特别多.尤其是符号和语法冲突,可读性也很差,只能完成一些小规模的shell使用. co ...
Html Agility Pack/SgmlReader 解析html
Html Agility Pack/SgmlReader 解析html HtmlDocument htmlDoc = new HtmlDocument(); //输出成xml格式 htmlDoc.Op ...
在centos 5.11上安装pylint
上午花了两三个小时在折腾pylint的安装. 如果是普通的linux倒也简单,只用运行一个pip install pylint就行了. 可是偏偏那么巧,服务器的版本是centos 5.11 这个版本对 ...
Atitit.增强系统稳定性----虚拟内存的设置
Atitit.增强系统稳定性----虚拟内存的设置 1.1. 读取虚拟内存配置1 1.2. 禁止虚拟内存1 1.3. 默认所有驱动器虚拟内存1 1.4. 设置c d盘虚拟内存为系统管理1 1.5. 设 ...
联想 e460 查看内存卡槽数
公司配了个8g的内存条,想着给电脑安装下.电脑是新的,自带就是8g的. 拆开电脑的 “L” 形区域后,发现两个内存卡槽都插满了,我还以为找错了呢.看了下卡槽内两个内存卡的信息,两个都是4g的. 没办 ...
BodyTagSupport小案例1
做了个简单的实验:写一个tag,将tag body中的内容打印成一个三角形代码很简单就不赘述了,直接贴在下面,值得注意的是这个图(摘自李兴华JAVA开发实战经典) 在做的过程中遇到了如下问题: 1. ...
scala读写文件
def main(args: Array[String]): Unit = { //1 read for( i<- Source.fromFile("test.dat").g ...
spring quartz定时任务配置
cronExpression表达式: 字段允许值允许的特殊字符秒 0-59 , - * /分 0-59 , - * /小时 0-23 , - * /日期 1-31 , - * ? / L W C月 ...
如何利用gdb调试程序？
程序的大体意思就是各一个数组赋值,然后输出来程序已经写在了上面,下面我们就来看一下如何产生带有调试信息的编译文件,这里我们要使用gcc的-g参数,用于在编译文件中加入一些调试信息.发现加了-g和没有加 ...

目标检测系列 --- RCNN: Rich feature hierarchies for accurate object detection and semantic segmentation Tech report

目标检测系列 --- RCNN: Rich feature hierarchies for accurate object detection and semantic segmentation Tech report的更多相关文章

随机推荐

热门专题