Paper Notes: Rethinking the Inception Architecture for Computer Vision

1. Main Idea

The paper scales networks up efficiently through factorized convolutions and aggressive regularization.
It also offers a set of practical guidelines for network design.

2. Results

The network costs about 5 billion multiply-adds per inference and uses fewer than 25 million parameters. With an ensemble of 4 models and multi-crop evaluation, we report 3.5% top-5 error and 17.3% top-1 error.

3. Introduction

The paper explores ways of scaling up convolutional networks efficiently.

4. General Design Principles

Avoid representational bottlenecks, especially early in the network. (Put simply, the feature map size should shrink gradually.)
Higher dimensional representations are easier to process locally within a network. Increasing the activations per tile in a convolutional network allows for more disentangled features. The resulting networks will train faster. (Deeper layers should use more feature maps, which leaves room for more disentangled features and speeds up training.)
Spatial aggregation can be done over lower dimensional embeddings without much or any loss in representational power. (This is the idea behind the bottleneck layer design.)
Balance the width and depth of the network. (Increasing both the width and the depth of the network can contribute to higher quality networks.)

5. Factorizing Convolution With Large Filter Size

Convolutions with a large filter size can be factorized into cheaper ones.

5.1. Factorization into smaller convolutions

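A 5x5 convolution can be factorized into two stacked 3x3 convolutions.

Experiments show that when one convolution is factorized into two, applying ReLU after the first convolution improves accuracy. In other words, a purely linear factorization performs worse.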
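The saving from this factorization can be checked with a little arithmetic. The sketch below is only an illustration: the helper names and the channel count of 64 are assumptions, biases are ignored, and input and output widths are taken to be equal.

```python
def conv_weights(kernel_h, kernel_w, c_in, c_out):
    """Number of weights in one convolution layer (biases ignored)."""
    return kernel_h * kernel_w * c_in * c_out


def stacked_receptive_field(kernel_sizes):
    """Receptive field of stride-1 convolutions applied one after another."""
    field = 1
    for k in kernel_sizes:
        field += k - 1
    return field


channels = 64  # hypothetical width, identical for input and output

single_5x5 = conv_weights(5, 5, channels, channels)
two_3x3 = 2 * conv_weights(3, 3, channels, channels)
saving = 1 - two_3x3 / single_5x5

# Two stacked 3x3 filters see the same 5x5 window as one 5x5 filter.
print(stacked_receptive_field([3, 3]))
print(single_5x5, two_3x3, round(saving, 2))
```

With equal channel counts the ratio is 18/25, so the two 3x3 layers need 28% fewer weights while covering the same input window.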

5.2. Spatial Factorization into Asymmetric Convolutions

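Factorizing a 3x3 convolution into a 3x1 convolution followed by a 1x3 convolution reduces computation by 33%, whereas factorizing it into two 2x2 convolutions saves only 11%. The asymmetric factorization also works better.
In practice, this factorization should not be applied too early in the network. It gives good results on medium-sized feature maps, between 12x12 and 20x20.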
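The 33% and 11% figures follow from counting weights per kernel. The sketch below is a rough check, with a hypothetical helper name and channel counts assumed equal everywhere so that they cancel. The 7x7 case is added only as a further illustration of how the saving grows with kernel size.

```python
def factorization_saving(n, parts):
    """Fraction of weights saved when an n x n kernel is replaced by `parts`.

    `parts` is a list of (height, width) kernels applied in sequence.
    Channel counts are assumed equal everywhere, so they cancel out.
    """
    original = n * n
    replaced = sum(h * w for h, w in parts)
    return 1 - replaced / original


asymmetric_3 = factorization_saving(3, [(3, 1), (1, 3)])  # 6 weights instead of 9
two_2x2 = factorization_saving(3, [(2, 2), (2, 2)])       # 8 weights instead of 9
asymmetric_7 = factorization_saving(7, [(1, 7), (7, 1)])  # 14 weights instead of 49

print(f"3x1 + 1x3: {asymmetric_3:.0%}")
print(f"2x2 + 2x2: {two_2x2:.0%}")
print(f"1x7 + 7x1: {asymmetric_7:.0%}")
```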

6. Utility of Auxiliary Classifier

7. Efficient Grid Size Reduction

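In the paper's figure, the left variant (pooling before the Inception module) introduces a representational bottleneck, while the right variant (Inception module before pooling) adds a large amount of computation. The best approach is to reduce the feature map size and increase the number of channels at the same time.

The paper does this by running a stride-2 convolution branch and a stride-2 pooling branch in parallel and concatenating their outputs.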
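The trade-off can be made concrete with a rough cost count. This is a simplified sketch, not the paper's exact module: the grid size d = 32 and channel count k = 320 are hypothetical, pooling is treated as free, and each branch is modelled as a single convolution with the kernel area left at 1.

```python
def conv_cost(grid, c_in, c_out, kernel_area=1):
    """Multiply-adds of a convolution evaluated on a grid x grid output."""
    return grid * grid * c_in * c_out * kernel_area


d, k = 32, 320  # hypothetical input grid size and channel count

# Variant A: pool first (d -> d/2), then convolve k -> 2k channels.
# Cheap, but the representation is squeezed to (d/2)^2 * k values first.
pool_then_conv = conv_cost(d // 2, k, 2 * k)

# Variant B: convolve k -> 2k on the full d x d grid, then pool.
# No bottleneck, but the convolution runs on the large grid.
conv_then_pool = conv_cost(d, k, 2 * k)

# Variant C: a stride-2 convolution (k -> k) and a stride-2 pooling run
# in parallel; concatenating them gives 2k channels on the d/2 grid.
parallel = conv_cost(d // 2, k, k)

print(conv_then_pool / pool_then_conv)
print(conv_then_pool / parallel)
```

Under these assumptions, variant B costs four times as much as variant A, and the parallel variant stays at or below the cost of A without first shrinking the representation.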


