VGG网络的Pytorch实现

1.文章原文地址

Very Deep Convolutional Networks for Large-Scale Image Recognition

2.文章摘要

在这项工作中，我们研究了在大规模的图像识别数据集上卷积神经网络的深度对准确率的影响。我们主要贡献是使用非常小（3×3）卷积核的架构对深度增加的网络进行全面的评估，其结果表明将深度增大到16-19层时网络的性能会显著提升。这些发现是基于我们在ImageNet Challenge 2014的目标检测和分类任务分别获得了第一名和第二名的成绩而得出的。另外该网络也可以很好的推广到其他数据集上，在这些数据集上获得了当前最好结果。我们已经公开了性能最佳的ConvNet模型，为了促进在计算机视觉中使用深度视觉表征的进一步研究。

3.网络结构

4.Pytorch实现

 import torch.nn as nn

 try:

     from torch.hub import load_state_dict_from_url

 except ImportError:

     from torch.utils.model_zoo import load_url as load_state_dict_from_url

 __all__ = [

     'VGG', 'vgg11', 'vgg11_bn', 'vgg13', 'vgg13_bn', 'vgg16', 'vgg16_bn',

     'vgg19_bn', 'vgg19',

 ]

 model_urls = {

     'vgg11': 'https://download.pytorch.org/models/vgg11-bbd30ac9.pth',

     'vgg13': 'https://download.pytorch.org/models/vgg13-c768596a.pth',

     'vgg16': 'https://download.pytorch.org/models/vgg16-397923af.pth',

     'vgg19': 'https://download.pytorch.org/models/vgg19-dcbb9e9d.pth',

     'vgg11_bn': 'https://download.pytorch.org/models/vgg11_bn-6002323d.pth',

     'vgg13_bn': 'https://download.pytorch.org/models/vgg13_bn-abd245e5.pth',

     'vgg16_bn': 'https://download.pytorch.org/models/vgg16_bn-6c64b313.pth',

     'vgg19_bn': 'https://download.pytorch.org/models/vgg19_bn-c79401a0.pth',

 }

 class VGG(nn.Module):

     def __init__(self, features, num_classes=1000, init_weights=True):

         super(VGG, self).__init__()

         self.features = features

         self.avgpool = nn.AdaptiveAvgPool2d((7, 7))　　#固定全连接层的输入

         self.classifier = nn.Sequential(

             nn.Linear(512 * 7 * 7, 4096),

             nn.ReLU(True),

             nn.Dropout(),

             nn.Linear(4096, 4096),

             nn.ReLU(True),

             nn.Dropout(),

             nn.Linear(4096, num_classes),

         )

         if init_weights:

             self._initialize_weights()

     def forward(self, x):

         x = self.features(x)

         x = self.avgpool(x)

         x = x.view(x.size(0), -1)

         x = self.classifier(x)

         return x

     def _initialize_weights(self):

         for m in self.modules():

             if isinstance(m, nn.Conv2d):

                 nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')

                 if m.bias is not None:

                     nn.init.constant_(m.bias, 0)

             elif isinstance(m, nn.BatchNorm2d):

                 nn.init.constant_(m.weight, 1)

                 nn.init.constant_(m.bias, 0)

             elif isinstance(m, nn.Linear):

                 nn.init.normal_(m.weight, 0, 0.01)

                 nn.init.constant_(m.bias, 0)

 def make_layers(cfg, batch_norm=False):

     layers = []

     in_channels = 3

     for v in cfg:

         if v == 'M':

             layers += [nn.MaxPool2d(kernel_size=2, stride=2)]

         else:

             conv2d = nn.Conv2d(in_channels, v, kernel_size=3, padding=1)

             if batch_norm:

                 layers += [conv2d, nn.BatchNorm2d(v), nn.ReLU(inplace=True)]

             else:

                 layers += [conv2d, nn.ReLU(inplace=True)]

             in_channels = v

     return nn.Sequential(*layers)

 cfgs = {

     'A': [64, 'M', 128, 'M', 256, 256, 'M', 512, 512, 'M', 512, 512, 'M'],

     'B': [64, 64, 'M', 128, 128, 'M', 256, 256, 'M', 512, 512, 'M', 512, 512, 'M'],

     'D': [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 'M', 512, 512, 512, 'M', 512, 512, 512, 'M'],

     'E': [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 256, 'M', 512, 512, 512, 512, 'M', 512, 512, 512, 512, 'M'],

 }

 def _vgg(arch, cfg, batch_norm, pretrained, progress, **kwargs):

     if pretrained:

         kwargs['init_weights'] = False

     model = VGG(make_layers(cfgs[cfg], batch_norm=batch_norm), **kwargs)

     if pretrained:

         state_dict = load_state_dict_from_url(model_urls[arch],

                                               progress=progress)

         model.load_state_dict(state_dict)

     return model

 def vgg11(pretrained=False, progress=True, **kwargs):

     """VGG 11-layer model (configuration "A")

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg11', 'A', False, pretrained, progress, **kwargs)

 def vgg11_bn(pretrained=False, progress=True, **kwargs):

     """VGG 11-layer model (configuration "A") with batch normalization

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg11_bn', 'A', True, pretrained, progress, **kwargs)

 def vgg13(pretrained=False, progress=True, **kwargs):

     """VGG 13-layer model (configuration "B")

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg13', 'B', False, pretrained, progress, **kwargs)

 def vgg13_bn(pretrained=False, progress=True, **kwargs):

     """VGG 13-layer model (configuration "B") with batch normalization

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg13_bn', 'B', True, pretrained, progress, **kwargs)

 def vgg16(pretrained=False, progress=True, **kwargs):

     """VGG 16-layer model (configuration "D")

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg16', 'D', False, pretrained, progress, **kwargs)

 def vgg16_bn(pretrained=False, progress=True, **kwargs):

     """VGG 16-layer model (configuration "D") with batch normalization

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg16_bn', 'D', True, pretrained, progress, **kwargs)

 def vgg19(pretrained=False, progress=True, **kwargs):

     """VGG 19-layer model (configuration "E")

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg19', 'E', False, pretrained, progress, **kwargs)

 def vgg19_bn(pretrained=False, progress=True, **kwargs):

     """VGG 19-layer model (configuration 'E') with batch normalization

     Args:

         pretrained (bool): If True, returns a model pre-trained on ImageNet

         progress (bool): If True, displays a progress bar of the download to stderr

     """

     return _vgg('vgg19_bn', 'E', True, pretrained, progress, **kwargs)

参考

https://github.com/pytorch/vision/tree/master/torchvision/models

VGG网络的Pytorch实现的更多相关文章

ResNet网络的Pytorch实现
1.文章原文地址 Deep Residual Learning for Image Recognition 2.文章摘要神经网络的层次越深越难训练.我们提出了一个残差学习框架来简化网络的训练,这些 ...
VGG网络
VGG论文给出了一个非常振奋人心的结论:卷积神经网络的深度增加和小卷积核的使用对网络的最终分类识别效果有很大的作用.记得在AlexNet论文中,也做了最后指出了网络深度的对最终的分类结果有很大的作用. ...
第二十四节，TensorFlow下slim库函数的使用以及使用VGG网络进行预训练、迁移学习(附代码)
在介绍这一节之前,需要你对slim模型库有一些基本了解,具体可以参考第二十二节,TensorFlow中的图片分类模型库slim的使用.数据集处理,这一节我们会详细介绍slim模型库下面的一些函数的使用 ...
关于VGG网络的介绍
本博客参考作者链接:https://zhuanlan.zhihu.com/p/41423739 前言: VGG是Oxford的Visual Geometry Group的组提出的(大家应该能看出VGG ...
基于深度学习的人脸识别系统（Caffe+OpenCV+Dlib）【三】VGG网络进行特征提取
前言基于深度学习的人脸识别系统,一共用到了5个开源库:OpenCV(计算机视觉库).Caffe(深度学习库).Dlib(机器学习库).libfacedetection(人脸检测库).cudnn(gp ...
群等变网络的pytorch实现
CNN对于旋转不具有等变性,对于平移有等变性,data augmentation的提出就是为了解决这个问题,但是data augmentation需要很大的模型容量,更多的迭代次数才能够在训练数据集合 ...
U-Net网络的Pytorch实现
1.文章原文地址 U-Net: Convolutional Networks for Biomedical Image Segmentation 2.文章摘要普遍认为成功训练深度神经网络需要大量标注 ...
GoogLeNet网络的Pytorch实现
1.文章原文地址 Going deeper with convolutions 2.文章摘要我们提出了一种代号为Inception的深度卷积神经网络,它在ILSVRC2014的分类和检测任务上都取得 ...
AlexNet网络的Pytorch实现
1.文章原文地址 ImageNet Classification with Deep Convolutional Neural Networks 2.文章摘要我们训练了一个大型的深度卷积神经网络用于 ...

随机推荐

android基础---->传感器的使用
现在每部Android 手机里面都会内置有许多的传感器,它们能够监测到各种发生在手机上的物理事件,而我们只要灵活运用这些事件就可以编写出很多好玩的应用程序.今天我们开始简单的传感器使用的学习. 目录导 ...
Eclipse下Maven安装和配置
1. 下载 Maven 在百度输入 Maven 搜索 ,找到它的官网(http://maven.apache.org/),点击进入下载页面. 下载页面地址: http://maven.apache.o ...
php有关类和对象的相关知识1
有关类和对象的相关知识类的自动加载类的自动加载是指,在外面的页面中,并不需要去“引入”(包含)类文件,但是程序会在需要一个类的时候就自动去“动态加载”该类. 什么叫做“需要一个类”?通常是这样的情 ...
Eclipse使用JDBC方式连接SQLServer2017
这篇博客写的比较详细了,图文并茂: https://blog.csdn.net/rebekahq/article/details/78691343 这里补充一些可能会遇到的问题: 1.与博客中不同之处 ...
6.2.3 reginst中的yzm
@RequestMapping("/reginst/{yzm}") public Object reginst(User user,@PathVariable String yzm ...
[转帖]crontab每小时运行一次
crontab每小时运行一次先给出crontab的语法格式对于网上很多给出的每小时定时任务写法,可以说绝大多数都是错误的!比如对于下面的这种写法: 00 * * * * #每隔一小时执行一 ...
Mysql 定时备份(mysqldump)
#!/bin/bash today=`date +%Y-%m-%d` deleday=`date -d '7 day ago' +%Y-%m-%d` path=/home/data/mysqlback ...
windows 下搭建go开发环境
下载golang安装包: 步骤一: 先打开环境变量配置的界面步骤二;配置我们的环境变量对上图的一个说明: 1).path这个变量不需要再创建,因为系统本就有,在后面添加即可. 2).增加Go的bi ...
2019牛客多校八 H. How Many Schemes (AC自动机,树链剖分)
大意: 给定树, 每条边有一个字符集合, 给定$m$个模式串, $q$个询问$(u,v)$, 对于路径$(u,v)$中的所有边, 每条边从对应字符集合中取一个字符, 得到一个串$s$, 求$s$至少包 ...
L2范数归一化概念和优势
1 归一化处理归一化是一种数理统计中常用的数据预处理手段,在机器学习中归一化通常将数据向量每个维度的数据映射到(0,1)或(-1,1)之间的区间或者将数据向量的某个范数映射为1,归一化 ...