nn.Module vs nn.functional

前者会保存权重等信息，后者只是做运算

parameters()

返回可训练参数

nn.ModuleList vs. nn.ParameterList vs. nn.Sequential

layer_list = [nn.Conv2d(5,5,3), nn.BatchNorm2d(5), nn.Linear(5,2)]

class myNet(nn.Module):

  def __init__(self):

    super().__init__()

    self.layers = layer_list

  def forward(x):

    for layer in self.layers:

      x = layer(x)

net = myNet()

print(list(net.parameters()))  # Parameters of modules in the layer_list don't show up.

nn.ModuleList的作用就是wrap pthon list，这样其中的参数会被注册，因此可以返回可训练参数(ParameterList)。

nn.Sequential的作用如下：

class myNet(nn.Module):

  def __init__(self):

    super().__init__()

    self.layers = nn.Sequential(

        nn.Relu(inplace=True),

        nn.Linear(10, 10)

    )

  def forward(x):

    x = layer(x)

x = torch.rand(10)

net = myNet()

print(net(x).shape)

可以看到Sequential的作用就是按照指定的顺序构建网络结构，得到一个完整的模块，而ModuleList则只是像list那样把元素集合起来而已。

nn.modules vs. nn.children

class myNet(nn.Module):

  def __init__(self):

    super().__init__()

    self.convBN =  nn.Sequential(nn.Conv2d(10,10,3), nn.BatchNorm2d(10))

    self.linear =  nn.Linear(10,2)

  def forward(self, x):

    pass

Net = myNet()

print("Printing children\n------------------------------")

print(list(Net.children()))

print("\n\nPrinting Modules\n------------------------------")

print(list(Net.modules()))

输出信息如下：

Printing children

------------------------------

[Sequential(

  (0): Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1))

  (1): BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)

), Linear(in_features=10, out_features=2, bias=True)]

Printing Modules

------------------------------

[myNet(

  (convBN1): Sequential(

    (0): Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1))

    (1): BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)

  )

  (linear): Linear(in_features=10, out_features=2, bias=True)

), Sequential(

  (0): Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1))

  (1): BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)

), Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1)), BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True), Linear(in_features=10, out_features=2, bias=True)]

可以看到children只会返回子元素，子元素可能是单个操作，如Linear，也可能是Sequential。而modules()返回的信息更加详细，不仅会返回children一样的信息，同时还会递归地返回，例如modules()会迭代地返回Sequential中包含的若干个子元素。

named_*

named_parameters: 返回一个iterator,每次它会提供包含参数名的元组。

In [27]: x = torch.nn.Linear(2,3)

In [28]: x_name_params = x.named_parameters()

In [29]: next(x_name_params)

Out[29]:

('weight', Parameter containing:

 tensor([[-0.5262,  0.3480],

         [-0.6416, -0.1956],

         [ 0.5042,  0.6732]], requires_grad=True))

In [30]: next(x_name_params)

Out[30]:

('bias', Parameter containing:

 tensor([ 0.0595, -0.0386,  0.0975], requires_grad=True))

named_modules

这个其实就是把上面提到的nn.modules以iterator的形式返回，每次读取和上面一样也是用next()，示例如下：

In [46]:  class myNet(nn.Module):

    ...:    def __init__(self):

    ...:      super().__init__()

    ...:      self.convBN1 =  nn.Sequential(nn.Conv2d(10,10,3), nn.BatchNorm2d(10))

    ...:      self.linear =  nn.Linear(10,2)

    ...:

    ...:    def forward(self, x):

    ...:      pass

    ...:                                                                                   

In [47]: net = myNet()                                                                     

In [48]: net_named_modules = net.named_modules()                                           

In [49]: next(net_named_modules)

Out[49]:

('', myNet(

   (convBN1): Sequential(

     (0): Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1))

     (1): BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)

   )

   (linear): Linear(in_features=10, out_features=2, bias=True)

 ))                                                                                        

In [50]: next(net_named_modules)

Out[50]:

('convBN1', Sequential(

   (0): Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1))

   (1): BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)

 ))                                                                                        

In [51]: next(net_named_modules)

Out[51]: ('convBN1.0', Conv2d(10, 10, kernel_size=(3, 3), stride=(1, 1)))                  

In [52]: next(net_named_modules)

Out[52]:

('convBN1.1',

 BatchNorm2d(10, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True))          

In [53]: next(net_named_modules)

Out[53]: ('linear', Linear(in_features=10, out_features=2, bias=True))                     

In [54]: next(net_named_modules)

---------------------------------------------------------------------------

StopIteration                             Traceback (most recent call last)

<ipython-input-54-05e848b071b8> in <module>

----> 1 next(net_named_modules)

StopIteration:

named_children

同named_modules

参考

https://blog.paperspace.com/pytorch-101-advanced/

Pytorch: parameters(),children(),modules(),named_*区别的更多相关文章

jquery 中后代遍历之children、find区别
jquery 中children.find区别首先看一段HTML代码,如下: <table id="tb"> <tr> <td>0</t ...
web.config中httpModules和Modules的区别
最近用到了mvc的 Modules管道时,发现web.config中有两个modules 1.system.web节点下的httpModules 2.system.webServer节点下的modul ...
Odoo中Application与modules的区别
转载请注明原文地址:https://www.cnblogs.com/cnodoo/p/9278681.html 一:Application(应用) application一般是针对大功能的模块,如提供 ...
jquery选择器中的find和空格，children和>的区别、及父节点兄弟节点，还有判断是否存在的写法
一.find和空格,children和>及其它的区别空格:$('parent childchild')表示获取parent下的所有的childchild节点(所有的子孙). 等效成 = ...
jQuery初学:find()方法及children方法的区别分析
首先看看英文解释吧: children方法: find方法: 通过以上的解释,可以总结如下: 1:children及find方法都用是用来获得element的子elements的,两者都不会返回 te ...
find()与children()方法的区别
来源:http://www.jb51.net/article/26195.htm 总经一下前段时间用于的jQuery方法:find及children.需要的朋友可以参考下. 首先看看英文解释吧: ch ...
children()与find()区别
1.children() 返回被选元素的所有直接子元素,该方法只会向下一级对 DOM 树进行遍历: 2.find() 返回被选元素的后代元素,一路向下直到最后一个后代.
vue-loader v15、vue-loader v14及之前版本，配置css modules的区别
vue-loader v15 配置css modules: 是在 css-loader 里配置官方文档:https://vue-loader.vuejs.org/zh/migrating.html# ...
jQuery：find()方法与children()方法的区别
1:children及find方法都用是用来获得element的子elements的,两者都不会返回 text node,就像大多数的jQuery方法一样. 2:children方法获得的仅仅是元素一 ...

随机推荐

6.Go-错误,defer,panic和recover
6.1.错误 Go语言中使用builtin包下error接口作为错误类型 Go语言中错误都作为方法/函数的返回值自定义错误类型 //Learn_Go/main.go package main imp ...
ajax与重定向
网上有不少说法ajax的请求url浏览器不会重定向的说法是片面的,正常是这样的: 当服务器将302响应发给浏览器时,浏览器并不是直接进行ajax回调处理,而是先执行302重定向——从Response ...
how to design AWS SQS?
遇到这么一题system design,怎么做? 几个月以前,有同事提出要用Webapi代替现有的WCF,当时我投的反对票.而且我给了很充分的理由,不仅仅是时间不足,人手不够,更重要的是这个变化太大, ...
nginx日志说明
一.日志说明 nginx日志主要有两种:访问日志和错误日志.访问日志主要记录客户端访问nginx的每一个请求,格式可以自定义:错误日志主要记录客户端访问nginx出错时的日志,格式不支持自定义.两种 ...
Python安装（64位Win8.1专业版）
本文出处:http://www.cnblogs.com/leonwen/p/4700648.html 嗯,开始学Python. 我安装的是Python 2.7.10版本,安装的时候除了选了路径其他均n ...
vue学习面向对象，在项目中怎么用呢？
面向对象感觉很牛逼,可是在项目中怎么用呢? 我至今见到的用法,写了一个用户对象. 效果:只要执行了new User(userInfo)就会在cookie,localStorage存放数据. 所以最简单 ...
Redux + React-router 的入门和配置教程
(转载)原文链接: https://juejin.im/post/5dcaaa276fb9a04a965e2c9b#heading-18 前言
部门工资前三高的所有员工 - LeetCode
Employee 表包含所有员工信息,每个员工有其对应的工号 Id,姓名 Name,工资 Salary 和部门编号 DepartmentId . +----+-------+--------+---- ...
携程 Apollo分布式部署
一.环境准备操作系统:CentOS release 7.5 (启动脚本理论上支持所有Linux发行版,建议CentOS 7) JDK :jdk1..0_162 (建议安装Java 1.8+) MyS ...
【09】Jenkins：Pipeline 补充
写在前面的话我们在使用普通的构建任务的时候使用了 Sonar 做代码质量管理,也使用了 Publish Over SSH 插件中更新上线,但是我们在 Pipeline 怎么使用他们呢. 如果你没有查 ...

Pytorch: parameters(),children(),modules(),named_*区别