一、使用Numpy初始化：【直接对Tensor操作】

对Sequential模型的参数进行修改：

 import numpy as np

 import torch

 from torch import nn

 # 定义一个 Sequential 模型

 net1 = nn.Sequential(

     nn.Linear(30, 40),

     nn.ReLU(),

     nn.Linear(40, 50),

     nn.ReLU(),

     nn.Linear(50, 10)

 )

 # 访问第一层的参数

 w1 = net1[0].weight

 b1 = net1[0].bias

 print(w1)

 #对第一层Linear的参数进行修改：

 # 定义第一层的参数 Tensor 直接对其进行替换

 net1[0].weight.data = torch.from_numpy(np.random.uniform(3, 5, size=(40, 30)))

 print(net1[0].weight)
23
24 #若模型中相同类型的层都需要初始化成相同的方式，一种更高效的方式：使用循环去访问：
25 for layer in net1:
26     if isinstance(layer, nn.Linear): # 判断是否是线性层
27         param_shape = layer.weight.shape
28         layer.weight.data = torch.from_numpy(np.random.normal(0, 0.5, size=param_shape)) 
29         # 定义为均值为 0，方差为 0.5 的正态分布

对Module模型的参数初始化：

对于 Module 的参数初始化，其实也非常简单，如果想对其中的某层进行初始化，可以直接像 Sequential 一样对其 Tensor 进行重新定义，其唯一不同的地方在于，如果要用循环的方式访问，需要介绍两个属性，children 和 modules，下面我们举例来说明：

1、创建Module模型类：

 class sim_net(nn.Module):

     def __init__(self):

         super(sim_net, self).__init__()

         self.l1 = nn.Sequential(

             nn.Linear(30, 40),

             nn.ReLU()

         )

         self.l1[0].weight.data = torch.randn(40, 30) # 直接对某一层初始化

         self.l2 = nn.Sequential(

             nn.Linear(40, 50),

             nn.ReLU()

         )

         self.l3 = nn.Sequential(

             nn.Linear(50, 10),

             nn.ReLU()

         )

     def forward(self, x):

         x = self.l1(x)

         x =self.l2(x)

         x = self.l3(x)

         return x

2、创建模型对象：

net2 = sim_net()

3、访问children：

# 访问 children

for i in net2.children():

    print(i)
　　　　　#打印的结果：

Sequential(

  (0): Linear(in_features=30, out_features=40)

  (1): ReLU()

)

Sequential(

  (0): Linear(in_features=40, out_features=50)

  (1): ReLU()

)

Sequential(

  (0): Linear(in_features=50, out_features=10)

  (1): ReLU()

)

4、访问modules：

# 访问 modules

for i in net2.modules():

    print(i)

#打印的结果

sim_net(

  (l1): Sequential(

    (0): Linear(in_features=30, out_features=40)

    (1): ReLU()

  )

  (l2): Sequential(

    (0): Linear(in_features=40, out_features=50)

    (1): ReLU()

  )

  (l3): Sequential(

    (0): Linear(in_features=50, out_features=10)

    (1): ReLU()

  )

)

Sequential(

  (0): Linear(in_features=30, out_features=40)

  (1): ReLU()

)

Linear(in_features=30, out_features=40)

ReLU()

Sequential(

  (0): Linear(in_features=40, out_features=50)

  (1): ReLU()

)

Linear(in_features=40, out_features=50)

ReLU()

Sequential(

  (0): Linear(in_features=50, out_features=10)

  (1): ReLU()

)

Linear(in_features=50, out_features=10)

ReLU()

通过上面的例子，可以看到：

children 只会访问到模型定义中的第一层，因为上面的模型中定义了三个 Sequential，所以只会访问到三个 Sequential，而 modules 会访问到最后的结构，比如上面的例子，modules 不仅访问到了 Sequential，也访问到了 Sequential 里面，这就对我们做初始化非常方便。

5、采用循环初始化：

for layer in net2.modules():

    if isinstance(layer, nn.Linear):

        param_shape = layer.weight.shape

        layer.weight.data = torch.from_numpy(np.random.normal(0, 0.5, size=param_shape))

二、torch.nn.init初始化

PyTorch 还提供了初始化的函数帮助我们快速初始化，就是 torch.nn.init，其操作层面仍然在 Tensor 上。先介绍一种初始化方法：

Xavier 初始化方法：

其中 $n_j$ 和 $n_{j+1}$ 表示该层的输入和输出数目。

这种非常流行的初始化方式叫 Xavier，方法来源于 2010 年的一篇论文 Understanding the difficulty of training deep feedforward neural networks，其通过数学的推到，证明了这种初始化方式可以使得每一层的输出方差是尽可能相等的。

`torch.nn.init：`

from torch.nn import init

init.xavier_uniform(net1[0].weight) # 这就是上面我们讲过的 Xavier 初始化方法，PyTorch 直接内置了其实现

#这就直接修改了net1[0].weight的值

Pytorch基础（6）----参数初始化的更多相关文章

pytorch对模型参数初始化
1.使用apply() 举例说明: Encoder :设计的编码其模型 weights_init(): 用来初始化模型 model.apply():实现初始化 # coding:utf- from t ...
PyTorch常用参数初始化方法详解
1. 均匀分布 torch.nn.init.uniform_(tensor, a=0, b=1) 从均匀分布U(a, b)中采样,初始化张量. 参数: tensor - 需要填充的张量 a - 均匀分 ...
PyTorch模型读写、参数初始化、Finetune
使用了一段时间PyTorch,感觉爱不释手(0-0),听说现在已经有C++接口.在应用过程中不可避免需要使用Finetune/参数初始化/模型加载等. 模型保存/加载 1.所有模型参数训练过程中,有 ...
【转载】 pytorch自定义网络结构不进行参数初始化会怎样？
原文地址: https://blog.csdn.net/u011668104/article/details/81670544 ------------------------------------ ...
pytorch和tensorflow的爱恨情仇之参数初始化
pytorch和tensorflow的爱恨情仇之基本数据类型 pytorch和tensorflow的爱恨情仇之张量 pytorch和tensorflow的爱恨情仇之定义可训练的参数 pytorch版本 ...
[源码解析] PyTorch分布式(6) -------- DistributedDataParallel -- 初始化&store
[源码解析] PyTorch分布式(6) ---DistributedDataParallel -- 初始化&store 目录 [源码解析] PyTorch分布式(6) ---Distribu ...
pytorch基础教程1
0.迅速入门:根据上一个博客先安装好,然后终端python进入,import torch ******************************************************* ...
[人工智能]Pytorch基础
PyTorch基础摘抄自<深度学习之Pytorch>. Tensor(张量) PyTorch里面处理的最基本的操作对象就是Tensor,表示的是一个多维矩阵,比如零维矩阵就是一个点,一维 ...

随机推荐

Spring Data Jpa-动态查询条件
/** * * 查看日志列表-按照时间倒序排列 * * @author: wyc * @createTime: 2017年4月20日下午4:24:43 * @history: * @return L ...
HDU 1561&HDU 3449 一类简单依赖背包问题
HDU 1561.这道是树形DP了,所谓依赖背包,就是选A前必须选B,这样的问题.1561很明显是这样的题了.把0点当成ROOT就好,然后选子节点前必须先选根,所以初始化数组每一行为该根点的值.由于多 ...
mac下安装tensorflow及入门例子
https://www.tensorflow.org/install/install_mac 使用virtualenv安装,virtualenv相当于使tensorflow运行在虚拟机环境下. 需要使 ...
codeforces Looksery Cup 2015 D. Haar Features
The first algorithm for detecting a face on the image working in realtime was developed by Paul Viol ...
C#遍历DataSet与DataSet元素实现代码
C#中的Dataset就像一个数据库,有多个表(Table),一般只有一个表,然后每个表中有行(DataRow)和列(DataColumn),DataRow[DataColumn]可以得到某行某列数据 ...
poj 2763（在线LCA+树状数组）
Housewife Wind After their royal wedding, Jiajia and Wind hid away in XX Village, to enjoy their ord ...
hdoj--5100--Chessboard（数学推理）
Chessboard Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 32768/32768 K (Java/Others) To ...
B3403 [Usaco2009 Open]Cow Line 直线上的牛 deque
deque真的秀,queue和stack...没啥用了啊.操作差不多,就是在前面加一个front||back_就行了. 题干: 题目描述题目描述约翰的N只奶牛(编为1到N号)正在直线上排队 ...
Python的学习（二十一）----Python的静态变量
前段时间在论坛里面有人提问说, class foo(): member1 member2 ... self.member1 foo.member2 其中的两个成员member1, member2有什么 ...
sublime如何汉化
1.将sublime安装文件夹里面的defavlut.sublime-package这个文件zip解压. 2.然后查找到sublime-menu文件. 3.打开文件将json里面的caption里面的 ...

Pytorch基础（6）----参数初始化

一、使用Numpy初始化：【直接对Tensor操作】

对Sequential模型的参数进行修改：

对Module模型 的参数初始化：

二、torch.nn.init初始化

Xavier 初始化方法：

torch.nn.init：

Pytorch基础（6）----参数初始化的更多相关文章

随机推荐

热门专题

对Module模型的参数初始化：

`torch.nn.init：`