Dropout 下（关于《Dropout: A Simple way to prevent neural networks from overfitting》）

先上菜单：

摘要：

Deep neural nets with a large number of parameters are very powerful machine learning systems. However, overfitting is a serious problem in such networks. Large networks are also slow to use, making it difficult to deal with overfitting by combining the predictions of many different large neural nets at test time. （具有大量参数的深度神经网络是非常强大的机器学习系统。然而，在这样的网络中，过度拟合是一个严重的问题。大型网络的使用速度也较慢，因此在测试时结合许多不同大型神经网络的预测，很难处理过度拟合问题。）Dropout is a technique for addressing this problem.The key idea is to randomly drop units (along with their connections) from the neural network during training. （dropout是解决这个问题的一种方法。关键思想是在训练过程中从神经网络中随机删除单元(以及它们的连接)。）This prevents units from co-adapting too much. During training,dropout samples from an exponential number of diﬀerent “thinned” networks. At test time,it is easy to approximate the effect of averaging the predictions of all these thinned networks by simply using a single unthinned network that has smaller weights. （这就防止了单位过度的相互适应。在训练过程中，舍弃来自不同的指数级别的“稀疏”网络的样本。在测试时，只需使用一个权重较小的未减薄网络，就可以很容易地估计出所有这些变薄网络的平均预测效果。）This significantly reduces overfitting and gives major improvements over other regularization methods. We show that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology,obtaining state-of-the-art results on many benchmark data sets.（这大大减少了过度拟合，并对其他正则化方法进行了重大改进。实验结果表明，在视觉、语音识别、文档分类和计算生物学等方面，dropout都能提高神经网络在有监督学习任务中的性能，在许多基准数据集上都获得了最新的结果。）

Keywords: neural networks, regularization（正则化）, model combination(模型组合), deep learning

先介绍一下本文结构：

本文的结构如下:第2节描述了这个想法的动机。第3节描述了以前的相关工作。第4节正式描述了dropout模型。第5节给出了训练dropout网络的算法。在第6节中，我们展示了我们的实验结果，我们将dropout应用于不同领域的问题，并与其他形式的正则化和模型组合进行了比较。第7节分析了dropout对神经网络不同性质的影响，并描述了dropout如何与网络的超参数相互作用。第8节描述了drop - RBM模型。在第9节中，我们探讨了边缘化dropout的概念。在附录A中，我们提供了一个训练dropout网的实用指南。这包括在训练drop - out网络时，选择超参数所涉及的实际考虑的详细分析。(背景部分：1-3节；方法部分：4-5节；实验及分析：6-7节；其他：8-10节；总结：11；附录：A-B)

（几个参考网站：

https://www.baidu.com/link?url=F-vklwp34FZsuOsiAw36yS2upENUfms5jn-R3VGUY3Pmhq210Q2c9K5N8YNN63BzYlCS9OPNUhl-eSms3QpNh9urQwhWo0HDis6G2MnoGm3&wd=&eqid=f9e01460000131a8000000055bceab97

https://blog.csdn.net/qq_25011449/article/details/81168369

https://blog.csdn.net/huplion/article/details/79208736

https://blog.csdn.net/u014422406/article/details/70257324?locationNum=2&fps=1

https://blog.csdn.net/lhc19940815/article/details/50907545

）

Dropout 下（关于《Dropout: A Simple way to prevent neural networks from overfitting》）的更多相关文章

Dropout: A Simple Way to Prevent Neural Networks fromOverfitting
https://www.cs.toronto.edu/~hinton/absps/JMLRdropout.pdf Deep neural nets with a large number of par ...
Deep Learning 23：dropout理解_之读论文“Improving neural networks by preventing co-adaptation of feature detectors”
理论知识:Deep learning:四十一(Dropout简单理解).深度学习(二十二)Dropout浅层理解与实现.“Improving neural networks by preventing ...
论文笔记系列-Simple And Efficient Architecture Search For Neural Networks
摘要本文提出了一种新方法,可以基于简单的爬山过程自动搜索性能良好的CNN架构,该算法运算符应用网络态射,然后通过余弦退火进行短期优化运行. 令人惊讶的是,这种简单的方法产生了有竞争力的结果,尽管只需 ...
PyNest——Part1:neurons and simple neural networks
neurons and simple neural networks pynest – nest模拟器的界面神经模拟工具(NEST:www.nest-initiative.org)专为仿真点神经元的 ...
DeepFool: a simple and accurate method to fool deep neural networks
目录概主要内容二分类模型为线性为一般二分类多分类问题仿射为一般多分类 Moosavidezfooli S, Fawzi A, Frossard P, et al. DeepFool: ...
[CS231n-CNN] Training Neural Networks Part 1 : parameter updates, ensembles, dropout
课程主页:http://cs231n.stanford.edu/ ___________________________________________________________________ ...
[Neural Networks] Dropout阅读笔记
多伦多大学Hinton组 http://www.cs.toronto.edu/~rsalakhu/papers/srivastava14a.pdf 一.目的降低overfitting的风险二.原理 ...
机器学习之神经网络模型-下（Neural Networks: Representation）
3. Model Representation I 1 神经网络是在模仿大脑中的神经元或者神经网络时发明的.因此,要解释如何表示模型假设,我们不妨先来看单个神经元在大脑中是什么样的. 我们的大脑中充满 ...
第六节，Neural Networks and Deep Learning 一书小节(下)
4.神经网络可以计算任何函数的可视化证明神经网络拥有一定的普遍性,即包含一个隐藏层的神经网络可以被用来按照任意给定的精度来近似任何连续函数. 这一章使用一个实例来阐述神经网络是如何来近似一个一元函数 ...

随机推荐

Service(服务)简单使用
1.Service(服务)是一个一种可以在后台执行长时间运行操作而没有用户界面的应用组件.服务可由其他应用组件启动(如Activity),服务一旦被启动将在后台一直运行,即使启动服务的组件(Activ ...
listview添加的头部布局超过一屏头部内容显示不全
headView的实际高度超过一个屏幕,但是显示的结果只有一个屏幕,超过一个屏幕高度意外的部分显示不全. 只使用了listView.getRefreshable().addHeadView(headV ...
Java操作Kafka执行不成功
使用kafka-clients操作kafka始终不成功,原因不清楚,下面贴出相关代码及配置,请懂得指点一下,谢谢! 环境及依赖 <dependency> <groupId>or ...
asp实现阿里大鱼短信API接口的方法
阿里大鱼是阿里推出的产品,官方提供JAVA..NET.PHP等版本的SDK下载,不知为何,唯独不提供ASP版本的SDK. 不提供没关系,自己写就是了,参照官方提供的API写一个就是了. 本来以为无非是 ...
PHP学习过程中遇到的疑难杂症
变量当双引号中包含变量时,变量会与双引号中的内容连接在一起:当单引号中包含变量时,变量会被当做字符串输出. Heredoc结构形式首先使用定界符表示字符串(<<<),接着在“< ...
django前端到后端一次完整请求实例
一.创建项目:# django-admin startproject mysite# cd mysite# python manage.py startapp blog 目录结构: 一.html文件: ...
node——模块分类，require执行顺序，require注意事项，原理
node.js模块在node.js开发中一个文件就可以认为是一个模块. 一.node.js模块分类核心模块Code Module.内置模块.原生模块 fs http path url ... 所有 ...
html+css居中问题
一.行级元素水平居中对齐(父元素设置 text-align:center) <div style="width: 200px; height: 100px;border: 1px so ...
Python基础数据类型list，tuple
列表是有序的可变的元素集合.列表中的每个元素可以使任何数据类型,包括列表本身. 列表生成 Python3中的列表通过定义,for循环,列表推导式等几种方式生成定义直接通过中括号`[]`定义一个列表 ...
centos7下安装pyspark
1.安装python 2.安装jdk 3.下载spark:http://spark.apache.org/downloads.html, 下载新版(spark-2.3.1-bin-hadoop2.7. ...

Dropout 下（关于《Dropout: A Simple way to prevent neural networks from overfitting》）

Dropout 下（关于《Dropout: A Simple way to prevent neural networks from overfitting》）的更多相关文章

随机推荐

热门专题