Deep Learning 资料总结

http://colah.github.io/posts/2014-03-NN-Manifolds-Topology/ : 理解神经网络

ELU：

梯度下降优化方式：

http://sebastianruder.com/optimizing-gradient-descent/:各种优化算法详解

GradientDescentOptimizer
This one is sensitive to the problem and you can face lots of problems using it, from getting stuck in saddle points to oscillating around the minimum and slow convergence. I found it useful for Word2Vec, CBOW and feed-forward architectures in general, but Momentum is also good.
AdadeltaOptimizer
Adadelta addresses the issues of using constant of linearly decaying learning rate. In case of recurrent networks it’s among the fastest.
MomentumOptimizer
If you learn a regression and find your loss function oscillating, switching from SGD to Momentum may be the right solution.
AdamOptimizer
Adaptive momentum in addition to the Adadelta features.
FtrlOptimizer
I haven’t used it myself, but from the paper I see that it’s better suited for online learning on large sparse datasets, like recommendation systems.
RMSPropOptimizer
This is a variant Adadelta that serves the same purpose - dynamic decay of a learning rate multiplier.

CNN神经网络一些tricky的地方：

http://lamda.nju.edu.cn/weixs/project/CNNTricks/CNNTricks.html

摘要：

1、适合Relu的参数初始化：w = np.random.randn(n) * sqrt(2.0/n) # current recommendation

2、LR： In practice, if you see that you stopped making progress on the validation set, divide the LR by 2 (or by 5), and keep going, which might give you a surprise.亲测有效

3、关于learning rate：

RNN学习：

中文教程：http://blog.csdn.net/heyongluoyao8/article/details/48636251

FCN：http://blog.csdn.net/happyer88/article/details/47205839：Fully Convolutional Networks for Semantic Segmentation笔记

优点：

1，训练一个end-to-end的FCN模型，利用卷积神经网络的很强的学习能力，得到较准确的结果，以前的基于CNN的方法都是要对输入或者输出做一些处理，才能得到最终结果。

2，直接使用现有的CNN网络，如AlexNet, VGG16, GoogLeNet，只需在末尾加上upsampling，参数的学习还是利用CNN本身的反向传播原理，"whole image training is effective and efficient."

3，不限制输入图片的尺寸，不要求图片集中所有图片都是同样尺寸，只需在最后upsampling时按原图被subsampling的比例缩放回来，最后都会输出一张与原图大小一致的dense prediction map

理解DL细节的不错的文章：

http://colah.github.io/posts/2014-07-Conv-Nets-Modular/ ：CNN细节

http://colah.github.io/posts/2015-08-Understanding-LSTMs/：RNN细节

http://blog.csdn.net/mao_xiao_feng/article/details/53444333 tensorflow 计算CNN的细节

如果遇到了最后的输出值都一样的情况，可能的解决办法如下：

Hey, I had a similar issue with my own (hand-coded) CNN trying to get some results with the CIFAR-10 dataset. What I found was that I had forgotten to normalize the input images to some range that made sense with my weight scales. Try something like X = X / max(abs(X)) to put values between -1 and 1.

Another possibility is your weight initialization is causing many ReLU units to die. I usually initialize all weights with a small number times a normal Gaussian distribution. For wx+ b, b being the biases, you can try that + a small positive constant. I.e. b = weight_scale*random.randn(num, 1) + 0.1

Another idea — your sigmoid unit might be squashing your responses too much. They’re fairly uncommon in CNNs from what I understand, maybe just stick to ReLUs.

Last point — try testing on a small training batch (say 10–20 images) and just train until you overfit with 100% accuracy. That’s one way of knowing that your network is capable of doing something. I think these smaller tests are very important before investing hours or days into proper training, which is what these networks often require.

我最后的解决办法是：加了batch normalization，不过具体原因也没有确定

GAN的资料：

http://it.sohu.com/20161003/n469570219.shtml

http://www.wtoutiao.com/p/172tUtn.html

Deep Learning 资料总结的更多相关文章

机器学习(Machine Learning)&深度学习(Deep Learning)资料【转】
转自:机器学习(Machine Learning)&深度学习(Deep Learning)资料 <Brief History of Machine Learning> 介绍:这是一 ...
机器学习(Machine Learning)&深度学习(Deep Learning)资料(Chapter 2)
##机器学习(Machine Learning)&深度学习(Deep Learning)资料(Chapter 2)---#####注:机器学习资料[篇目一](https://github.co ...
【重磅干货整理】机器学习(Machine Learning)与深度学习(Deep Learning)资料汇总
[重磅干货整理]机器学习(Machine Learning)与深度学习(Deep Learning)资料汇总 .
机器学习(Machine Learning)&深度学习(Deep Learning)资料
机器学习(Machine Learning)&深度学习(Deep Learning)资料機器學習.深度學習方面不錯的資料,轉載. 原作:https://github.com/ty4z2008 ...
机器学习(Machine Learning)&深度学习(Deep Learning)资料
<Brief History of Machine Learning> 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决策树.SVM.Adaboost到随机森林.D ...
机器学习(Machine Learning)&深入学习(Deep Learning)资料
<Brief History of Machine Learning> 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决策树.SVM.Adaboost 到随机森林. ...
机器学习(Machine Learning)&深度学习(Deep Learning)资料汇总（上）
转载:http://dataunion.org/8463.html?utm_source=tuicool&utm_medium=referral <Brief History of Ma ...
机器学习(Machine Learning)&深度学习(Deep Learning)资料(下）
转载:http://www.jianshu.com/p/b73b6953e849 该资源的github地址:Qix <Statistical foundations of machine lea ...
机器学习(Machine Learning)与深度学习(Deep Learning)资料汇总
<Brief History of Machine Learning> 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决策树.SVM.Adaboost到随机森林.D ...

随机推荐

Linux fdisk命令详解[主分区/逻辑分区创建]
fdisk常见命令参数 -b<分区大小>:指定每个分区的大小: -l:列出指定的外围设备的分区表状况: -s<分区编号>:将指定的分区大小输出到标准输出上,单位为区块: -u: ...
Linux ntpdate命令详解
ntpdate命令用于同步更新互联网时间,或者NTP服务器时间 NTP服务器[Network Time Protocol(NTP)]是用来使计算机时间同步化的一种协议,它可以使计算机对其服务器或时钟源 ...
Nodejs和npm入门使用
目录引子何为node.js 和 npm 怎么使用node.js和npm 安装node.js & npm npm init 初始化项目(创建node.js模块) Package.json 属 ...
Z ：彻底了解指针数组，数组指针以及函数指针 [复
原创 :彻底了解指针数组,数组指针以及函数指针 [复制链接] 00 roking 白手起家帖子 60 主题 16 精华 0 可用积分 74 专家积分 0 在线时间 0 小时注册时间 2003-10 ...
JavaScript基础进阶之数组方法总结
数组常用方法总结: 下面我只总结了es3中常用的数组方法,一共有11个.es5中新增的9个数组方法,后续再单独总结. 1个连接数组的方法:concat() 2个数组转换为字符串的方法:join(). ...
CSS盒子模型之CSS3可伸缩框属性（Flexible Box）
CSS盒子模型(下) 一．CSS3可伸缩框(Flexible Box) 可伸缩框属性(Flexible Box)是css3新添加的盒子模型属性,有人称之为弹性盒模型,它的出现打破了我们经常使用的浮动布 ...
git使用，在ubuntu中
#安装sudo apt install git要先到github官网建一个账号. #配置你的github git config --global user.name "fanbrightu ...
iOS AOP框架Aspects实现原理
总结: Aspects 是对类的继承结构isa.mataclass结构的调整和维护:相当于链表的节点插入和删除: 同时使用method Swizzling 对方法统一重定向: 同时使用类似代理的机制 ...
Javascript (ECMAScript5) 的细节和违反直觉的地方
记录在学习Javascript (ECMAScript5) 中的一些与其他语言的不同之处,本文会不断更新. 里面的知识可能并不太适合有一定经验的Javascript程序员,仅仅但不限于给初学者阅读. ...
【bzoj 3622】已经没有什么好害怕的了
题目看到这个数据范围就发现我们需要一个\(O(n^2)\)的做法了,那大概率是\(dp\)了看到恰好\(k\)个我们就知道这基本是个容斥了首先解方程发现我们需要使得\(a>b\)的恰好有\ ...

Deep Learning 资料总结

Deep Learning 资料总结的更多相关文章

随机推荐

热门专题