CanChen ggchen@mail.ustc.edu.cn


 

AdaBatch

  • Motivation: Current stochastic gradient descend methods use fixed batchsize. Small batchsize with small learning rate leads to fast convergence while large batchsize offers more parallelism. This paper proposes AdaBatch, during which we can change the batchsize.
  • Method: Increasing batchsize equals to decreasing learning rate under some approximations. With this formula, the author did several experiments and proved that increasing batchsize progressively maintain the test accuracy within 1% while providing more parallelism.
  • Contribution: The paper gives us some engineering experience which can be very helpful.
 

“You might also like this model”

  • Motivation: Current network performance prediction methods focus on a fixed dataset while different datasets have different features.
  • Method: This paper proposes a recommendation system for unknown datasets, which consists of three parts, namely, network encoder, dataset similarity extractor and network performance predictor. To get network encoding presentation, this paper views a certain network architecuture as a sentence and proposes sentence a prediction task and a sentence perplexity task.
  • Contribution: Compared with previous works, the paper takes dataset similarity into consideration.

PaperReading20200223的更多相关文章

随机推荐

  1. VS2017项目中使用代码连接MySQL数据库,以及进行数据添加

    //头文件 #include "mysql.h" //函数定义 // 执行sql语句, 包括增加.删除.更新数据 bool ExecuteSql(MYSQL m_mysql,con ...

  2. java 责任链模式的三种实现

    责任链模式 责任链模式的定义:使多个对象都有机会处理请求,从而避免请求的发送者和接受者之间的耦合关系, 将这个对象连成一条链,并沿着这条链传递该请求,直到有一个对象处理他为止.这里就不再过多的介绍什么 ...

  3. 寒假pta二

    整除光棍 这里所谓的“光棍”,并不是指单身汪啦~ 说的是全部由1组成的数字,比如1.11.111.1111等.传说任何一个光棍都能被一个不以5结尾的奇数整除.比如,111111就可以被13整除. 现在 ...

  4. C++11特性中的to_string

    写在最前面,本文摘录于柳神笔记 to_string 的头⽂件是 #include , to_string 最常⽤的就是把⼀个 int 型变量或者⼀个数字转化 为 string 类型的变量,当然也可以转 ...

  5. MySQL表结构导出Excel

    1. information_schema.COLUMNS表记录了所有库中所有表的字段信息 SELECT COLUMN_NAME 字段名称, COLUMN_TYPE 字段类型, COLUMN_DEFA ...

  6. java 第三次课后作业

    1.java字段初始化的规律 public class gouzao { public static void main(String[] args) { test te=new test(); Sy ...

  7. JMS消息传递的类型

    对于消息的传递有两种类型: 一种是点对点的,即一个生产者和一个消费者一一对应: 另一种是发布/ 订阅模式,即一个生产者产生消息并进行发送后,可以由多个消费者进 行接收.

  8. Jmeter-maven-plugin github 版本插件变更历史

    https://github.com/jmeter-maven-plugin/jmeter-maven-plugin/blob/master/CHANGELOG.md

  9. 07-华为RAID2.0+技术

    目录 07-华为RAID2.0+技术 参考 RAID2.0+原理 07-华为RAID2.0+技术

  10. JavaScript图形实例:圆内螺线

    数学中有各式各样富含诗意的曲线,螺旋线就是其中比较特别的一类.螺旋线这个名词来源于希腊文,它的原意是“旋卷”或“缠卷”.例如,平面螺旋线便是以一个固定点开始向外逐圈旋绕而形成的曲线. 阿基米德螺线和黄 ...