PaperReading20200219
CanChen ggchen@mail.ustc.edu.cn
This is my first day sharing my reading paper and I will try to paraphrase core ideas in these papers with very simple words. Every paper consists three parts, namely, motivation, method, and contribution. In each part, no more than 3 sentences will be used.
NAS-Bench-201
- Motivation:Network search algorithms are often quite expensive and different search spaces also make it difficult for us to compare these algorithms. In fact, we can treat network architectures as X and their corresponding accuracies as Y, and construct a standard dataset to solve this problem.
- Method: Using cell-based strategy, we only need to find a cell and insert it into the macro structure. In the paper, 4 nodes and 5 operations are used, which means we need to train 15625 cells. As last,the author just trained 15625 models on cifar10,cifar100 and sampled-ImageNet,and provided us with the corresponding training logs.
- Contribution: The paper is a ICLR paper and is not very novel(at least I think). It shows us again: computing resources is very important. At least, it gives us a benchmark for NAS research and now we can use CPU to do NAS.
Peephole
- Motivation: Can we get the network's performance without training?
- Method: The author only considers sequential network architectures since we can treat the sequential network architectures as "a language". Then the author uses LSTM to deal with this problem like language modeling.
- Contribution: The work is kind of limited since it only deals with sequential network architecures while other structures such shortcut paths are in fact dominating this field.
Latency-aware
- Motivation: Current Darts algorithms do not take latency into consideration.
- Method: First, the author train a regression model that can predict a network's latency based on the network's structure. Then the author inserts this model into bi-level optimization equation as part of the loss function.
- Contribution: This work is an extension of Darts and can be very useful since latency is important in real scenarios.
PaperReading20200219的更多相关文章
随机推荐
- socket实现简单的FTP
一.开发环境 server端:centos 7 python-3.6.2 客户端:Windows 7 python-3.6.2 pycharm-2018 程序目的:1.学习使用socketserve ...
- 28 JavaScript语言类型&运算符
语言类型: 弱类型:可以改变变量值和对象类型 强类型:可以改变变量值不能改变对象类型 解释型:边编译边执行,速度慢.解释型一般是弱类型 编译型:先编译再执行(C++\Java),速度快.编译型一般是强 ...
- 吴裕雄 python 神经网络——TensorFlow训练神经网络:MNIST最佳实践
import os import tensorflow as tf from tensorflow.examples.tutorials.mnist import input_data INPUT_N ...
- 【剑指Offer面试编程题】题目1508:把字符串转换成整数--九度OJ
题目描述: 将一个字符串转换成一个整数,要求不能使用字符串转换整数的库函数. 输入: 输入可能包含多个测试样例. 对于每个测试案例,输入为一个合法或者非法的字符串,代表一个整数n(1<= n&l ...
- FileUpload之FileItem类的常用方法
http://blog.csdn.net/chinaliuyan/article/details/7002014
- 关于calendar修改前的代码和修改后的代码
Java编写的日历,输入年月,输出这个月的日期与星期 修改前的代码: import java.io.BufferedReader; import java.io.IOException; import ...
- 【SSM】Log4j 日志配置
1.log4j.properties ### 配置根 ### # log4j.rootLogger = debug,console ,fileAppender,dailyRollingFile,ROL ...
- 利用ProxySQL实现MySQL的读写分离
本文简单介绍ProxySQL的安装及如果实现后端MySQL主从结构的读写分离. 一.ProxySQL安装 Proxy官方地址:https://proxysql.com/ proxysql-2.0.8- ...
- 理解js中的原型链
对象有”prototype”属性,函数对象有”prototype”属性,原型对象有”constructor”属性. 关于原型 在JavaScript中,原型也是一个对象,通过原型可以实现对象的属性继承 ...
- springBoot整合mybatis-plus关闭自动转换小驼峰命名规则
增加配置信息 mybatis-plus: configuration: map-underscore-to-camel-case: false