《Deep Learning Face Attributes in the Wild》论文笔记

论文背景：

IEEE International Conference on Computer Vision 2015

Ziwei Liu1, Ping Luo1, Xiaogang Wang2, Xiaoou Tang1
1Department of Information Engineering, The Chinese University of Hong Kong
2Department of Electronic Engineering, The Chinese University of Hong Kong

论文贡献：

1.背景独立的情况下提升识别人脸的准确率，如下图与state_of_art的方案对比

2.识别人脸细节属性

3.开发者福音：提供了一个包含20万张标记了40个常用属性的人像数据库celebA（基于celebFace[1]）和LFWA（基于LFW[2]）

模型架构：

1.Lneto定位头部和肩部

2.Lnets进一步定位脸

3.Anet最后接全连接层进行属性预测

4.用SVM做多个全连接层的属性分类

具体网络结构，使用了参数局部共享和全局共享混合的策略：

More specifically, the network structures of LNeto and
LNets are the same as shown in Fig.3 (a) and (b), which
stack two max-pooling and five convolutional layers (C1 to

C5) with globally shared filters. These filters are recurrently

applied at every location of the image and are able to
account for large face translation and scaling. ANet stacks
four convolutional layers (C1 to C4), three max-pooling
layers, and one fully-connected layer (FC), where the filters
at C1 and C2 are globally shared, while the filters at C3
and C4 are locally shared. As shown in Fig.3 (c), the
response maps at C2 and C3 are divided into grids with
non-overlapping cells, each of which learns different filters.
The locally shared filters have been proved effective for
face related problems [24, 23], because they can capture
different information from different face parts. The network
structures are specified in Fig.3. For instance, the filters
at C1 of LNeto has 96 channels and the filter size in each
channel is 11113, as the input image xo contains three
color channels.

crop头像时可能会遭遇多目标检测问题，文章使用了每个位置求响应密度的空间距离的方法来解决

【1】Y. Sun, X. Wang, and X. Tang. Deep learning face
representation by joint identification-verification. In NIPS,
2014.

【2】G. B. Huang, M. Ramesh, T. Berg, and E. Learned-Miller.
Labeled faces in the wild: A database for studying face
recognition in unconstrained environments. Technical Report
07-49, University of Massachusetts, Amherst, October
2007.

一点随想：这个结合生成模型，比如gan，可能可以做一件有趣的事：根据语义生成带属性的角色

《Deep Learning Face Attributes in the Wild》论文笔记的更多相关文章

《Vision Permutator: A Permutable MLP-Like ArchItecture For Visual Recognition》论文笔记
论文题目:<Vision Permutator: A Permutable MLP-Like ArchItecture For Visual Recognition> 论文作者:Qibin ...
[place recognition]NetVLAD: CNN architecture for weakly supervised place recognition 论文翻译及解析（转）
https://blog.csdn.net/qq_32417287/article/details/80102466 abstract introduction method overview Dee ...
论文笔记系列-Auto-DeepLab:Hierarchical Neural Architecture Search for Semantic Image Segmentation
Pytorch实现代码:https://github.com/MenghaoGuo/AutoDeeplab 创新点 cell-level and network-level search 以往的NAS ...
论文笔记——Rethinking the Inception Architecture for Computer Vision
1. 论文思想 factorized convolutions and aggressive regularization. 本文给出了一些网络设计的技巧. 2. 结果用5G的计算量和25M的参数. ...
论文笔记：Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells 2019-04- ...
论文笔记：ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware 2019-03-19 16:13:18 Pape ...
论文笔记：DARTS: Differentiable Architecture Search
DARTS: Differentiable Architecture Search 2019-03-19 10:04:26accepted by ICLR 2019 Paper:https://arx ...
论文笔记：Progressive Neural Architecture Search
Progressive Neural Architecture Search 2019-03-18 20:28:13 Paper:http://openaccess.thecvf.com/conten ...
论文笔记：Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation
Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation2019-03-18 14:4 ...
论文笔记系列-DARTS: Differentiable Architecture Search
Summary 我的理解就是原本节点和节点之间操作是离散的,因为就是从若干个操作中选择某一个,而作者试图使用softmax和relaxation(松弛化)将操作连续化,所以模型结构搜索的任务就转变成了 ...

随机推荐

BZOJ 2561: 最小生成树【最小割/最大流】
Description 给定一个边带正权的连通无向图G=(V,E),其中N=|V|,M=|E|,N个点从1到N依次编号,给定三个正整数u,v,和L (u≠v),假设现在加入一条边权为L的边(u,v), ...
JAVA自定义监听器的示例代码
JAVA用户自定义事件监听完整例子 JAVA用户自定义事件监听完整例子- —sunfruit 很多介绍用户自定义事件都没有例子,或是例子不全,下面写了一个完整的例子,并写入了注释以便参考,完整 ...
Flex里监听mouseDownOutside事件解决弹出窗口点击空白关闭功能
其实当用户在使用 PopUpManager 打开的某个组件外部单击时,会从该组件分派一个mouseDownOutside事件监听该事件就能实现点击空白处关闭窗口的功能 this.addEventLi ...
IPTABLES基本例子
iptables –F #删除已经存在的规则 iptables -P INPUT DROP #配置默认的拒绝规则.基本规则是:先拒绝所有的服务,然后根据需要再添加新的规则. iptables -A I ...
项目中遇到的HQL查询问题
问题描写叙述: 目的:想要查询出全部最新版本号的组件说明:组件:版本号 =1:n关系 ,假设这个组件仅仅有一个版本号也要可以查出来. 项目中使用的是内存数据库,无法看到表结构,这里的样例仅仅用于模拟 ...
改动Android启动画面
一.Android的启动步骤 1.启动Linux 2.载入Android 3.显示Android桌面二.分析 Android载入开机动画的源代码文件是: /opt/android4.3/framew ...
Spring之IOC篇章具体解释
专题一 IOC 1.接口以及面向接口编程 a.结构设计中,分清层次以及调用关系,每层仅仅向外(或者上层)提供一组功能接口,各层间仅依赖接口而非实现类这样做的优点是,接口实现的变动不影响各层间的调用 ...
看懂JSP声明的格式。。。
在WebRoot下新建test3.jsp 改动body内容: <%! int a = 3; %> <% int b = 3; %> <%= a-- %& ...
LeetCode题解汇总
陆续更新至github... https://github.com/OliveLv/LeetCode/
I2C上拉电阻取值范围
I2C总线是微电子通信控制领域中常用的一种总线标准,具备接线少,控制简单,速率高等优点.在I2C电路中常见的上拉电阻有1k.1.5k.2.2k.4.7k.5.1k.10k等等,但是应该如何根据开发要求 ...

《Deep Learning Face Attributes in the Wild》论文笔记

《Deep Learning Face Attributes in the Wild》论文笔记的更多相关文章

随机推荐

热门专题