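A Caffe-based Implementation of DeepID2 (Part 3)

Miao's ramblings: this post really wore me out. But knowledge needs to be consolidated and shared, so I decided to write up everything that's left in this one.

Miao's blog: http://www.miaoerduo.com

Original post: http://www.miaoerduo.com/deep-learning/基于caffe的deepid2实现（下）.html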

4. Reorganizing the data: a simple split

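The Data layer from the earlier posts generates the input data in pairs, and the Normalization layer normalizes the features. So can we go ahead and train with the ContrastiveLoss layer now?

Not so fast. There's one more step.

The ContrastiveLoss layer requires three bottoms: feature1, feature2, and a label indicating whether the features at each position belong to the same identity.

But the features we have now are all lumped together in one blob, and the label that comes straight from the data layer is not the label required here. So the data has to be reorganized first.

A simple rule is to split the features into two parts by odd and even index. The two parts then line up so that entries at the same position form a pair. The labels could be reorganized in a similar way. Here I only reorganize the features, and handle the labels by modifying the ContrastiveLoss layer instead.

Reorganizing the features is essentially a slicing operation. I named the layer id2_slice_layer. In the forward pass it copies the bottom data to the two tops according to odd/even index; in the backward pass it copies the diffs of the two tops straight back into the matching positions of bottom_diff. The implementation is as follows: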

 // created by miao
 #ifndef CAFFE_ID2_SLICE_LAYER_HPP_
 #define CAFFE_ID2_SLICE_LAYER_HPP_

 #include <vector>

 #include "caffe/blob.hpp"
 #include "caffe/layer.hpp"
 #include "caffe/proto/caffe.pb.h"

 namespace caffe {

 /**
  * @brief Splits a Blob along the num dimension by index parity:
  *        even-indexed items go to top[0], odd-indexed items go to top[1].
  *
  * TODO(dox): thorough documentation for Forward, Backward, and proto params.
  */
 template <typename Dtype>
 class Id2SliceLayer : public Layer<Dtype> {
  public:
   explicit Id2SliceLayer(const LayerParameter& param)
       : Layer<Dtype>(param) {}
   virtual void LayerSetUp(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);
   virtual void Reshape(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);

   virtual inline const char* type() const { return "Id2Slice"; }
   virtual inline int ExactNumBottomBlobs() const { return 1; }
   // top[0] and top[1] are both written, so two tops are required.
   virtual inline int MinTopBlobs() const { return 2; }

  protected:
   virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);
   virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);
   virtual void Backward_cpu(const vector<Blob<Dtype>*>& top,
       const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
   virtual void Backward_gpu(const vector<Blob<Dtype>*>& top,
       const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
 };

 }  // namespace caffe

 #endif  // CAFFE_ID2_SLICE_LAYER_HPP_

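The header file is dead simple...

The .cpp code is also very simple. Note that the id2_slice layer has two tops, and each has half the shape of the bottom (half as many items along the num dimension).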

 // created by miao
 #include <algorithm>
 #include <vector>

 #include "caffe/layers/id2_slice_layer.hpp"
 #include "caffe/util/math_functions.hpp"

 namespace caffe {

 template <typename Dtype>
 void Id2SliceLayer<Dtype>::LayerSetUp(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top) {
 }

 template <typename Dtype>
 void Id2SliceLayer<Dtype>::Reshape(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top) {
     // The batch is made of pairs, so it must hold an even number of items.
     CHECK_EQ(bottom[0]->num() % 2, 0) << "Batch size must be even.";
     // Each top holds half of the items in the bottom.
     vector<int> top_shape = bottom[0]->shape();
     top_shape[0] /= 2;
     top[0]->Reshape(top_shape);
     top[1]->Reshape(top_shape);
 }

 template <typename Dtype>
 void Id2SliceLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top) {
     // Number of elements in one item (everything after the num axis).
     const int feature_size = bottom[0]->count(1);
     for (int n = 0; n < bottom[0]->num(); ++n) {
         // Item n goes to top[n & 1] (even -> top[0], odd -> top[1]),
         // at position n / 2 inside that top.
         caffe_copy(
                 feature_size,
                 bottom[0]->cpu_data() + n * feature_size,
                 top[n & 1]->mutable_cpu_data() + (n / 2) * feature_size
                 );
     }
 }

 template <typename Dtype>
 void Id2SliceLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
       const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
     if (!propagate_down[0]) { return; }
     const int feature_size = bottom[0]->count(1);
     for (int n = 0; n < bottom[0]->num(); ++n) {
         // Reverse of the forward copy: gather the diffs back in order.
         caffe_copy(
                 feature_size,
                 top[n & 1]->cpu_diff() + (n / 2) * feature_size,
                 bottom[0]->mutable_cpu_diff() + n * feature_size
                 );
     }
 }

 #ifdef CPU_ONLY
 STUB_GPU(Id2SliceLayer);
 #endif

 INSTANTIATE_CLASS(Id2SliceLayer);
 REGISTER_LAYER_CLASS(Id2Slice);

 }  // namespace caffe

For simplicity, the GPU implementation just calls the CPU forward and backward functions.

 // created by miao
 #include <vector>

 #include "caffe/layers/id2_slice_layer.hpp"
 #include "caffe/util/math_functions.hpp"

 namespace caffe {

 template <typename Dtype>
 void Id2SliceLayer<Dtype>::Forward_gpu(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top) {
     this->Forward_cpu(bottom, top);
 }

 template <typename Dtype>
 void Id2SliceLayer<Dtype>::Backward_gpu(const vector<Blob<Dtype>*>& top,
       const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
     this->Backward_cpu(top, propagate_down, bottom);
 }

 INSTANTIATE_LAYER_GPU_FUNCS(Id2SliceLayer);

 }  // namespace caffe

That completes the feature reorganization. No new parameters are used, so caffe.proto doesn't need to change.
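To see the index arithmetic in isolation, here is a minimal standalone sketch of the same odd/even split and its inverse on plain `std::vector`s, without any Caffe dependency. The function names `SplitPairs` and `MergePairs` are made up for this illustration and are not part of the layer above.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Splits a row-major batch of rows (each `dim` floats) into the even-indexed
// rows and the odd-indexed rows, mirroring the layer's forward pass.
// Hypothetical helper, for illustration only.
std::pair<std::vector<float>, std::vector<float>> SplitPairs(
    const std::vector<float>& batch, std::size_t dim) {
  assert(dim > 0 && batch.size() % (2 * dim) == 0);
  const std::size_t num = batch.size() / dim;
  std::vector<float> even, odd;
  even.reserve(batch.size() / 2);
  odd.reserve(batch.size() / 2);
  for (std::size_t n = 0; n < num; ++n) {
    // Row n goes to the "odd" half when its lowest bit is set.
    std::vector<float>& dst = (n & 1) ? odd : even;
    dst.insert(dst.end(), batch.begin() + n * dim,
               batch.begin() + (n + 1) * dim);
  }
  return {even, odd};
}

// Interleaves the two halves back into one batch, mirroring what the
// backward pass does with the diffs.
std::vector<float> MergePairs(const std::vector<float>& even,
                              const std::vector<float>& odd,
                              std::size_t dim) {
  assert(dim > 0 && even.size() == odd.size() && even.size() % dim == 0);
  std::vector<float> out;
  out.reserve(even.size() * 2);
  for (std::size_t p = 0; p < even.size() / dim; ++p) {
    out.insert(out.end(), even.begin() + p * dim, even.begin() + (p + 1) * dim);
    out.insert(out.end(), odd.begin() + p * dim, odd.begin() + (p + 1) * dim);
  }
  return out;
}
```

With four rows of two values each, rows 0 and 2 end up in the first half and rows 1 and 3 in the second, so row i of one half is paired with row i of the other.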

You can handle the labels with the same approach. Since I'm a bit lazy... I simply modified the ContrastiveLoss layer's code instead.
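Either way, what has to come out of the labels is one flag per pair. A minimal sketch of that computation, assuming the identity labels arrive interleaved in the same order as the features (`PairLabels` is a hypothetical name used only here):

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Given identity labels laid out as [a0, b0, a1, b1, ...], returns one flag
// per pair: 1 if both samples share an identity, 0 otherwise.
// Hypothetical helper, for illustration only.
std::vector<int> PairLabels(const std::vector<int>& identity_labels) {
  assert(identity_labels.size() % 2 == 0);
  std::vector<int> is_same(identity_labels.size() / 2);
  for (std::size_t i = 0; i < is_same.size(); ++i) {
    is_same[i] =
        (identity_labels[2 * i] == identity_labels[2 * i + 1]) ? 1 : 0;
  }
  return is_same;
}
```

This is the same comparison the modified ContrastiveLoss layer performs on its third bottom during the forward pass.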

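Step one: add a member variable to ContrastiveLossLayer that records whether each feature pair shares the same identity, taking over the role the third bottom used to play. (The third bottom itself stays, but it now carries the raw identity labels from the data layer, two per pair.) The flags only need to be computed at the start of the forward pass; after that they can be used wherever the third bottom was used before, with no other code changes.

For convenience, here is the full modified header (comments removed). The new line is marked with the comment "added by miao". Only one line was added to the header.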

 #ifndef CAFFE_CONTRASTIVE_LOSS_LAYER_HPP_
 #define CAFFE_CONTRASTIVE_LOSS_LAYER_HPP_

 #include <vector>

 #include "caffe/blob.hpp"
 #include "caffe/layer.hpp"
 #include "caffe/proto/caffe.pb.h"

 #include "caffe/layers/loss_layer.hpp"

 namespace caffe {

 template <typename Dtype>
 class ContrastiveLossLayer : public LossLayer<Dtype> {
  public:
   explicit ContrastiveLossLayer(const LayerParameter& param)
       : LossLayer<Dtype>(param), diff_() {}
   virtual void LayerSetUp(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);

   virtual inline int ExactNumBottomBlobs() const { return 3; }
   virtual inline const char* type() const { return "ContrastiveLoss"; }
   virtual inline bool AllowForceBackward(const int bottom_index) const {
     return bottom_index != 2;
   }

  protected:
   /// @copydoc ContrastiveLossLayer
   virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);
   virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom,
       const vector<Blob<Dtype>*>& top);
   virtual void Backward_cpu(const vector<Blob<Dtype>*>& top,
       const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
   virtual void Backward_gpu(const vector<Blob<Dtype>*>& top,
       const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);

   Blob<Dtype> diff_;  // cached for backward pass
   Blob<Dtype> dist_sq_;  // cached for backward pass
   Blob<Dtype> diff_sq_;  // tmp storage for gpu forward pass
   Blob<Dtype> summer_vec_;  // tmp storage for gpu forward pass
   Blob<Dtype> is_same_;  // added by miao
 };

 }  // namespace caffe

 #endif  // CAFFE_CONTRASTIVE_LOSS_LAYER_HPP_

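The changes to the source file are just as simple; only the CUDA part is shown here. The edits are all in places that used the original third bottom (bottom[2]).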

 #include <algorithm>
 #include <vector>
 #include <iostream>

 #include "caffe/layers/contrastive_loss_layer.hpp"
 #include "caffe/util/math_functions.hpp"

 namespace caffe {

 template <typename Dtype>
 void ContrastiveLossLayer<Dtype>::Forward_gpu(
     const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
   const int count = bottom[0]->count();
   caffe_gpu_sub(
       count,
       bottom[0]->gpu_data(),  // a
       bottom[1]->gpu_data(),  // b
       diff_.mutable_gpu_data());  // a_i-b_i
   caffe_gpu_powx(
       count,
       diff_.mutable_gpu_data(),  // a_i-b_i
       Dtype(2),
       diff_sq_.mutable_gpu_data());  // (a_i-b_i)^2
   caffe_gpu_gemv(
       CblasNoTrans,
       bottom[0]->num(),
       bottom[0]->channels(),
       Dtype(1.0),
       diff_sq_.gpu_data(),  // (a_i-b_i)^2
       summer_vec_.gpu_data(),
       Dtype(0.0),
       dist_sq_.mutable_gpu_data());  // \Sum (a_i-b_i)^2
   Dtype margin = this->layer_param_.contrastive_loss_param().margin();
   bool legacy_version =
       this->layer_param_.contrastive_loss_param().legacy_version();
   Dtype loss(0.0);
   for (int i = 0; i < bottom[0]->num(); ++i) {
     // added by miao
     // bottom[2] holds the raw identity labels, two per pair; pair i is the
     // same identity when labels 2*i and 2*i+1 match.
     is_same_.mutable_cpu_data()[i] =
         (bottom[2]->cpu_data()[2 * i] == bottom[2]->cpu_data()[2 * i + 1])
             ? 1 : 0;
     if (is_same_.cpu_data()[i] == 1) {  // similar pairs
       loss += dist_sq_.cpu_data()[i];
     } else {  // dissimilar pairs
       if (legacy_version) {
         loss += std::max(margin - dist_sq_.cpu_data()[i], Dtype(0.0));
       } else {
         Dtype dist = std::max(margin - sqrt(dist_sq_.cpu_data()[i]),
                               Dtype(0.0));
         loss += dist*dist;
       }
     }
   }
   loss = loss / static_cast<Dtype>(bottom[0]->num()) / Dtype(2);
   top[0]->mutable_cpu_data()[0] = loss;
 }

 template <typename Dtype>
 __global__ void CLLBackward(const int count, const int channels,
     const Dtype margin, const bool legacy_version, const Dtype alpha,
     const Dtype* y, const Dtype* diff, const Dtype* dist_sq,
     Dtype *bottom_diff) {
   CUDA_KERNEL_LOOP(i, count) {
     int n = i / channels;  // the num index, to access y and dist_sq
     if (static_cast<int>(y[n])) {  // similar pairs
       bottom_diff[i] = alpha * diff[i];
     } else {  // dissimilar pairs
       Dtype mdist(0.0);
       Dtype beta(0.0);
       if (legacy_version) {
         mdist = (margin - dist_sq[n]);
         beta = -alpha;
       } else {
         Dtype dist = sqrt(dist_sq[n]);
         mdist = (margin - dist);
         beta = -alpha * mdist / (dist + Dtype(1e-4)) * diff[i];
       }
       if (mdist > 0.0) {
         bottom_diff[i] = beta;
       } else {
         bottom_diff[i] = 0;
       }
     }
   }
 }

 template <typename Dtype>
 void ContrastiveLossLayer<Dtype>::Backward_gpu(const vector<Blob<Dtype>*>& top,
     const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
   for (int i = 0; i < 2; ++i) {
     if (propagate_down[i]) {
       const int count = bottom[0]->count();
       const int channels = bottom[0]->channels();
       Dtype margin = this->layer_param_.contrastive_loss_param().margin();
       const bool legacy_version =
           this->layer_param_.contrastive_loss_param().legacy_version();
       const Dtype sign = (i == 0) ? 1 : -1;
       const Dtype alpha = sign * top[0]->cpu_diff()[0] /
           static_cast<Dtype>(bottom[0]->num());
       // NOLINT_NEXT_LINE(whitespace/operators)
       CLLBackward<Dtype><<<CAFFE_GET_BLOCKS(count), CAFFE_CUDA_NUM_THREADS>>>(
           count, channels, margin, legacy_version, alpha,
           is_same_.gpu_data(),  // pair similarity 0 or 1  added by miao
           diff_.gpu_data(),  // the cached eltwise difference between a and b
           dist_sq_.gpu_data(),  // the cached square distance between a and b
           bottom[i]->mutable_gpu_diff());
       CUDA_POST_KERNEL_CHECK;
     }
   }
 }

 INSTANTIATE_LAYER_GPU_FUNCS(ContrastiveLossLayer);

 }  // namespace caffe

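Note that both the forward and the backward pass need small code changes. They are simple, but still easy to get wrong. The CPU code in the .cpp file (not shown) needs the same edits, and is_same_ has to be given a shape with one entry per pair, presumably in LayerSetUp, before the forward pass writes to it, since a default-constructed Blob has no storage.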
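One way to guard against mistakes here is to compare the layer's output against a small reference on the CPU. The sketch below is such a reference for the non-legacy branch only, written without Caffe and under the same assumption as the code above, namely that the labels are interleaved two per pair. It computes loss = 1/(2N) * sum_i [ y_i * d_i^2 + (1 - y_i) * max(margin - d_i, 0)^2 ], where d_i is the Euclidean distance of pair i and y_i is 1 for same-identity pairs. The name `ContrastiveLossRef` is hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// CPU reference for the non-legacy contrastive loss over N pairs.
// a, b:   N x dim row-major features (the two halves after the split).
// labels: 2N identity labels, interleaved as [a0, b0, a1, b1, ...].
// Hypothetical helper, for illustration only.
double ContrastiveLossRef(const std::vector<double>& a,
                          const std::vector<double>& b,
                          const std::vector<int>& labels,
                          std::size_t dim, double margin) {
  assert(dim > 0 && a.size() == b.size() && a.size() % dim == 0);
  const std::size_t num = a.size() / dim;
  assert(num > 0 && labels.size() == 2 * num);
  double loss = 0.0;
  for (std::size_t i = 0; i < num; ++i) {
    // Squared Euclidean distance between the two features of pair i.
    double dist_sq = 0.0;
    for (std::size_t c = 0; c < dim; ++c) {
      const double d = a[i * dim + c] - b[i * dim + c];
      dist_sq += d * d;
    }
    const bool is_same = labels[2 * i] == labels[2 * i + 1];
    if (is_same) {
      loss += dist_sq;  // pull similar pairs together
    } else {
      // Push dissimilar pairs apart until they are `margin` away.
      const double m = std::max(margin - std::sqrt(dist_sq), 0.0);
      loss += m * m;
    }
  }
  return loss / static_cast<double>(num) / 2.0;
}
```

Feeding the same features and labels to the modified layer and to this function should give matching loss values up to floating-point error, provided legacy_version is off.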

That completes all the changes needed for DeepID2 on Caffe.

If you found this post helpful, feel free to buy Miao a cup of tea ~~O(∩_∩)O~~

Please credit the source when reposting.
