OpenCV:OpenCV目标检测Hog+SWindow源代码分析

此文主要描述出HOG分类的调用堆栈。

使用OpenCV作图像检测，使用HOG检测过程，其中一部分源代码如下：

1.HOG 检测底层栈的检测计算代码：

貌似在计算过程中仅使用滑窗方法？

void HOGDescriptor::detect(const Mat& img,

    vector<Point>& hits, vector<double>& weights, double hitThreshold,

    Size winStride, Size padding, const vector<Point>& locations) const

{

    hits.clear();

    if( svmDetector.empty() )

        return;

    if( winStride == Size() )

        winStride = cellSize;

    Size cacheStride(gcd(winStride.width, blockStride.width),

                     gcd(winStride.height, blockStride.height));

    size_t nwindows = locations.size();

    padding.width = (int)alignSize(std::max(padding.width, 0), cacheStride.width);

    padding.height = (int)alignSize(std::max(padding.height, 0), cacheStride.height);

    Size paddedImgSize(img.cols + padding.width*2, img.rows + padding.height*2);

    HOGCache cache(this, img, padding, padding, nwindows == 0, cacheStride);

    if( !nwindows )

        nwindows = cache.windowsInImage(paddedImgSize, winStride).area();

    const HOGCache::BlockData* blockData = &cache.blockData[0];

    int nblocks = cache.nblocks.area();

    int blockHistogramSize = cache.blockHistogramSize;

    size_t dsize = getDescriptorSize();

    double rho = svmDetector.size() > dsize ? svmDetector[dsize] : 0;

    vector<float> blockHist(blockHistogramSize);

    for( size_t i = 0; i < nwindows; i++ )

    {

        Point pt0;

        if( !locations.empty() )

        {

            pt0 = locations[i];

            if( pt0.x < -padding.width || pt0.x > img.cols + padding.width - winSize.width ||

                pt0.y < -padding.height || pt0.y > img.rows + padding.height - winSize.height )

                continue;

        }

        else

        {

            pt0 = cache.getWindow(paddedImgSize, winStride, (int)i).tl() - Point(padding);

            CV_Assert(pt0.x % cacheStride.width == 0 && pt0.y % cacheStride.height == 0);

        }

        double s = rho;

        const float* svmVec = &svmDetector[0];

#ifdef HAVE_IPP

        int j;

#else

        int j, k;

#endif

        for( j = 0; j < nblocks; j++, svmVec += blockHistogramSize )

        {

            const HOGCache::BlockData& bj = blockData[j];

            Point pt = pt0 + bj.imgOffset;

            const float* vec = cache.getBlock(pt, &blockHist[0]);

#ifdef HAVE_IPP

            Ipp32f partSum;

            ippsDotProd_32f(vec,svmVec,blockHistogramSize,&partSum);

            s += (double)partSum;

#else

            for( k = 0; k <= blockHistogramSize - 4; k += 4 )

                s += vec[k]*svmVec[k] + vec[k+1]*svmVec[k+1] +

                    vec[k+2]*svmVec[k+2] + vec[k+3]*svmVec[k+3];

            for( ; k < blockHistogramSize; k++ )

                s += vec[k]*svmVec[k];

#endif

        }

        if( s >= hitThreshold )

        {

            hits.push_back(pt0);

            weights.push_back(s);

        }

    }

}

2. HOG invoker的对象重载：

    void operator()( const Range& range ) const

    {

        int i, i1 = range.start, i2 = range.end;

        double minScale = i1 > 0 ? levelScale[i1] : i2 > 1 ? levelScale[i1+1] : std::max(img.cols, img.rows);

        Size maxSz(cvCeil(img.cols/minScale), cvCeil(img.rows/minScale));

        Mat smallerImgBuf(maxSz, img.type());

        vector<Point> locations;

        vector<double> hitsWeights;

        for( i = i1; i < i2; i++ )

        {

            double scale = levelScale[i];

            Size sz(cvRound(img.cols/scale), cvRound(img.rows/scale));

            Mat smallerImg(sz, img.type(), smallerImgBuf.data);

            if( sz == img.size() )

                smallerImg = Mat(sz, img.type(), img.data, img.step);

            else

                resize(img, smallerImg, sz);

            //使用HOG 进行检测

           hog->detect(smallerImg, locations, hitsWeights, hitThreshold, winStride, padding);

            Size scaledWinSize = Size(cvRound(hog->winSize.width*scale), cvRound(hog->winSize.height*scale));

            mtx->lock();

            for( size_t j = 0; j < locations.size(); j++ )

            {

                vec->push_back(Rect(cvRound(locations[j].x*scale),

                                    cvRound(locations[j].y*scale),

                                    scaledWinSize.width, scaledWinSize.height));

                if (scales)

                {

                    scales->push_back(scale);

                }

            }

            mtx->unlock();

            if (weights && (!hitsWeights.empty()))

            {

                mtx->lock();

                for (size_t j = 0; j < locations.size(); j++)

                {

                    weights->push_back(hitsWeights[j]);

                }

                mtx->unlock();

            }

        }

    }

3.使用HOG特征进行多尺度检测

void HOGDescriptor::detectMultiScale(

    const Mat& img, vector<Rect>& foundLocations, vector<double>& foundWeights,

    double hitThreshold, Size winStride, Size padding,

    double scale0, double finalThreshold, bool useMeanshiftGrouping) const

{

    double scale = 1.;

    int levels = 0;

    vector<double> levelScale;

    for( levels = 0; levels < nlevels; levels++ )

    {

        levelScale.push_back(scale);

        if( cvRound(img.cols/scale) < winSize.width ||

            cvRound(img.rows/scale) < winSize.height ||

            scale0 <= 1 )

            break;

        scale *= scale0;

    }

    levels = std::max(levels, 1);

    levelScale.resize(levels);

    std::vector<Rect> allCandidates;

    std::vector<double> tempScales;

    std::vector<double> tempWeights;

    std::vector<double> foundScales;

    Mutex mtx;

    parallel_for_(Range(0, (int)levelScale.size()),

                 HOGInvoker(this, img, hitThreshold, winStride, padding, &levelScale[0], &allCandidates, &mtx, &tempWeights, &tempScales));

    std::copy(tempScales.begin(), tempScales.end(), back_inserter(foundScales));

    foundLocations.clear();

    std::copy(allCandidates.begin(), allCandidates.end(), back_inserter(foundLocations));

    foundWeights.clear();

    std::copy(tempWeights.begin(), tempWeights.end(), back_inserter(foundWeights));

    if ( useMeanshiftGrouping )

    {

        groupRectangles_meanshift(foundLocations, foundWeights, foundScales, finalThreshold, winSize);

    }

    else

    {

        groupRectangles(foundLocations, foundWeights, (int)finalThreshold, 0.2);

    }

}

其中得到HogCache也是重要的一环：

独立为init函数：

HOGCache::HOGCache(const HOGDescriptor* _descriptor,

        const Mat& _img, Size _paddingTL, Size _paddingBR,

        bool _useCache, Size _cacheStride)

{

    init(_descriptor, _img, _paddingTL, _paddingBR, _useCache, _cacheStride);

}

void HOGCache::init(const HOGDescriptor* _descriptor,

        const Mat& _img, Size _paddingTL, Size _paddingBR,

        bool _useCache, Size _cacheStride)

{

    descriptor = _descriptor;

    cacheStride = _cacheStride;

    useCache = _useCache;

    descriptor->computeGradient(_img, grad, qangle, _paddingTL, _paddingBR);

    imgoffset = _paddingTL;

    winSize = descriptor->winSize;

    Size blockSize = descriptor->blockSize;

    Size blockStride = descriptor->blockStride;

    Size cellSize = descriptor->cellSize;

    int i, j, nbins = descriptor->nbins;

    int rawBlockSize = blockSize.width*blockSize.height;

    nblocks = Size((winSize.width - blockSize.width)/blockStride.width + 1,

                   (winSize.height - blockSize.height)/blockStride.height + 1);

    ncells = Size(blockSize.width/cellSize.width, blockSize.height/cellSize.height);

    blockHistogramSize = ncells.width*ncells.height*nbins;

    if( useCache )

    {

        Size cacheSize((grad.cols - blockSize.width)/cacheStride.width+1,

                       (winSize.height/cacheStride.height)+1);

        blockCache.create(cacheSize.height, cacheSize.width*blockHistogramSize);

        blockCacheFlags.create(cacheSize);

        size_t cacheRows = blockCache.rows;

        ymaxCached.resize(cacheRows);

        for(size_t ii = 0; ii < cacheRows; ii++ )

            ymaxCached[ii] = -1;

    }

    Mat_<float> weights(blockSize);

    float sigma = (float)descriptor->getWinSigma();

    float scale = 1.f/(sigma*sigma*2);

    for(i = 0; i < blockSize.height; i++)

        for(j = 0; j < blockSize.width; j++)

        {

            float di = i - blockSize.height*0.5f;

            float dj = j - blockSize.width*0.5f;

            weights(i,j) = std::exp(-(di*di + dj*dj)*scale);

        }

    blockData.resize(nblocks.width*nblocks.height);

    pixData.resize(rawBlockSize*3);

    // Initialize 2 lookup tables, pixData & blockData.

    // Here is why:

    //

    // The detection algorithm runs in 4 nested loops (at each pyramid layer):

    //  loop over the windows within the input image

    //    loop over the blocks within each window

    //      loop over the cells within each block

    //        loop over the pixels in each cell

    //

    // As each of the loops runs over a 2-dimensional array,

    // we could get 8(!) nested loops in total, which is very-very slow.

    //

    // To speed the things up, we do the following:

    //   1. loop over windows is unrolled in the HOGDescriptor::{compute|detect} methods;

    //         inside we compute the current search window using getWindow() method.

    //         Yes, it involves some overhead (function call + couple of divisions),

    //         but it's tiny in fact.

    //   2. loop over the blocks is also unrolled. Inside we use pre-computed blockData[j]

    //         to set up gradient and histogram pointers.

    //   3. loops over cells and pixels in each cell are merged

    //       (since there is no overlap between cells, each pixel in the block is processed once)

    //      and also unrolled. Inside we use PixData[k] to access the gradient values and

    //      update the histogram

    //

    count1 = count2 = count4 = 0;

    for( j = 0; j < blockSize.width; j++ )

        for( i = 0; i < blockSize.height; i++ )

        {

            PixData* data = 0;

            float cellX = (j+0.5f)/cellSize.width - 0.5f;

            float cellY = (i+0.5f)/cellSize.height - 0.5f;

            int icellX0 = cvFloor(cellX);

            int icellY0 = cvFloor(cellY);

            int icellX1 = icellX0 + 1, icellY1 = icellY0 + 1;

            cellX -= icellX0;

            cellY -= icellY0;

            if( (unsigned)icellX0 < (unsigned)ncells.width &&

                (unsigned)icellX1 < (unsigned)ncells.width )

            {

                if( (unsigned)icellY0 < (unsigned)ncells.height &&

                    (unsigned)icellY1 < (unsigned)ncells.height )

                {

                    data = &pixData[rawBlockSize*2 + (count4++)];

                    data->histOfs[0] = (icellX0*ncells.height + icellY0)*nbins;

                    data->histWeights[0] = (1.f - cellX)*(1.f - cellY);

                    data->histOfs[1] = (icellX1*ncells.height + icellY0)*nbins;

                    data->histWeights[1] = cellX*(1.f - cellY);

                    data->histOfs[2] = (icellX0*ncells.height + icellY1)*nbins;

                    data->histWeights[2] = (1.f - cellX)*cellY;

                    data->histOfs[3] = (icellX1*ncells.height + icellY1)*nbins;

                    data->histWeights[3] = cellX*cellY;

                }

                else

                {

                    data = &pixData[rawBlockSize + (count2++)];

                    if( (unsigned)icellY0 < (unsigned)ncells.height )

                    {

                        icellY1 = icellY0;

                        cellY = 1.f - cellY;

                    }

                    data->histOfs[0] = (icellX0*ncells.height + icellY1)*nbins;

                    data->histWeights[0] = (1.f - cellX)*cellY;

                    data->histOfs[1] = (icellX1*ncells.height + icellY1)*nbins;

                    data->histWeights[1] = cellX*cellY;

                    data->histOfs[2] = data->histOfs[3] = 0;

                    data->histWeights[2] = data->histWeights[3] = 0;

                }

            }

            else

            {

                if( (unsigned)icellX0 < (unsigned)ncells.width )

                {

                    icellX1 = icellX0;

                    cellX = 1.f - cellX;

                }

                if( (unsigned)icellY0 < (unsigned)ncells.height &&

                    (unsigned)icellY1 < (unsigned)ncells.height )

                {

                    data = &pixData[rawBlockSize + (count2++)];

                    data->histOfs[0] = (icellX1*ncells.height + icellY0)*nbins;

                    data->histWeights[0] = cellX*(1.f - cellY);

                    data->histOfs[1] = (icellX1*ncells.height + icellY1)*nbins;

                    data->histWeights[1] = cellX*cellY;

                    data->histOfs[2] = data->histOfs[3] = 0;

                    data->histWeights[2] = data->histWeights[3] = 0;

                }

                else

                {

                    data = &pixData[count1++];

                    if( (unsigned)icellY0 < (unsigned)ncells.height )

                    {

                        icellY1 = icellY0;

                        cellY = 1.f - cellY;

                    }

                    data->histOfs[0] = (icellX1*ncells.height + icellY1)*nbins;

                    data->histWeights[0] = cellX*cellY;

                    data->histOfs[1] = data->histOfs[2] = data->histOfs[3] = 0;

                    data->histWeights[1] = data->histWeights[2] = data->histWeights[3] = 0;

                }

            }

            data->gradOfs = (grad.cols*i + j)*2;

            data->qangleOfs = (qangle.cols*i + j)*2;

            data->gradWeight = weights(i,j);

        }

    assert( count1 + count2 + count4 == rawBlockSize );

    // defragment pixData

    for( j = 0; j < count2; j++ )

        pixData[j + count1] = pixData[j + rawBlockSize];

    for( j = 0; j < count4; j++ )

        pixData[j + count1 + count2] = pixData[j + rawBlockSize*2];

    count2 += count1;

    count4 += count2;

    // initialize blockData

    for( j = 0; j < nblocks.width; j++ )

        for( i = 0; i < nblocks.height; i++ )

        {

            BlockData& data = blockData[j*nblocks.height + i];

            data.histOfs = (j*nblocks.height + i)*blockHistogramSize;

            data.imgOffset = Point(j*blockStride.width,i*blockStride.height);

        }

}

总结：

以上大致为HOG检测计算大致的函数调用堆栈。

OpenCV:OpenCV目标检测Hog+SWindow源代码分析的更多相关文章

OpenCV:OpenCV目标检测Adaboost+haar源代码分析
使用OpenCV作图像检测, Adaboost+haar决策过程,其中一部分源代码如下: 函数调用堆栈的底层为: 1.使用有序决策桩进行预测 template<class FEval> i ...
OpenCV亚像素角点cornerSubPixel()源代码分析
上一篇博客中讲到了goodFeatureToTrack()这个API函数能够获取图像中的强角点.但是获取的角点坐标是整数,但是通常情况下,角点的真实位置并不一定在整数像素位置,因此为了获取更为精确的角 ...
10分钟学会使用YOLO及Opencv实现目标检测（下）|附源码
将YOLO应用于视频流对象检测首先打开 yolo_video.py文件并插入以下代码: # import the necessary packages import numpy as np impo ...
OpenCV两种畸变校正模型源代码分析以及CUDA实现
图像算法中会经常用到摄像机的畸变校正,有必要总结分析OpenCV中畸变校正方法,其中包括普通针孔相机模型和鱼眼相机模型fisheye两种畸变校正方法. 普通相机模型畸变校正函数针对OpenCV中的cv ...
运动目标前景检测之ViBe源代码分析
一方面为了学习,一方面按照老师和项目的要求接触到了前景提取的相关知识,具体的方法有很多,帧差.背景减除(GMM.CodeBook. SOBS. SACON. VIBE. W4.多帧平均……).光流(稀 ...
目标检测——HOG特征
1.HOG特征: 方向梯度直方图(Histogram of Oriented Gradient, HOG)特征是一种在计算机视觉和图像处理中用来进行物体检测的特征描述子.它通过计算和统计图像局部区域的 ...
深度学习 + OpenCV，Python实现实时视频目标检测
使用 OpenCV 和 Python 对实时视频流进行深度学习目标检测是非常简单的,我们只需要组合一些合适的代码,接入实时视频,随后加入原有的目标检测功能. 在本文中我们将学习如何扩展原有的目标检测项 ...
OpenCV 学习笔记 07 目标检测与识别
目标检测与识别是计算机视觉中最常见的挑战之一.属于高级主题. 本章节将扩展目标检测的概念,首先探讨人脸识别技术,然后将该技术应用到显示生活中的各种目标检测. 1 目标检测与识别技术为了与OpenCV ...
目标检测之harr---角点检测harr 的opencv实现
本系列文章由@浅墨_毛星云出品,转载请注明出处. 文章链接: http://blog.csdn.net/poem_qianmo/article/details/29356187 作者:毛星云(浅墨) ...

随机推荐

【IntelliJ IDEA】idea上安装Translation插件后，需要AppKey才能生效的解决方案
使用idea安装的翻译插件translation,但是使用的时候并不友好无奈,如果想使用翻译软件并且更方便的话,可以如下: 可以选择将translation进行卸载清除缓存并进行重启然后再启动之 ...
【Codeforces 1114D】Flood Fill
[链接] 我是链接,点我呀:) [题意] 你选择一个point作为start_position 然后每次你可以将包含该start_position的所有联通块变成任意颜色问你最少要多少次变换才能将所 ...
[luoguP2024] 食物链（并查集）
传送门经典的并查集问题对于这种问题,并查集需要分类开3*n的并查集,其中x用来连接与x同类的,x+n用来连接x吃的,x+2*n用来连接x被吃的. 1 x y时,如果 x吃y 或 x被y吃,那么为 ...
Openfire：重新配置openfire
有些时候当我们在对openfire开发时,需要重置openfire的配置,这时最简单的方法就是重新运行openfire的安装程序.要重新运行安装程序,方法很简单: 打开openfire的安装目录,找到 ...
（二）模板引擎之Velocity脚本基本的语法全
velocity velocity三种reference 变量:对java对象的一种字符串化表示,返回值调用了java的toString()方法的结果. 方法:调用的是对象的某个方法. ...
CSS3选择器（全）
CSS选择器复习通用选择器:* 选择到全部的元素选择子元素:> 选择到元素的直接后代(第一级子元素) 相邻兄弟选择器:+ 选择到紧随目标元素后的第一个元素普通兄弟选择器:~ 选择到紧随其后 ...
leetcode中，代码怎样调试，创造本地执行环境
初次接触leetcode,是我在一个招聘站点上看的,这个OJ真有那么厉害吗? 这几天在这个OJ上做了几道题,发现他的几个特点,1.题目不难(相对于ACM来说,我被ACM虐到至今无力),评判没那么苛刻, ...
记录cocos2d-x3.0版本号更改内容官方说明
http://www.cocos2d-x.org/wiki/Release_Notes_for_Cocos2d-x_v300
UNIX环境高级编程之第３章：文件I/O
3.1 引言文件I/O函数:打开文件,读文件,写文件经常使用到五个函数:open, read, write, lseek, close. 本章描写叙述的函数都是:不带缓冲的I/O(unbuffer ...
EarthWarrior3D游戏ios源代码
这是一款不错的ios源代码源代码,EarthWarrior3D游戏源代码. 而且游戏源码支持多平台. 适用于cocos v2.1.0.0版本号源代码下载: http://code.662p.com/ ...

OpenCV:OpenCV目标检测Hog+SWindow源代码分析

OpenCV:OpenCV目标检测Hog+SWindow源代码分析的更多相关文章

随机推荐

热门专题