解决 Faster R-CNN 图片中框不在一张图片上显示的问题

解决 Faster R-CNN 图片中框不在一张图片上显示的问题

发现问题

在使用demo.py的时候，选取测试用的图片，放到demo，然后修改demo.py中对应的图片名称，然后进行测试：

发现：图片中被框出来的部分并没有完全到一张图片上去，经过多张图片的测试，可以发现，并不是一张图片上一个框，而是按照类别进行的划分，即：每一类一张图片

如何解决这个问题？

原先的画图部分主要在这里：

def vis_detections(im, class_name, dets, thresh=0.5):

    """Draw detected bounding boxes."""

    inds = np.where(dets[:, -1] >= thresh)[0]

    if len(inds) == 0:

        return

    im = im[:, :, (2, 1, 0)]

    fig, ax = plt.subplots(figsize=(12, 12))

    ax.imshow(im, aspect='equal')

    for i in inds:

        bbox = dets[i, :4]

        score = dets[i, -1]

        ax.add_patch(

            plt.Rectangle((bbox[0], bbox[1]),

                          bbox[2] - bbox[0],

                          bbox[3] - bbox[1], fill=False,

                          edgecolor='red', linewidth=3.5)

            )

        ax.text(bbox[0], bbox[1] - 2,

                '{:s} {:.3f}'.format(class_name, score),

                bbox=dict(facecolor='blue', alpha=0.5),

                fontsize=14, color='white')

    ax.set_title(('{} detections with '

                  'p({} | box) >= {:.1f}').format(class_name, class_name,

                                                  thresh),

                  fontsize=14)

    plt.axis('off')

    plt.tight_layout()

    plt.draw()

def demo(net, image_name):

    """Detect object classes in an image using pre-computed object proposals."""

    # Load the demo image

    im_file = os.path.join(cfg.DATA_DIR, 'demo', image_name)

    im = cv2.imread(im_file)

    # Detect all object classes and regress object bounds

    timer = Timer()

    timer.tic()

    scores, boxes = im_detect(net, im)

    timer.toc()

    print ('Detection took {:.3f}s for '

           '{:d} object proposals').format(timer.total_time, boxes.shape[0])

    # Visualize detections for each class

    CONF_THRESH = 0.8

    NMS_THRESH = 0.3

    for cls_ind, cls in enumerate(CLASSES[1:]):

        cls_ind += 1 # because we skipped background

        cls_boxes = boxes[:, 4*cls_ind:4*(cls_ind + 1)]

        cls_scores = scores[:, cls_ind]

        dets = np.hstack((cls_boxes,

                          cls_scores[:, np.newaxis])).astype(np.float32)

        keep = nms(dets, NMS_THRESH)

        dets = dets[keep, :]

        vis_detections(im, cls, dets, thresh=CONF_THRESH)

修改为:

# 将检测可视化

def vis_detections(ax,im, class_name, dets, thresh=0.5):

    """Draw detected bounding boxes."""

    #print("+_+")

    #print(class_name,dets,thresh)

    inds = np.where(dets[:, -1] >= thresh)[0]

    print("!!!")

    #print(inds) # 是否检测出来东西，如果有的话为0如果没有为空

    if len(inds) == 0:

        return

    #print(im.shape)  # 4000 6000 3

    #调整通道顺序，如果不调整通道顺序，图像就不正常

    for i in inds:

        bbox = dets[i, :4]

        score = dets[i, -1]

        #print(bbox[0],bbox[1],bbox[2],bbox[3])

        print("add one patch")

        ax.add_patch(

            plt.Rectangle((bbox[0], bbox[1]),

                          bbox[2] - bbox[0],

                          bbox[3] - bbox[1], fill=False,

                          edgecolor='red', linewidth=2)

            )

        ax.text(bbox[0], bbox[1] - 2,

                '{:s} {:.3f}'.format(class_name, score),

                bbox=dict(facecolor='white', alpha=0.9),

                fontsize=8, color='black')

    ax.set_title(('{} detections with '

                  'p({} | box) >= {:.1f}').format(class_name, class_name,thresh),fontsize=12)

def demo(sess, net, image_name):

    """Detect object classes in an image using pre-computed object proposals."""

    # Load the demo image

    im_file = os.path.join(cfg.FLAGS2["data_dir"], 'demo', image_name)

    im = cv2.imread(im_file)

    # Detect all object classes and regress object bounds

    timer = Timer()

    timer.tic()

    # detect the picture to find score and boxes

    scores, boxes = im_detect(sess, net, im)

    # 检测主体部分,在这里加上save_feature_picture

    # 这里的net内容是vgg

    timer.toc()

    print('Detection took {:.3f}s for {:d} object proposals'.format(timer.total_time, boxes.shape[0]))

    # Visualize detections for each class

    CONF_THRESH = 0.8

    NMS_THRESH = 0.3

    im = im[:, :, (2, 1, 0)]

    fig, ax = plt.subplots(figsize=(10,10))

    ax.imshow(im, aspect='equal')

    for cls_ind, cls in enumerate(CLASSES[1:]):

        cls_ind += 1  # because we skipped background

        cls_boxes = boxes[:, 4 * cls_ind:4 * (cls_ind + 1)]

        cls_scores = scores[:, cls_ind]

        dets = np.hstack((cls_boxes,

                          cls_scores[:, np.newaxis])).astype(np.float32)

        keep = nms(dets, NMS_THRESH)

        dets = dets[keep, :]

        vis_detections(ax,im, cls, dets, thresh=CONF_THRESH)

        plt.draw()

点拨：一开始的时候我也没有发现这两个有什么区别，后来发现图片是按照类别进行区分的以后，就可以看出，demo函数中是按照类别进行划分的，所以只要修改plt位置，将plt从vis_detections中放到demo中，这样所有类都会花在同一个plt中，而不会分开了，这样就解决了这个问题。

参考issues

How detect multiple object on same picture?

解决 Faster R-CNN 图片中框不在一张图片上显示的问题的更多相关文章

解决f.lux总是弹框定位
解决f.lux总是弹框定位,直接导入成功定位的注册表文件即可. 以下保存为f.lux.reg 双击导入即可. Windows Registry Editor Version 5.00 [HKEY_CU ...
JavaScript解决select下拉框中的内容太长显示不全的问题
JavaScript解决select下拉框中的内容太长显示不全的问题 1.说明有些情况下,select下拉框的内容过长,导致部分看不见: 现在通过鼠标事件,让下拉框中的内容显示完全 2.实现源码 & ...
（数据科学学习手札07）R在数据框操作上方法的总结（初级篇）
上篇我们了解了Python中pandas内封装的关于数据框的常用操作方法,而作为专为数据科学而生的一门语言,R在数据框的操作上则更为丰富精彩,本篇就R处理数据框的常用方法进行总结: 1.数据框的生成 ...
解决32位plsql客户端连接不64位Oracle11g上数据库
一.解决方案因为本人安装的是64位的Oracle,plsql 是32位的故连接不上.网上有方法能连接. 1. 文件下载下载PLSQL_Developer地址 http://pan.baidu.co ...
Flash Stage3D 在2D UI 界面上显示3D模型问题完美解决
一直以来很多Stage3D开发者都在为3D模型在2DUI上显示的问题头疼.Stage3D一直是在 Stage2D下面.为了做到3D模型在2DUI上显示通常大家有几种实现方式,下面来说说这几种实现方式吧 ...
解决pycharm左侧项目文件名中文字体乱码情况？中文显示口口口口.
解决pycharm左侧项目文件名中文字体乱码情况?中文显示口口口口. 点击file,进入settings 出现 Appearance & Behavior 点击Appearance UI Op ...
解决boostrap中，iframe渲染下，苹果手机横向无法显示剩余内容问题
描述: 问题解决了,采用的手势拖动显示剩余内容,并不是有了横向滚动条在head标签中加入 <head> <meta charset="utf-8"> &l ...
如何用html把文本框外观格式设为只显示底部的横线
html把文本框外观格式设为只显示底部的横线 <style> input[type='text']{background:none;border:none;border-bottom:1p ...
解决Tomcat6解压版在64位windows系统上无法启动服务的问题
解决Tomcat6解压版在64位windows系统上无法启动服务的问题由于客户环境为64位windows系统,开发环境一直用32位.tomcat使用6.0.20非安装版.部署时发现在 ...

随机推荐

Resin任意文件读取漏洞
Resin是什么虽然看不上但是还是原因下百度百科: Resin是CAUCHO公司的产品,是一个非常流行的支持servlets和jsp的引擎,速度非常快.Resin本身包含了一个支持HTTP/1.1的 ...
C#8.0中的 await foreach
AsyncStreamsInCShaper 8.0 C# 8.0中支持异步返回枚举类型async Task<IEnumerable<T>> sync Streams这个功能已经 ...
【BZOJ4418】[Shoi2013]扇形面积并扫描线+线段树
[BZOJ4418][Shoi2013]扇形面积并 Description 给定N个同心的扇形,求有多少面积,被至少K个扇形所覆盖. Input 第一行是三个整数n,m,k.n代表同心扇形的个数,m用 ...
shell脚本抓取网页信息
利用shell脚本分析网站数据 # define url time=$(date +%F) mtime=$(date +%T) file=/abc/shell/abc/abc_$time.log ht ...
mysql max_allowed_packet参数值改大后，莫名被还原
mysql数据库用innodb引擎,mysql max_allowed_packet在my.cnf中值加大后,够一段时间,系统会莫名把这个参数的值改小. innodb_buffer_pool_size ...
收集一些常用的CDN链接！无需下载快速使用！
一些常用的CDN链接,可以到这里看: http://www.bootcdn.cn/ 这个网站查找资源的方式很简单,后缀加上要查找的名字即可: 例如: http://www.bootcdn.cn/boo ...
[MongoDB] 安装MongoDB配置Replica Set
MongoDB的环境主要包括StandAlone,Replication和Sharding. StandAlone:单机环境,一般开发测试的时候用. Replication:主从结构,一个Primar ...
Jenkins的参数化构建
一.参数化构建日志 1.查看效果有时候开发需要查看服务器日志,传统的是需要运维登录服务器拉取开发所需要的服务日志,这么做的弊端是:1.如果日志比较大,拉取耗费时间.占用服务器资源.2.占用运维不必要 ...
CentOS7使用yum安装LNMP环境以后无法打开php页面
CentOS7使用yum安装LNMP环境以后无法打开php页面页面提示为File not found 查看nginx错误日志/var/log/nginx/error.log提示如下原因分析 ngi ...
003-spring cache-JCache (JSR-107) annotations
参看地址:https://docs.spring.io/spring/docs/current/spring-framework-reference/integration.html#cache-js ...

解决 Faster R-CNN 图片中框不在一张图片上显示的问题

解决 Faster R-CNN 图片中框不在一张图片上显示的问题

发现问题

如何解决这个问题？

参考issues

解决 Faster R-CNN 图片中框不在一张图片上显示的问题的更多相关文章

随机推荐

热门专题