[TensorFlow] Getting the output coordinates of a model trained with the object detection API

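Google's open-source object detection API provides pretrained weights for five network architectures that can be fine-tuned, which makes it easy to train a model for your own detection task. This post explains in detail how to obtain the coordinates of the detection boxes after exporting a trained model. If you are not familiar with training a model with the object detection API, see this post: https://www.cnblogs.com/White-xzx/p/9503203.html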

Create a test file named object_detection_test.py. The script loads the trained model file and a test image and runs detection on it. The code is as follows:

import sys

import numpy as np
import tensorflow as tf
from matplotlib import pyplot as plt
from PIL import Image

# Needed because the script lives inside the object_detection folder.
sys.path.append("..")
from utils import label_map_util
from utils import visualization_utils as vis_util

# Folder where the model files were saved during training.
MODEL_NAME = 'data'

# Path to the frozen detection graph: the .pb model file exported after training.
PATH_TO_CKPT = MODEL_NAME + '/frozen_inference_graph.pb'

# The label_map.pbtxt file, used to attach the correct label to each box.
PATH_TO_LABELS = 'E:/TensorFlow/Box-object-detection/data/label_map.pbtxt'

NUM_CLASSES = 2  # total number of classes

# Load the (frozen) TensorFlow model into memory (TensorFlow 1.x API).
detection_graph = tf.Graph()
with detection_graph.as_default():
  od_graph_def = tf.GraphDef()
  with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:
    serialized_graph = fid.read()
    od_graph_def.ParseFromString(serialized_graph)
    tf.import_graph_def(od_graph_def, name='')

# Load the label map.
label_map = label_map_util.load_labelmap(PATH_TO_LABELS)
categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES, use_display_name=True)
category_index = label_map_util.create_category_index(categories)

# Helper: convert a PIL image to a uint8 array of shape (height, width, 3).
def load_image_into_numpy_array(image):
  # convert('RGB') drops any alpha channel, so PNG inputs also end up with 3 channels.
  return np.array(image.convert('RGB'), dtype=np.uint8)

# Path to the test image, passed as the first command-line argument.
TEST_IMAGE = sys.argv[1]
print("the test image is:", TEST_IMAGE)

# Size, in inches, of the output image.
IMAGE_SIZE = (12, 8)

with detection_graph.as_default():
  with tf.Session(graph=detection_graph) as sess:
    image = Image.open(TEST_IMAGE)  # open the image
    # The array representation is used later to draw the boxes and labels.
    image_np = load_image_into_numpy_array(image)
    # Expand dimensions since the model expects images of shape [1, None, None, 3].
    image_np_expanded = np.expand_dims(image_np, axis=0)
    # Input image tensor.
    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
    # Each box is a part of the image where a particular object was detected.
    boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
    # Each score is the confidence (probability) of the corresponding box.
    # It is shown on the result image together with the class label.
    scores = detection_graph.get_tensor_by_name('detection_scores:0')
    # Class IDs, matching the IDs in the label map.
    classes = detection_graph.get_tensor_by_name('detection_classes:0')
    # Total number of detections.
    num_detections = detection_graph.get_tensor_by_name('num_detections:0')

    # Run the detection.
    (boxes, scores, classes, num_detections) = sess.run(
        [boxes, scores, classes, num_detections],
        feed_dict={image_tensor: image_np_expanded})

    # Draw the detection results on the image.
    vis_util.visualize_boxes_and_labels_on_image_array(
        image_np,
        np.squeeze(boxes),
        np.squeeze(classes).astype(np.int32),
        np.squeeze(scores),
        category_index,
        use_normalized_coordinates=True,
        line_thickness=8)

    print(boxes)           # coordinates of the detection boxes
    print(scores)          # probability of each detection box
    print(classes)         # class of each detection box
    print(category_index)  # class index, a nested dict

    # Count the detection boxes whose probability exceeds 50%.
    final_score = np.squeeze(scores)
    count = int(np.sum(final_score > 0.5))
    print("the count of objects is: ", count)

    plt.figure(figsize=IMAGE_SIZE)
    plt.imshow(image_np)
    plt.show()

Open cmd and run the following command:

python object_detection_test.py ./test_images/2.png

The script prints several results, which are described in turn here.

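The boxes output holds the coordinates of the detection boxes. These values are normalized: each pixel coordinate is divided by the corresponding image dimension (height for y, width for x), which yields a fraction. Each box is ordered as [ymin, xmin, ymax, xmax], i.e. the top-left and bottom-right corners of the detection box.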

The scores output holds the probability of each detection box.
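As a small illustration of working with these probabilities, the sketch below selects the detections above a confidence threshold with NumPy. The scores array is a made-up stand-in for the model output (shaped [1, N], as returned by sess.run), and the helper name select_confident is hypothetical; treat it as a sketch under those assumptions rather than part of the original script.

```python
import numpy as np

def select_confident(scores, threshold=0.5):
    """Return the indices of detections whose score exceeds the threshold."""
    flat = np.squeeze(scores, axis=0)  # [1, N] -> [N]
    return np.flatnonzero(flat > threshold)

# Hypothetical scores for four detections (not real model output).
scores = np.array([[0.98, 0.75, 0.31, 0.02]], dtype=np.float32)
keep = select_confident(scores)
print(keep.tolist())  # [0, 1]
print(len(keep))      # 2
```

Counting with a boolean mask avoids hard-coding the number of detections the model returns.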

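The classes output and category_index give the label index of each box and the label that each index stands for. For example, the first box has index 1, and index 1 maps to the label name "box", so the object inside that detection box is a box.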
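The lookup from class ID to label name can be sketched as follows. The category_index here is a hand-written stand-in with the same nested-dict shape; only the mapping from index 1 to "box" comes from this post, while the second class and the helper name class_names are assumptions made for illustration.

```python
def class_names(classes, category_index):
    """Map detected class IDs (floats from the model) to label names."""
    names = []
    for class_id in classes:
        entry = category_index.get(int(class_id))
        names.append(entry['name'] if entry else 'unknown')
    return names

# Hypothetical category index; the second class is made up.
category_index = {
    1: {'id': 1, 'name': 'box'},
    2: {'id': 2, 'name': 'other'},
}
names = class_names([1.0, 2.0, 1.0], category_index)
print(names)  # ['box', 'other', 'box']
```

Casting each ID to int keeps the dictionary lookup explicit, since the model returns class IDs as floats.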

The detection image shows the boxes drawn on the test picture.

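Because the source code divides the coordinates by the image width and height, the printed values are fractions. To recover pixel coordinates, multiply each value by the corresponding dimension: height for ymin and ymax, width for xmin and xmax. For the detection image above, the calculation gives

[ymin, xmin, ymax, xmax] = [614.4, 410.4, 764.16, 569.16], which is close to the y-axis coordinates that pyplot displays (marked with a red line in the image).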
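The multiplication described above can be sketched with NumPy as follows. The box values and the 640x480 image size are hypothetical numbers chosen for easy arithmetic, not the ones behind the result in this post, and to_pixel_boxes is an illustrative helper name.

```python
import numpy as np

def to_pixel_boxes(boxes, im_width, im_height):
    """Scale normalized [ymin, xmin, ymax, xmax] boxes to pixel units."""
    # y values scale with the height, x values with the width.
    scale = np.array([im_height, im_width, im_height, im_width], dtype=np.float64)
    return np.asarray(boxes, dtype=np.float64) * scale

# Hypothetical normalized box and image size.
boxes = [[0.25, 0.5, 0.75, 1.0]]
pixels = to_pixel_boxes(boxes, im_width=640, im_height=480)
print(pixels.tolist())  # [[120.0, 320.0, 360.0, 640.0]]
```

Note that PIL's image.size returns (width, height), so the two values must be passed in the right order.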

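Next, a small change to the test code above is enough to get the coordinates we want, for example the center of each detected object. The code is as follows: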

import sys
import time

import numpy as np
import tensorflow as tf
from matplotlib import pyplot as plt
from PIL import Image

# Needed because the script lives inside the object_detection folder.
sys.path.append("..")
from utils import label_map_util
from utils import visualization_utils as vis_util

# Folder containing model.ckpt and the frozen_inference_graph.pb file.
MODEL_NAME = 'E:/Project/object-detection-Game-2018-5-31/data-20180607'

# Path to the frozen detection graph. This is the actual model used for detection.
PATH_TO_CKPT = MODEL_NAME + '/frozen_inference_graph.pb'

# Label map used to attach the correct label to each box.
PATH_TO_LABELS = MODEL_NAME + '/label_map.pbtxt'

NUM_CLASSES = 6

# The timer starts here, so the reported time includes loading the model.
start = time.time()

# Load the (frozen) TensorFlow model into memory (TensorFlow 1.x API).
detection_graph = tf.Graph()
with detection_graph.as_default():
  od_graph_def = tf.GraphDef()
  # Read the frozen graph file and import it into the graph.
  with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:
    serialized_graph = fid.read()
    od_graph_def.ParseFromString(serialized_graph)
    tf.import_graph_def(od_graph_def, name='')

# Load the label map.
label_map = label_map_util.load_labelmap(PATH_TO_LABELS)
categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES, use_display_name=True)
category_index = label_map_util.create_category_index(categories)

# Helper: convert a PIL image to a uint8 array of shape (height, width, 3).
def load_image_into_numpy_array(image):
  # convert('RGB') drops any alpha channel, so PNG inputs also end up with 3 channels.
  return np.array(image.convert('RGB'), dtype=np.uint8)

# Path to the test image, passed as the first command-line argument.
TEST_IMAGE = sys.argv[1]
print("the test image is:", TEST_IMAGE)

# Size, in inches, of the output image.
IMAGE_SIZE = (12, 8)

with detection_graph.as_default():
  with tf.Session(graph=detection_graph) as sess:
    image = Image.open(TEST_IMAGE)
    # The array representation is used later to draw the boxes and labels.
    image_np = load_image_into_numpy_array(image)
    # Expand dimensions since the model expects images of shape [1, None, None, 3].
    image_np_expanded = np.expand_dims(image_np, axis=0)

    # Input and output tensors of the detection graph.
    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
    # Each box is a part of the image where a particular object was detected.
    boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
    # Each score is the confidence of the corresponding box.
    scores = detection_graph.get_tensor_by_name('detection_scores:0')
    classes = detection_graph.get_tensor_by_name('detection_classes:0')
    num_detections = detection_graph.get_tensor_by_name('num_detections:0')

    # Run the detection.
    (boxes, scores, classes, num_detections) = sess.run(
        [boxes, scores, classes, num_detections],
        feed_dict={image_tensor: image_np_expanded})

    # Draw the detection results on the image.
    vis_util.visualize_boxes_and_labels_on_image_array(
        image_np,
        np.squeeze(boxes),
        np.squeeze(classes).astype(np.int32),
        np.squeeze(scores),
        category_index,
        use_normalized_coordinates=True,
        line_thickness=8)

    # Count the detection boxes whose probability exceeds 50%.
    final_score = np.squeeze(scores)
    count = int(np.sum(final_score > 0.5))
    print()
    print("the count of objects is: ", count)

    (im_width, im_height) = image.size
    # This loop relies on the detections being sorted by descending score,
    # so the first `count` entries are the ones above the threshold.
    for i in range(count):
      # Convert the normalized coordinates back to pixels.
      y_min = boxes[0][i][0] * im_height
      x_min = boxes[0][i][1] * im_width
      y_max = boxes[0][i][2] * im_height
      x_max = boxes[0][i][3] * im_width
      name = category_index[int(classes[0][i])]['name']
      print("object{0}: {1}".format(i, name),
            ',Center_X:', int((x_min + x_max) / 2),
            ',Center_Y:', int((y_min + y_max) / 2))

    end = time.time()
    seconds = end - start
    print("Time taken : {0} seconds".format(seconds))

    # Uncomment to display the annotated image.
    # plt.figure(figsize=IMAGE_SIZE)
    # plt.imshow(image_np)
    # plt.show()

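When run, the script prints the class name and center coordinates of each detected object, followed by the time taken.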
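The center computation can also be isolated into a small function that is easy to check without a trained model. This is a sketch under assumptions: box_centers is a hypothetical helper name, and the boxes and the 640x480 image size are made-up values rather than real detections.

```python
import numpy as np

def box_centers(boxes, im_width, im_height):
    """Return integer (center_x, center_y) pixel coordinates for normalized boxes."""
    boxes = np.asarray(boxes, dtype=np.float64).reshape(-1, 4)
    # Boxes are ordered [ymin, xmin, ymax, xmax].
    center_y = (boxes[:, 0] + boxes[:, 2]) / 2 * im_height
    center_x = (boxes[:, 1] + boxes[:, 3]) / 2 * im_width
    return [(int(x), int(y)) for x, y in zip(center_x, center_y)]

# Hypothetical normalized boxes and image size.
boxes = [[0.25, 0.5, 0.75, 1.0], [0.0, 0.0, 0.5, 0.5]]
centers = box_centers(boxes, im_width=640, im_height=480)
print(centers)  # [(480, 240), (160, 120)]
```

With the script's output, one would pass boxes[0][:count] together with the width and height from image.size.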

Please credit the source when reposting: https://www.cnblogs.com/White-xzx/p/9508535.html
