The code comes from this GitHub repository: https://github.com/endernewton/tf-faster-rcnn.git

Problems I ran into when running it myself:

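1. The dataset:

The author implemented dataset interfaces for VOC and COCO. Since I wanted to train on my own data, I would have had to write a new data interface. To keep things simple, I converted my data to the VOC format and reused the existing VOC interface, pascal_voc.py.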

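The VOC data format requires the following files:

data
-- VOCdevkit2007 (you can change this name)
---- VOC2007
------ Annotations (object annotation files, .xml)
------ ImageSets
-------- trainval.txt (names of the images used for training)
-------- test.txt (names of the images used for testing)
------ JPEGImages (.jpg images)

In the official VOC layout, the list files sit in an ImageSets/Main subfolder.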
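A minimal sketch of setting up this directory tree, assuming Python 3. The function name make_voc_layout and its list_subdir parameter are made up for illustration; the default "Main" follows the official VOC layout, and passing "" puts the lists directly under ImageSets as in the listing above.

```python
import os
import tempfile


def make_voc_layout(root, train_names, test_names, list_subdir="Main"):
    """Create an empty VOC-style tree under root and write the image lists.

    Hypothetical helper: list_subdir="Main" mirrors the official VOC layout;
    use list_subdir="" to write the lists directly under ImageSets.
    """
    voc = os.path.join(root, "VOCdevkit2007", "VOC2007")
    list_dir = os.path.join(voc, "ImageSets", list_subdir)
    for d in (os.path.join(voc, "Annotations"),
              os.path.join(voc, "JPEGImages"),
              list_dir):
        os.makedirs(d, exist_ok=True)
    # One image name (without extension) per line
    for split, names in (("trainval", train_names), ("test", test_names)):
        with open(os.path.join(list_dir, split + ".txt"), "w") as f:
            f.write("\n".join(names) + "\n")
    return voc


# Demo in a temporary directory so nothing real is touched
demo_root = tempfile.mkdtemp()
demo_voc = make_voc_layout(demo_root, ["000001", "000002"], ["000003"])
print(sorted(os.listdir(demo_voc)))
```

The annotation files and images still have to be copied into Annotations and JPEGImages by hand or by your own conversion script.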

How exactly you write the .xml files depends on the data you already have.

The main part of writing an XML file:

from xml.dom.minidom import Document

# Placeholder values; substitute your own box and output path
image_box = [107, 155, 193, 214]  # [xmin, ymin, xmax, ymax]
filename = 'example.xml'

doc = Document()
annotation = doc.createElement('annotation')  # create the annotation element
doc.appendChild(annotation)  # attach it as the document root

obj = doc.createElement('object')
annotation.appendChild(obj)  # append the element itself, not the string 'object'

# Write name
obj_name = doc.createElement('name')
obj_name.appendChild(doc.createTextNode('abc'))  # the class name
obj.appendChild(obj_name)

# Write difficult; it is not used here, but pascal_voc.py fails without it
obj_difficult = doc.createElement('difficult')
obj_difficult.appendChild(doc.createTextNode('0'))
obj.appendChild(obj_difficult)

# Write the box
bndbox = doc.createElement('bndbox')
obj.appendChild(bndbox)
for tag, value in zip(('xmin', 'ymin', 'xmax', 'ymax'), image_box):
    coord = doc.createElement(tag)
    coord.appendChild(doc.createTextNode(str(value)))
    bndbox.appendChild(coord)

with open(filename, 'w') as f:
    f.write(doc.toprettyxml(indent='   '))

This produces (XML declaration line omitted):

<annotation>
   <object>
      <name>abc</name>
      <difficult>0</difficult>
      <bndbox>
         <xmin>107</xmin>
         <ymin>155</ymin>
         <xmax>193</xmax>
         <ymax>214</ymax>
      </bndbox>
   </object>
</annotation>

Then edit pascal_voc.py: set your own classes, and adjust the names of the XML fields it reads, and so on.
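As a rough illustration of what those two changes involve, the sketch below defines a class tuple with the background class first and reads the name and bndbox fields back out of an annotation. It is not the code in pascal_voc.py: the class names, the helper load_objects, and the sample XML are made up for the example, and putting '__background__' at index 0 is an assumption about the convention that file follows.

```python
import xml.etree.ElementTree as ET

# Hypothetical class list; replace with the names used in your own XML files.
# Assumption: the background class comes first, so it gets index 0.
MY_CLASSES = ('__background__', 'abc', 'def')
class_to_ind = {name: i for i, name in enumerate(MY_CLASSES)}
num_classes = len(MY_CLASSES)

SAMPLE_XML = """
<annotation>
   <object>
      <name>abc</name>
      <difficult>0</difficult>
      <bndbox>
         <xmin>107</xmin>
         <ymin>155</ymin>
         <xmax>193</xmax>
         <ymax>214</ymax>
      </bndbox>
   </object>
</annotation>
"""


def load_objects(xml_text, class_to_ind):
    """Return a list of (class_index, [xmin, ymin, xmax, ymax]) tuples."""
    root = ET.fromstring(xml_text)
    objects = []
    for obj in root.findall('object'):
        name = obj.find('name').text.strip()
        box = obj.find('bndbox')
        coords = [int(box.find(tag).text)
                  for tag in ('xmin', 'ymin', 'xmax', 'ymax')]
        objects.append((class_to_ind[name], coords))
    return objects


print(load_objects(SAMPLE_XML, class_to_ind))
```

If your XML uses different tag names, the strings passed to find and findall are the places to change.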

2. Once the data was ready, I could start training, and this error appeared:

Assign requires shapes of both tensors to match. lhs shape= [2048,124] rhs shape= [2048,84]

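This is because I now have 30 classes: 30 + 1 (background) = 31, and 31 * 4 = 124 (4 box coordinates per class), whereas the checkpoint being loaded has 84 outputs, i.e. 21 classes * 4 (the 20 VOC classes plus background).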
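The arithmetic behind the two shapes in the error message can be checked in a couple of lines. The function name is made up for the example.

```python
def bbox_pred_outputs(num_foreground_classes, coords_per_box=4):
    """Width of the box regression layer: one 4-value box per class,
    counting the background as an extra class."""
    return (num_foreground_classes + 1) * coords_per_box


print(bbox_pred_outputs(30))  # my dataset, the lhs width in the error
print(bbox_pred_outputs(20))  # 20 VOC classes, the rhs width in the error
```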

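How do you change the number of output classes? In Caffe you can change it directly in the network definition in the prototxt, but how do you do it in TensorFlow?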

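Running train_faster_rcnn with (gpuId, dataset, net) calls tools/trainval_net.py.
trainval_net.py sets net=resnetv1, loads the network model, and calls models/train_net.
train_net calls the train_model function, which defines the computation graph; the sess is initialized in the initialize function: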

  def initialize(self, sess):
    # Initial file lists are empty
    np_paths = []
    ss_paths = []
    # Fresh train directly from ImageNet weights
    print('Loading initial model weights from {:s}'.format(self.pretrained_model))
    variables = tf.global_variables()
    # Initialize all variables first
    sess.run(tf.variables_initializer(variables, name='init'))
    var_keep_dic = self.get_variables_in_checkpoint_file(self.pretrained_model)
    # Get the variables to restore, ignoring the variables to fix
    variables_to_restore = self.net.get_variables_to_restore(variables, var_keep_dic)
    # The variables to load
    restorer = tf.train.Saver(variables_to_restore)
    # Do the loading; this is where the error occurs
    restorer.restore(sess, self.pretrained_model)
    print('Loaded.')
    # Need to fix the variables before loading, so that the RGB weights are changed to BGR
    # For VGG16 it also changes the convolutional weights fc6 and fc7 to
    # fully connected weights
    self.net.fix_variables(sess, self.pretrained_model)
    print('Fixed.')
    last_snapshot_iter = 0
    rate = cfg.TRAIN.LEARNING_RATE
    stepsizes = list(cfg.TRAIN.STEPSIZE)
    return rate, last_snapshot_iter, stepsizes, np_paths, ss_paths

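To fix this, do not load the final class prediction layer and the box regression layer.

Filter the variables to be restored accordingly, and then you can train on your own data.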
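One possible way to make that selection, sketched in plain Python so that it runs without TensorFlow: keep a variable only if the checkpoint contains it with the same shape. This is not the repository's code. The function name select_restorable and the variable names in the demo are hypothetical; in the real code the two dictionaries would come from the graph variables and from the checkpoint's name-to-shape map.

```python
def select_restorable(model_shapes, ckpt_shapes):
    """Split model variables into those that can be restored and those that cannot.

    model_shapes, ckpt_shapes: dicts mapping variable name -> shape (list of ints).
    A variable is restorable only if the checkpoint has the same name and shape.
    """
    keep, skipped = [], []
    for name, shape in model_shapes.items():
        if name in ckpt_shapes and list(ckpt_shapes[name]) == list(shape):
            keep.append(name)
        else:
            skipped.append(name)
    return sorted(keep), sorted(skipped)


# Hypothetical variable names and shapes, for illustration only
model_shapes = {
    'resnet_v1_101/conv1/weights': [7, 7, 3, 64],
    'resnet_v1_101/cls_score/weights': [2048, 31],
    'resnet_v1_101/bbox_pred/weights': [2048, 124],
}
ckpt_shapes = {
    'resnet_v1_101/conv1/weights': [7, 7, 3, 64],
    'resnet_v1_101/cls_score/weights': [2048, 21],
    'resnet_v1_101/bbox_pred/weights': [2048, 84],
}

keep, skipped = select_restorable(model_shapes, ckpt_shapes)
print('restore:', keep)
print('skip:', skipped)
```

Because initialize runs the variables initializer before restoring, the skipped layers simply keep their fresh initial values and are trained from scratch on the new classes.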
