How to Train a Model into a pb File with TensorFlow, and How to Load the Trained Model Back
This post covers how to use TensorFlow to train a model and export it as a pb file, and then how to reload that pb file for inference.
First, the author's GitHub repo:
https://github.com/ppplinday/tensorflow-vgg16-train-and-test
The pitcute folder holds the images: 15 dogs and 15 cats, 30 in total (don't judge — it's just a toy dataset for practice). train is the training script and test is the testing script.
The steps are walked through one by one below.
!!! ps: The author is still a beginner, so a few (read: many) small mistakes are inevitable — corrections from more experienced readers are very welcome.
- train
First, the training script. The first step, naturally, is reading the images.
def read_img(path):
    cate = [path + x for x in os.listdir(path) if os.path.isdir(path + x)]
    imgs = []
    labels = []
    for idx, folder in enumerate(cate):
        for im in glob.glob(folder + '/*.jpg'):
            print('reading the image: %s' % (im))
            img = io.imread(im)
            img = transform.resize(img, (w, h, c))
            imgs.append(img)
            labels.append(idx)
    return np.asarray(imgs, np.float32), np.asarray(labels, np.int32)

data, label = read_img(path)
io.imread reads each image, transform.resize scales it to VGG's input size of (224, 224, 3), and each image and its folder index are appended to imgs and labels respectively.
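One detail worth noting: skimage's transform.resize not only resizes but also converts the image to floats in the [0, 1] range by default, so no separate normalization step is needed here. A quick check (with a hypothetical image path):

from skimage import io, transform
img = io.imread('picture/dog/dog1.jpg')  # hypothetical path
resized = transform.resize(img, (224, 224, 3))
print(resized.dtype, resized.min(), resized.max())  # float64, values within [0, 1]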
num_example = data.shape[0]
arr = np.arange(num_example)
np.random.shuffle(arr)
data = data[arr]
label = label[arr]
This shuffles the image order: np.arange generates the index array 0..num_example-1, np.random.shuffle permutes it, and indexing with it writes the shuffled order back into data and label.
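The same shuffle can be written more compactly with np.random.permutation, which builds and shuffles the index array in one call — an equivalent sketch:

import numpy as np
arr = np.random.permutation(num_example)  # shuffled indices 0..num_example-1
data, label = data[arr], label[arr]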
ratio = 0.8
s = np.int(num_example * ratio)
x_train = data[:s]
y_train = label[:s]
x_val = data[s:]
y_val = label[s:]
80% of the data is used for training and the remaining 20% for validation (even though there are only 30 images in total...).
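If scikit-learn happens to be available, the shuffle and the split can be collapsed into a single call — an alternative sketch, not what the original script uses:

from sklearn.model_selection import train_test_split
x_train, x_val, y_train, y_val = train_test_split(data, label, test_size=0.2, random_state=42)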
def build_network(height, width, channel):
    x = tf.placeholder(tf.float32, shape=[None, height, width, channel], name='input')
    y = tf.placeholder(tf.int64, shape=[None, 2], name='labels_placeholder')
Next, build the VGG model itself. This step is not hard, but it pays to give every layer a name. The x and y above are the input placeholder and the label placeholder.
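The names matter because, once the graph is frozen into a pb file, tensors can only be looked up by name — no Python variables survive the export. The test script later does exactly this (':0' selects the first output of the named op):

input_x = sess.graph.get_tensor_by_name("input:0")        # the placeholder named 'input'
out_softmax = sess.graph.get_tensor_by_name("softmax:0")  # the node named 'softmax'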
finaloutput = tf.nn.softmax(output_fc8, name="softmax")
# softmax_cross_entropy_with_logits applies softmax internally, so it must be
# fed the raw fc8 output rather than the already-softmaxed finaloutput
cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=output_fc8, labels=y))
optimize = tf.train.AdamOptimizer(learning_rate=1e-4).minimize(cost)
prediction_labels = tf.argmax(finaloutput, axis=1, name="output")
# convert the one-hot labels to class indices so the comparison is well-defined
read_labels = tf.argmax(y, axis=1)
correct_prediction = tf.equal(prediction_labels, read_labels)
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))
correct_times_in_batch = tf.reduce_sum(tf.cast(correct_prediction, tf.int32))
return dict(
    x=x,
    y=y,
    optimize=optimize,
    correct_prediction=correct_prediction,
    correct_times_in_batch=correct_times_in_batch,
    cost=cost,
)
At the end of build_network comes the loss computation. finaloutput is the softmaxed output used at inference time, cost is the cross-entropy loss, and optimize defines how training updates the weights (Adam with a 1e-4 learning rate). Two fixes over the original code: the loss is fed the raw logits output_fc8 (softmax_cross_entropy_with_logits applies softmax itself), and the one-hot labels are argmaxed before being compared with the predicted labels. Also note the return value — the dict of handles is what train_network uses later.
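To see what the loss computes, take a single example with fc8 output [2.0, -1.0] and true label [1, 0]: the loss is -log(softmax(logits)[0]). A minimal NumPy check with these toy values:

import numpy as np
logits = np.array([2.0, -1.0])
probs = np.exp(logits) / np.exp(logits).sum()  # softmax
loss = -np.log(probs[0])                       # cross-entropy against label [1, 0]
print(probs, loss)  # probs ~ [0.953, 0.047], loss ~ 0.049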
Next comes the training loop.
def train_network(graph, batch_size, num_epochs, pb_file_path):
    init = tf.global_variables_initializer()
    with tf.Session() as sess:
        sess.run(init)
        epoch_delta = 2
        for epoch_index in range(num_epochs):
            for i in range(12):
                sess.run([graph['optimize']], feed_dict={
                    graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                    graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                })
That is really all the training code: loop over the epochs and feed the 12 training images one at a time, converting each label into a one-hot vector on the fly. The code that follows it (shown in the full listing below) checks the accuracy every few epochs.
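A condensed sketch of that evaluation block (the full version is in the complete listing below): every epoch_delta epochs it counts correct predictions over the training images and prints the running accuracy:

if epoch_index % epoch_delta == 0:
    correct = 0
    for i in range(12):  # the 12 training images, fed one at a time
        correct += sess.run(graph['correct_times_in_batch'], feed_dict={
            graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
            graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
        })
    print('epoch %d, train accuracy: %.2f%%' % (epoch_index, 100.0 * correct / 12))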
constant_graph = graph_util.convert_variables_to_constants(sess, sess.graph_def, ["output"])
with tf.gfile.FastGFile(pb_file_path, mode='wb') as f:
    f.write(constant_graph.SerializeToString())
These are the important lines: convert_variables_to_constants freezes the trained variables into the graph, keeping everything needed to compute the node named "output", and the serialized result is written out as the pb file. After the run finishes, the pb file shows up in the working directory.
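Once the file exists, a quick sanity check is to parse it back and list the node names — a minimal sketch, assuming the file was saved as vggs.pb; 'input', 'softmax' and 'output' should all appear in the list:

import tensorflow as tf

graph_def = tf.GraphDef()
with open('vggs.pb', 'rb') as f:
    graph_def.ParseFromString(f.read())
print([node.name for node in graph_def.node])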
- test
def recognize(jpg_path, pb_file_path):
    with tf.Graph().as_default():
        output_graph_def = tf.GraphDef()
        with open(pb_file_path, "rb") as f:
            output_graph_def.ParseFromString(f.read())
            _ = tf.import_graph_def(output_graph_def, name="")
This opens the pb file and imports the frozen graph into the current default graph.
img = io.imread(jpg_path)
img = transform.resize(img, (224, 224, 3))
img_out_softmax = sess.run(out_softmax, feed_dict={input_x: np.reshape(img, [-1, 224, 224, 3])})
Read the image file, resize it to the network's input size, and feed it into the graph's input placeholder; img_out_softmax then holds the model's softmax output.
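The softmax output is a [1, 2] array of class probabilities. Mapping it back to a class name just mirrors the folder order that read_img saw during training — the order below (cat = 0, dog = 1) is an assumption, since it depends on what os.listdir returns:

prediction = np.argmax(img_out_softmax, axis=1)[0]
classes = ['cat', 'dog']  # assumed folder enumeration order
print('predicted: %s (p=%.3f)' % (classes[prediction], img_out_softmax[0][prediction]))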
That is roughly the whole pipeline. This was a practice project, so there are bound to be plenty of small mistakes — corrections are welcome, haha!
Finally, the complete train and test scripts:
train
from PIL import Image
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import tensorflow as tf
import os
import glob
from skimage import io, transform
from tensorflow.python.framework import graph_util
import collections

path = '/home/zhoupeilin/vgg16/picture/'
w = 224
h = 224
c = 3

def read_img(path):
    cate = [path + x for x in os.listdir(path) if os.path.isdir(path + x)]
    imgs = []
    labels = []
    for idx, folder in enumerate(cate):
        for im in glob.glob(folder + '/*.jpg'):
            print('reading the image: %s' % (im))
            img = io.imread(im)
            img = transform.resize(img, (w, h, c))
            imgs.append(img)
            labels.append(idx)
    return np.asarray(imgs, np.float32), np.asarray(labels, np.int32)

data, label = read_img(path)

num_example = data.shape[0]
arr = np.arange(num_example)
np.random.shuffle(arr)
data = data[arr]
label = label[arr]

ratio = 0.8
s = np.int(num_example * ratio)
x_train = data[:s]
y_train = label[:s]
x_val = data[s:]
y_val = label[s:]

def build_network(height, width, channel):
    x = tf.placeholder(tf.float32, shape=[None, height, width, channel], name='input')
    y = tf.placeholder(tf.int64, shape=[None, 2], name='labels_placeholder')

    def weight_variable(shape, name="weights"):
        initial = tf.truncated_normal(shape, dtype=tf.float32, stddev=0.1)
        return tf.Variable(initial, name=name)

    def bias_variable(shape, name="biases"):
        initial = tf.constant(0.1, dtype=tf.float32, shape=shape)
        return tf.Variable(initial, name=name)

    def conv2d(input, w):
        return tf.nn.conv2d(input, w, [1, 1, 1, 1], padding='SAME')

    def pool_max(input):
        return tf.nn.max_pool(input,
                              ksize=[1, 2, 2, 1],
                              strides=[1, 2, 2, 1],
                              padding='SAME',
                              name='pool1')

    def fc(input, w, b):
        return tf.matmul(input, w) + b

    # conv1
    with tf.name_scope('conv1_1') as scope:
        kernel = weight_variable([3, 3, 3, 64])
        biases = bias_variable([64])
        output_conv1_1 = tf.nn.relu(conv2d(x, kernel) + biases, name=scope)

    with tf.name_scope('conv1_2') as scope:
        kernel = weight_variable([3, 3, 64, 64])
        biases = bias_variable([64])
        output_conv1_2 = tf.nn.relu(conv2d(output_conv1_1, kernel) + biases, name=scope)

    pool1 = pool_max(output_conv1_2)

    # conv2
    with tf.name_scope('conv2_1') as scope:
        kernel = weight_variable([3, 3, 64, 128])
        biases = bias_variable([128])
        output_conv2_1 = tf.nn.relu(conv2d(pool1, kernel) + biases, name=scope)

    with tf.name_scope('conv2_2') as scope:
        kernel = weight_variable([3, 3, 128, 128])
        biases = bias_variable([128])
        output_conv2_2 = tf.nn.relu(conv2d(output_conv2_1, kernel) + biases, name=scope)

    pool2 = pool_max(output_conv2_2)

    # conv3
    with tf.name_scope('conv3_1') as scope:
        kernel = weight_variable([3, 3, 128, 256])
        biases = bias_variable([256])
        output_conv3_1 = tf.nn.relu(conv2d(pool2, kernel) + biases, name=scope)

    with tf.name_scope('conv3_2') as scope:
        kernel = weight_variable([3, 3, 256, 256])
        biases = bias_variable([256])
        output_conv3_2 = tf.nn.relu(conv2d(output_conv3_1, kernel) + biases, name=scope)

    with tf.name_scope('conv3_3') as scope:
        kernel = weight_variable([3, 3, 256, 256])
        biases = bias_variable([256])
        output_conv3_3 = tf.nn.relu(conv2d(output_conv3_2, kernel) + biases, name=scope)

    pool3 = pool_max(output_conv3_3)

    # conv4
    with tf.name_scope('conv4_1') as scope:
        kernel = weight_variable([3, 3, 256, 512])
        biases = bias_variable([512])
        output_conv4_1 = tf.nn.relu(conv2d(pool3, kernel) + biases, name=scope)

    with tf.name_scope('conv4_2') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv4_2 = tf.nn.relu(conv2d(output_conv4_1, kernel) + biases, name=scope)

    with tf.name_scope('conv4_3') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv4_3 = tf.nn.relu(conv2d(output_conv4_2, kernel) + biases, name=scope)

    pool4 = pool_max(output_conv4_3)

    # conv5
    with tf.name_scope('conv5_1') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv5_1 = tf.nn.relu(conv2d(pool4, kernel) + biases, name=scope)

    with tf.name_scope('conv5_2') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv5_2 = tf.nn.relu(conv2d(output_conv5_1, kernel) + biases, name=scope)

    with tf.name_scope('conv5_3') as scope:
        kernel = weight_variable([3, 3, 512, 512])
        biases = bias_variable([512])
        output_conv5_3 = tf.nn.relu(conv2d(output_conv5_2, kernel) + biases, name=scope)

    pool5 = pool_max(output_conv5_3)

    # fc6
    with tf.name_scope('fc6') as scope:
        shape = int(np.prod(pool5.get_shape()[1:]))
        kernel = weight_variable([shape, 4096])
        biases = bias_variable([4096])
        pool5_flat = tf.reshape(pool5, [-1, shape])
        output_fc6 = tf.nn.relu(fc(pool5_flat, kernel, biases), name=scope)

    # fc7
    with tf.name_scope('fc7') as scope:
        kernel = weight_variable([4096, 4096])
        biases = bias_variable([4096])
        output_fc7 = tf.nn.relu(fc(output_fc6, kernel, biases), name=scope)

    # fc8
    with tf.name_scope('fc8') as scope:
        kernel = weight_variable([4096, 2])
        biases = bias_variable([2])
        output_fc8 = tf.nn.relu(fc(output_fc7, kernel, biases), name=scope)
    finaloutput = tf.nn.softmax(output_fc8, name="softmax")
    # softmax_cross_entropy_with_logits applies softmax internally, so it must be
    # fed the raw fc8 output rather than the already-softmaxed finaloutput
    cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=output_fc8, labels=y))
    optimize = tf.train.AdamOptimizer(learning_rate=1e-4).minimize(cost)
    prediction_labels = tf.argmax(finaloutput, axis=1, name="output")
    # convert the one-hot labels to class indices so the comparison is well-defined
    read_labels = tf.argmax(y, axis=1)
    correct_prediction = tf.equal(prediction_labels, read_labels)
    accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))
    correct_times_in_batch = tf.reduce_sum(tf.cast(correct_prediction, tf.int32))
    return dict(
        x=x,
        y=y,
        optimize=optimize,
        correct_prediction=correct_prediction,
        correct_times_in_batch=correct_times_in_batch,
        cost=cost,
    )
def train_network(graph, batch_size, num_epochs, pb_file_path):
    init = tf.global_variables_initializer()
    with tf.Session() as sess:
        sess.run(init)
        epoch_delta = 2
        for epoch_index in range(num_epochs):
            for i in range(12):
                sess.run([graph['optimize']], feed_dict={
                    graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                    graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                })
            if epoch_index % epoch_delta == 0:
                # each feed_dict above and below carries a single image, so the
                # number of examples seen equals the number of "batches"
                total_batches_in_train_set = 0
                total_correct_times_in_train_set = 0
                total_cost_in_train_set = 0.
                for i in range(12):
                    return_correct_times_in_batch = sess.run(graph['correct_times_in_batch'], feed_dict={
                        graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                    })
                    mean_cost_in_batch = sess.run(graph['cost'], feed_dict={
                        graph['x']: np.reshape(x_train[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_train[i] == 0 else [[0, 1]])
                    })
                    total_batches_in_train_set += 1
                    total_correct_times_in_train_set += return_correct_times_in_batch
                    total_cost_in_train_set += mean_cost_in_batch

                total_batches_in_test_set = 0
                total_correct_times_in_test_set = 0
                total_cost_in_test_set = 0.
                for i in range(3):
                    return_correct_times_in_batch = sess.run(graph['correct_times_in_batch'], feed_dict={
                        graph['x']: np.reshape(x_val[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_val[i] == 0 else [[0, 1]])
                    })
                    mean_cost_in_batch = sess.run(graph['cost'], feed_dict={
                        graph['x']: np.reshape(x_val[i], (1, 224, 224, 3)),
                        graph['y']: ([[1, 0]] if y_val[i] == 0 else [[0, 1]])
                    })
                    total_batches_in_test_set += 1
                    total_correct_times_in_test_set += return_correct_times_in_batch
                    total_cost_in_test_set += mean_cost_in_batch

                # accuracy over the number of examples actually fed (one per batch);
                # the original divided by batch_size as well, which under-reported it
                acy_on_test = total_correct_times_in_test_set / float(total_batches_in_test_set)
                acy_on_train = total_correct_times_in_train_set / float(total_batches_in_train_set)
                print('Epoch - {:2d}, acy_on_test:{:6.2f}%({}/{}), loss_on_test:{:6.2f}, '
                      'acy_on_train:{:6.2f}%({}/{}), loss_on_train:{:6.2f}'.format(
                          epoch_index, acy_on_test * 100.0,
                          total_correct_times_in_test_set, total_batches_in_test_set,
                          total_cost_in_test_set,
                          acy_on_train * 100.0,
                          total_correct_times_in_train_set, total_batches_in_train_set,
                          total_cost_in_train_set))

        constant_graph = graph_util.convert_variables_to_constants(sess, sess.graph_def, ["output"])
        with tf.gfile.FastGFile(pb_file_path, mode='wb') as f:
            f.write(constant_graph.SerializeToString())

def main():
    batch_size = 12
    num_epochs = 50
    pb_file_path = "vggs.pb"
    g = build_network(height=224, width=224, channel=3)
    train_network(g, batch_size, num_epochs, pb_file_path)

main()
test
import tensorflow as tf
import numpy as np
import PIL.Image as Image
from skimage import io, transform

def recognize(jpg_path, pb_file_path):
    with tf.Graph().as_default():
        output_graph_def = tf.GraphDef()
        with open(pb_file_path, "rb") as f:
            output_graph_def.ParseFromString(f.read())
            _ = tf.import_graph_def(output_graph_def, name="")

        with tf.Session() as sess:
            init = tf.global_variables_initializer()
            sess.run(init)

            # look up the tensors by the names assigned in build_network
            input_x = sess.graph.get_tensor_by_name("input:0")
            print(input_x)
            out_softmax = sess.graph.get_tensor_by_name("softmax:0")
            print(out_softmax)
            out_label = sess.graph.get_tensor_by_name("output:0")
            print(out_label)

            img = io.imread(jpg_path)
            img = transform.resize(img, (224, 224, 3))
            img_out_softmax = sess.run(out_softmax, feed_dict={input_x: np.reshape(img, [-1, 224, 224, 3])})

            print("img_out_softmax:", img_out_softmax)
            prediction_labels = np.argmax(img_out_softmax, axis=1)
            print("label:", prediction_labels)

recognize("vgg16/picture/dog/dog3.jpg", "vgg16/vggs.pb")