前一篇讲过环境的部署篇，这一次就讲讲从代码角度如何导出pb模型，如何进行服务调用。

1 hello world篇

部署完docker后，如果是cpu环境，可以直接拉取tensorflow/serving，如果是GPU环境则麻烦点，具体参考前一篇，这里就不再赘述了。

cpu版本的可以直接拉取tensorflow/serving，docker会自动拉取latest版本：

docker pull tensorflow/serving

如果想要指定tensorflow的版本，可以去这里查看：https://hub.docker.com/r/tensorflow/serving/tags/

比如我需要的是1.12.0版本的tf，那么也可以拉取指定的版本：

docker pull tensorflow/serving:1.12.0

拉取完镜像，需要下载一个hello world的程序代码。

mkdir -p /tmp/tfserving

cd /tmp/tfserving

git clone https://github.com/tensorflow/serving

tensorflow/serving的github中有对应的测试模型，模型其实就是 y = 0.5 * x + 2。即输入一个数，输出是对应的y。

运行下面的命令，在docker中部署服务：

docker run -p 8501:8501 --mount type=bind,source=/tmp/serving/tensorflow_serving/servables/tensorflow/testdata/saved_model_half_plus_two_cpu,target=/models/half_plus_two -e MODEL_NAME=half_plus_two -t tensorflow/serving &

上面的命令中，把/tmp/serving/tensorflow_serving/servables/tensorflow/testdata/saved_model_half_plus_two_cpu路径挂载到/models/half_plus_two,这样tensorflow_serving就可以加载models下的模型了，然后开放内部8501的http接口。

执行docker ps查看服务列表：

➜  ~ docker ps

CONTAINER ID        IMAGE                COMMAND                  CREATED             STATUS              PORTS                              NAMES

7decb4286057        tensorflow/serving   "/usr/bin/tf_serving…"   7 seconds ago       Up 6 seconds        8500/tcp, 0.0.0.0:8501->8501/tcp   eager_dewdney

发送一个http请求测试一下：

curl -d '{"instances": [1.0, 2.0, 5.0]}' -X POST http://localhost:8501/v1/models/half_plus_two:predict

{

    "predictions": [2.5, 3.0, 4.5

    ]

}%

2 mnist篇

由于前面的例子，serving工程下只有pb模型，没有模型的训练和导出，因此看不出其中的门道。这一部分就直接基于手写体识别的例子，展示一下如何从tensorflow训练代码导出模型，又如何通过grpc服务进行模型的调用。

训练和导出:

#! /usr/bin/env python

"""

训练并导出Softmax回归模型，使用SaveModel导出训练模型并添加签名。

"""

from __future__ import print_function

import os

import sys

# This is a placeholder for a Google-internal import.

import tensorflow as tf

import ssl

ssl._create_default_https_context = ssl._create_unverified_context

import basic.mnist_input_data as mnist_input_data

# 定义模型参数

tf.app.flags.DEFINE_integer('training_iteration', 10, 'number of training iterations.')

tf.app.flags.DEFINE_integer('model_version', 2, 'version number of the model.')

tf.app.flags.DEFINE_string('work_dir', './tmp', 'Working directory.')

FLAGS = tf.app.flags.FLAGS

def main(_):

    # 参数校验

    # if len(sys.argv) < 2 or sys.argv[-1].startswith('-'):

    #     print('Usage: mnist_saved_model.py [--training_iteration=x] '

    #           '[--model_version=y] export_dir')

    #     sys.exit(-1)

    # if FLAGS.training_iteration <= 0:

    #     print('Please specify a positive value for training iteration.')

    #     sys.exit(-1)

    # if FLAGS.model_version <= 0:

    #     print('Please specify a positive value for version number.')

    #     sys.exit(-1)

    # Train model

    print('Training model...')

    mnist = mnist_input_data.read_data_sets(FLAGS.work_dir, one_hot=True)

    sess = tf.InteractiveSession()

    serialized_tf_example = tf.placeholder(tf.string, name='tf_example')

    feature_configs = {'x': tf.FixedLenFeature(shape=[784], dtype=tf.float32), }

    tf_example = tf.parse_example(serialized_tf_example, feature_configs)

    x = tf.identity(tf_example['x'], name='x')  # use tf.identity() to assign name

    y_ = tf.placeholder('float', shape=[None, 10])

    w = tf.Variable(tf.zeros([784, 10]))

    b = tf.Variable(tf.zeros([10]))

    sess.run(tf.global_variables_initializer())

    y = tf.nn.softmax(tf.matmul(x, w) + b, name='y')

    cross_entropy = -tf.reduce_sum(y_ * tf.log(y))

    train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)

    values, indices = tf.nn.top_k(y, 10)

    table = tf.contrib.lookup.index_to_string_table_from_tensor(

        tf.constant([str(i) for i in range(10)]))

    prediction_classes = table.lookup(tf.to_int64(indices))

    for _ in range(FLAGS.training_iteration):

        batch = mnist.train.next_batch(50)

        train_step.run(feed_dict={x: batch[0], y_: batch[1]})

    correct_prediction = tf.equal(tf.argmax(y, 1), tf.argmax(y_, 1))

    accuracy = tf.reduce_mean(tf.cast(correct_prediction, 'float'))

    print('training accuracy %g' % sess.run(

        accuracy, feed_dict={

            x: mnist.test.images,

            y_: mnist.test.labels

        }))

    print('Done training!')

    # Export model

    # WARNING(break-tutorial-inline-code): The following code snippet is

    # in-lined in tutorials, please update tutorial documents accordingly

    # whenever code changes.

    # export_path_base = sys.argv[-1]

    export_path_base = "/Users/xingoo/PycharmProjects/ml-in-action/实践-tensorflow/01-官方文档-学习和使用ML/save_model"

    export_path = os.path.join(tf.compat.as_bytes(export_path_base), tf.compat.as_bytes(str(FLAGS.model_version)))

    print('Exporting trained model to', export_path)

    # 配置导出地址，创建SaveModel

    builder = tf.saved_model.builder.SavedModelBuilder(export_path)

    # Build the signature_def_map.

    # 创建TensorInfo，包含type,shape,name

    classification_inputs = tf.saved_model.utils.build_tensor_info(serialized_tf_example)

    classification_outputs_classes = tf.saved_model.utils.build_tensor_info(prediction_classes)

    classification_outputs_scores = tf.saved_model.utils.build_tensor_info(values)

    # 分类签名：算法类型+输入+输出（概率和名字）

    classification_signature = (

        tf.saved_model.signature_def_utils.build_signature_def(

            inputs={

                tf.saved_model.signature_constants.CLASSIFY_INPUTS:

                    classification_inputs

            },

            outputs={

                tf.saved_model.signature_constants.CLASSIFY_OUTPUT_CLASSES:

                    classification_outputs_classes,

                tf.saved_model.signature_constants.CLASSIFY_OUTPUT_SCORES:

                    classification_outputs_scores

            },

            method_name=tf.saved_model.signature_constants.CLASSIFY_METHOD_NAME))

    tensor_info_x = tf.saved_model.utils.build_tensor_info(x)

    tensor_info_y = tf.saved_model.utils.build_tensor_info(y)

    # 预测签名：输入的x和输出的y

    prediction_signature = (

        tf.saved_model.signature_def_utils.build_signature_def(

            inputs={'images': tensor_info_x},

            outputs={'scores': tensor_info_y},

            method_name=tf.saved_model.signature_constants.PREDICT_METHOD_NAME))

    # 构建图和变量的信息：

    """

    sess                会话

    tags                标签，默认提供serving、train、eval、gpu、tpu

    signature_def_map   签名

    main_op             初始化？

    strip_default_attrs strip?

    """

    # predict_images就是服务调用的方法

    # serving_default是没有输入签名时，使用的方法

    builder.add_meta_graph_and_variables(

        sess, [tf.saved_model.tag_constants.SERVING],

        signature_def_map={

            'predict_images':

                prediction_signature,

            tf.saved_model.signature_constants.DEFAULT_SERVING_SIGNATURE_DEF_KEY:

                classification_signature,

        },

        main_op=tf.tables_initializer(),

        strip_default_attrs=True)

    # 保存

    builder.save()

    print('Done exporting!')

if __name__ == '__main__':

    tf.app.run()

执行后，在当前目录中就有一个save_model文件，保存了各个版本的pb模型文件。

然后基于grpc部署服务：

docker run -p 8500:8500 --mount type=bind,source=/Users/xingoo/PycharmProjects/ml-in-action/01-实践-tensorflow/01-官方文档-学习和使用ML/save_model,target=/models/mnist -e MODEL_NAME=mnist -t tensorflow/serving &

服务部署成功，查看一下docker列表：

➜  ~ docker ps

CONTAINER ID        IMAGE                COMMAND                  CREATED             STATUS              PORTS                              NAMES

39a06cc35961        tensorflow/serving   "/usr/bin/tf_serving…"   4 seconds ago       Up 3 seconds        0.0.0.0:8500->8500/tcp, 8501/tcp   hardcore_galileo

然后编写对应的client代码：

import tensorflow as tf

import basic.mnist_input_data as mnist_input_data

import grpc

import numpy as np

import sys

import threading

from tensorflow_serving.apis import predict_pb2

from tensorflow_serving.apis import prediction_service_pb2_grpc

tf.app.flags.DEFINE_integer('concurrency', 1, 'maximum number of concurrent inference requests')

tf.app.flags.DEFINE_integer('num_tests', 100, 'Number of test images')

tf.app.flags.DEFINE_string('server', 'localhost:8500', 'PredictionService host:port')

tf.app.flags.DEFINE_string('work_dir', './tmp', 'Working directory. ')

FLAGS = tf.app.flags.FLAGS

test_data_set = mnist_input_data.read_data_sets(FLAGS.work_dir).test

channel = grpc.insecure_channel(FLAGS.server)

stub = prediction_service_pb2_grpc.PredictionServiceStub(channel)

class _ResultCounter(object):

    """Counter for the prediction results."""

    def __init__(self, num_tests, concurrency):

        self._num_tests = num_tests

        self._concurrency = concurrency

        self._error = 0

        self._done = 0

        self._active = 0

        self._condition = threading.Condition()

    def inc_error(self):

        with self._condition:

            self._error += 1

    def inc_done(self):

        with self._condition:

            self._done += 1

            self._condition.notify()

    def dec_active(self):

        with self._condition:

            self._active -= 1

            self._condition.notify()

    def get_error_rate(self):

        with self._condition:

            while self._done != self._num_tests:

                self._condition.wait()

            return self._error / float(self._num_tests)

    def throttle(self):

        with self._condition:

            while self._active == self._concurrency:

                self._condition.wait()

            self._active += 1

def _create_rpc_callback(label, result_counter):

    def _callback(result_future):

        exception = result_future.exception()

        if exception:

            result_counter.inc_error()

            print(exception)

        else:

            response = np.array(result_future.result().outputs['scores'].float_val)

            prediction = np.argmax(response)

            sys.stdout.write("%s - %s\n" % (label, prediction))

            sys.stdout.flush()

        result_counter.inc_done()

        result_counter.dec_active()

    return _callback

result_counter = _ResultCounter(FLAGS.num_tests, FLAGS.concurrency)

for i in range(FLAGS.num_tests):

    request = predict_pb2.PredictRequest()

    request.model_spec.name = 'mnist'

    request.model_spec.signature_name = 'predict_images'

    image, label = test_data_set.next_batch(1)

    request.inputs['images'].CopyFrom(tf.contrib.util.make_tensor_proto(image[0], shape=[1, image[0].size]))

    result_counter.throttle()

    result_future = stub.Predict.future(request, 5.0)  # 5 seconds

    result_future.add_done_callback(_create_rpc_callback(label[0], result_counter))

print(result_counter.get_error_rate())

得到对应的输出：

深度学习Tensorflow生产环境部署（下·模型部署篇）的更多相关文章

深度学习Tensorflow生产环境部署（上·环境准备篇）
最近在研究Tensorflow Serving生产环境部署,尤其是在做服务器GPU环境部署时,遇到了不少坑.特意总结一下,当做前车之鉴. 1 系统背景系统是ubuntu16.04 ubuntu@ub ...
linux服务器上配置进行kaggle比赛的深度学习tensorflow keras环境详细教程
本文首发于个人博客https://kezunlin.me/post/6b505d27/,欢迎阅读最新内容! full guide tutorial to install and configure d ...
深度学习-tensorflow学习笔记(1)-MNIST手写字体识别预备知识
深度学习-tensorflow学习笔记(1)-MNIST手写字体识别预备知识在tf第一个例子的时候需要很多预备知识. tf基本知识香农熵交叉熵代价函数cross-entropy 卷积神经网络 s ...
深度学习-tensorflow学习笔记(2)-MNIST手写字体识别
深度学习-tensorflow学习笔记(2)-MNIST手写字体识别超级详细版这是tf入门的第一个例子.minst应该是内置的数据集. 前置知识在学习笔记(1)里面讲过了这里直接上代码 # -*- ...
深度学习之Attention Model（注意力模型）
1.Attention Model 概述深度学习里的Attention model其实模拟的是人脑的注意力模型,举个例子来说,当我们观赏一幅画时,虽然我们可以看到整幅画的全貌,但是在我们深入仔细地观 ...
[源码解析] 深度学习流水线并行 PipeDream(3)--- 转换模型
[源码解析] 深度学习流水线并行 PipeDream(3)--- 转换模型目录 [源码解析] 深度学习流水线并行 PipeDream(3)--- 转换模型 0x00 摘要 0x01 前言 1.1 改 ...
深度学习Tensorflow相关书籍推荐和PDF下载
深度学习Tensorflow相关书籍推荐和PDF下载 baihualinxin关注 32018.03.28 10:46:16字数 481阅读 22,673 1.机器学习入门经典<统计学习方法&g ...
深度学习入门者的Python快速教程 - 基础篇
5.1 Python简介本章将介绍Python的最基本语法,以及一些和深度学习还有计算机视觉最相关的基本使用. 5.1.1 Python简史 Python是一门解释型的高级编程语言,特点是简单明 ...
在linux ubuntu下搭建深度学习/机器学习开发环境
一.安装Anaconda 1.下载下载地址为:https://www.anaconda.com/download/#linux 2.安装anaconda,执行命令: bash ~/Downloads ...

随机推荐

Servlet+JSP例子
前面两节已经学习了什么是Servlet,Servlet接口函数是哪些.怎么运行.Servlet生命周期是什么? 以及Servlet中的模式匹配URL,web.xml配置和HttpServlet.怎么 ...
selenium启动谷歌浏览器
谷歌浏览器可以是任意版本. 根据谷歌浏览器的版本,下载响应的浏览器驱动(百度:谷歌驱动与谷歌浏览器版本映射表). 我用的是win7 ,下载win32就行了.下载解压之后可以放在python的安装目录下 ...
wsl
1.win10设置为开发人员模式,并安装subsystem for linux(命令行输入bash就可以安装) 2.登录并su到root,然后修改/etc/sudoers,在最后一行加(一定要在最后一 ...
JavaScript数据类型，构造函数
JavaScript 的数据类型分为两类:原始类型和对象类型. JavaScript中的原始类型包括数字.字符串和布尔值 JavaScript中有两种特殊的原始值:null和undefined. Ja ...
团队项目Alpha冲刺阶段之学习总结
线性布局线性布局是程序中最常见的种布局方式,可以分为水平线性布局和重直线性布局两种,通过Android:orientation属性可以设置线性布局的方向.线性布局的特点是各个子元法彼此连接,中间不留 ...
Numpy 创建数组
ndarray 数组除了可以使用底层 ndarray 构造器来创建外, 也可以通过以下几种方式来创建. numpy.empty numpy.empty 方法用来创建一个指定形状(shape),数据类型 ...
博客六--Tensorflow卷积神经网络的自主搭建
本人较懒也很忙,所以就不重复工作.连接我的开源中国博客查询:https://my.oschina.net/u/3770644/blog/3042523
虚拟DOM
传统的 DOM 操作是直接在 DOM 上操作,当需要修改一系列元素中的值时,就会直接对 DOM 进行操作.如果需要操作的DOM元素过多,则成本太高,而采用 Virtual DOM 则会对需要修改的 D ...
记录一下maven使用过程中的问题
Failed to execute goal on project bos_fore: Could not resolve dependencies for project 上面问题,我把<de ...
python3+ selenium3开发环境搭建
环境搭建基于python3和selenium3做自动化测试,俗话说:工欲善其事必先利其器:没有金刚钻就不揽那瓷器活,磨刀不误砍柴工,因此你必须会搭建基本的开发环境,掌握python基本的语法和一个I ...

深度学习Tensorflow生产环境部署（下·模型部署篇）

1 hello world篇

2 mnist篇

深度学习Tensorflow生产环境部署（下·模型部署篇）的更多相关文章

随机推荐

热门专题