"TensorFlow 2 Deep Learning" Study Notes (4): Adding an Accuracy Display to the Model from Note 2

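The full code is below. Lines that differ from Note 2 (shown in red in the original post) carry a trailing #---changed comment.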

#1.Import the necessary libraries

import numpy as np

import tensorflow as tf

import matplotlib

from matplotlib import pyplot as plt

########################################################################

#2.Set default parameters for plots

matplotlib.rcParams['font.size'] = 20

matplotlib.rcParams['figure.titlesize'] = 20

matplotlib.rcParams['figure.figsize'] = [9, 7]

matplotlib.rcParams['font.family'] = ['STKaiTi']

matplotlib.rcParams['axes.unicode_minus']=False

########################################################################

#3.Initialize Parameters

#Initialize learning rate

lr = 1e-2                                              #----------------------changed

#Initialize batch size

batchsz = 512

#Initialize the lists that record loss and accuracy

losses = []

accs = []                                              #----------------------changed

#Initialize the weights and biases of each layer

w1=tf.Variable(tf.random.truncated_normal([784,256],stddev=0.1))

b1=tf.Variable(tf.zeros([256]))

w2=tf.Variable(tf.random.truncated_normal([256,128],stddev=0.1))

b2=tf.Variable(tf.zeros([128]))

w3=tf.Variable(tf.random.truncated_normal([128,10],stddev=0.1))

b3=tf.Variable(tf.zeros([10]))

########################################################################

#4.Define preprocess function                          #----------------------changed

def preprocess(x,y):

    x=tf.cast(x,dtype=tf.float32)/255.

    x=tf.reshape(x,[-1,28*28])

    y=tf.cast(y,dtype=tf.int32)

    # tf.one_hot takes integer class indices (int32 here) and returns float32 by default

    y=tf.one_hot(y,depth=10)

    return x,y

########################################################################

#5.Load the MNIST dataset from a local file

(x_train,y_train),(x_test,y_test)=tf.keras.datasets.mnist.load_data(path=r'F:\learning\machineLearning\TensorFlow2_deeplearning\forward_progression\mnist.npz')

train_db=tf.data.Dataset.from_tensor_slices((x_train,y_train))

train_db=train_db.shuffle(10000)                       #-----------------------changed

train_db=train_db.batch(batchsz)

train_db=train_db.map(preprocess)

#Repeat the dataset 20 times, i.e. train for 20 epochs

train_db=train_db.repeat(20)

test_db=tf.data.Dataset.from_tensor_slices((x_test,y_test))

test_db=test_db.shuffle(1000).batch(batchsz).map(preprocess)

########################################################################

#The main function

def main():

    for step,(x,y) in enumerate(train_db):#Or for x,y in train_db:

        with tf.GradientTape() as tape: # the tape automatically tracks tf.Variable objects

            # layer1

            h1 = x@w1 + b1

            h1 = tf.nn.relu(h1)

            # layer2

            h2 = h1@w2 + b2

            h2 = tf.nn.relu(h2)

            # output

            out = h2@w3 + b3

            # compute loss

            loss = tf.square(y-out)

            # mean: scalar

            loss = tf.reduce_mean(loss)

        # compute gradients

        grads = tape.gradient(loss, [w1, b1, w2, b2, w3, b3])

        #Update the weights and the bias               #-----------------------changed

        for p, g in zip([w1, b1, w2, b2, w3, b3], grads):

            p.assign_sub(lr * g)

        if step % 80 == 0:

            print(step, 'loss:', float(loss))

            losses.append(float(loss))

        if step % 80 == 0:                             #-----------------------changed

            total, total_correct = 0., 0

            for x,y in test_db:

                # layer1

                h1 = x@w1 + b1

                h1 = tf.nn.relu(h1)

                # layer2

                h2 = h1@w2 + b2

                h2 = tf.nn.relu(h2)

                # output

                out = h2@w3 + b3

                pred=tf.argmax(out,axis=1)

                y=tf.argmax(y,axis=1)

                correct=tf.equal(pred,y)

                total_correct+=tf.reduce_sum(tf.cast(correct,dtype=tf.int32)).numpy()

                total+=x.shape[0]

            print(step,'Evaluate ACC:',total_correct/total)

            accs.append(total_correct/total)

    plt.figure()

    x = [i*80 for i in range(len(losses))]

    plt.plot(x, losses, color='C0', marker='s', label='Train')

    plt.ylabel('MSE')

    plt.xlabel('Step')

    plt.legend()

    plt.figure()

    plt.plot(x, accs, color='C1', marker='s', label='Test')

    plt.ylabel('Accuracy')

    plt.xlabel('Step')

    plt.legend()

    plt.show()

if __name__ == '__main__':

    main()
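
The evaluation loop in the listing turns each row of out into a predicted class with argmax, compares it with the argmax of the one-hot label, and accumulates the number of matches. A minimal sketch of that bookkeeping on a hand-made batch follows. It uses NumPy in place of TensorFlow so that it runs without the dataset, and the arrays logits and labels_onehot are made-up illustration values, not MNIST data.

```python
import numpy as np

# Hypothetical network outputs for 4 samples and 3 classes (illustration only).
logits = np.array([[2.0, 0.1, 0.3],
                   [0.2, 1.5, 0.1],
                   [0.9, 0.8, 0.7],
                   [0.1, 0.2, 3.0]])
# One-hot labels for the same 4 samples; the true classes are 0, 1, 2, 2.
labels_onehot = np.array([[1, 0, 0],
                          [0, 1, 0],
                          [0, 0, 1],
                          [0, 0, 1]])

pred = np.argmax(logits, axis=1)          # predicted class per sample
true = np.argmax(labels_onehot, axis=1)   # undo the one-hot encoding
correct = np.equal(pred, true)            # boolean mask of correct predictions

total_correct = int(correct.astype(np.int32).sum())  # number of hits in this batch
total = logits.shape[0]                               # number of samples in this batch
accuracy = total_correct / total
print(pred, true, accuracy)
```

In the full script the same two counters are summed over every test batch before the division, so the last, smaller batch is weighted correctly.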

The learning rate is changed to 1e-2 here. In testing, with 1e-3 the accuracy rose slowly and ended at about 30-40% after 20 epochs, whereas with 1e-2 it came close to 80%.
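
The effect of the step size can be seen on a much smaller problem. The sketch below is not the MNIST model: it runs plain gradient descent on the one-variable function f(w) = w**2, a toy stand-in chosen purely for illustration, for the same number of steps with both learning rates. The function name gradient_descent and the step budget are assumptions made for this example.

```python
def gradient_descent(lr, steps, w0=1.0):
    """Minimise f(w) = w**2 with fixed-step gradient descent; return the final w."""
    w = w0
    for _ in range(steps):
        grad = 2.0 * w      # derivative of w**2
        w -= lr * grad      # same update rule as p.assign_sub(lr * g)
    return w

steps = 200  # arbitrary step budget for the comparison
w_small = gradient_descent(lr=1e-3, steps=steps)
w_large = gradient_descent(lr=1e-2, steps=steps)
print(f"lr=1e-3 -> w={w_small:.4f}, lr=1e-2 -> w={w_large:.4f}")
```

With the same number of steps, the run with 1e-3 ends much further from the minimum at w = 0 than the run with 1e-2. On this toy problem the larger rate is simply faster; on a real network a rate that is too large can also diverge, so the value still has to be tuned empirically.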

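In addition, train_db is now preprocessed with .map(preprocess), which scales the image data to the range [0, 1], reshapes it to [-1, 28*28], and one-hot encodes the labels with depth 10. train_db=train_db.repeat(20) replaces the outer loop for epoch in range(20). Finally, the loop

for p, g in zip([w1, b1, w2, b2, w3, b3], grads):

    p.assign_sub(lr * g)

replaces the six explicit updates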

w1.assign_sub(lr * grads[0])

b1.assign_sub(lr * grads[1])

w2.assign_sub(lr * grads[2])

b2.assign_sub(lr * grads[3])

w3.assign_sub(lr * grads[4])

b3.assign_sub(lr * grads[5])
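
Both forms apply the same rule, p = p - lr * g, to each parameter; the loop just avoids repeating the line once per variable. A small check of that equivalence is sketched below. It uses NumPy arrays with in-place subtraction as a stand-in for tf.Variable.assign_sub, and the shapes and gradient values are invented for the example.

```python
import numpy as np

lr = 1e-2
rng = np.random.default_rng(0)

# Hypothetical parameters and gradients with small, made-up shapes.
shapes = [(4, 3), (3,), (3, 2), (2,)]
params_loop = [rng.normal(size=s) for s in shapes]
params_explicit = [p.copy() for p in params_loop]
grads = [rng.normal(size=s) for s in shapes]

# Loop form: pair every parameter with its gradient.
for p, g in zip(params_loop, grads):
    p -= lr * g  # in-place update, analogous to p.assign_sub(lr * g)

# Explicit form: one line per parameter, indexed by position.
params_explicit[0] -= lr * grads[0]
params_explicit[1] -= lr * grads[1]
params_explicit[2] -= lr * grads[2]
params_explicit[3] -= lr * grads[3]

# Both lists should now hold identical values.
same = all(np.array_equal(a, b) for a, b in zip(params_loop, params_explicit))
print(same)
```

The loop form relies on the parameter list being in the same order as the list passed to tape.gradient, which is the case in the listing.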
