[Implementing a Convolutional Neural Network in Python] Defining the Training and Testing Process

Code source: https://github.com/eriklindernoren/ML-From-Scratch

Implementation of the Conv2D convolutional layer (with stride and padding): https://www.cnblogs.com/xiximayou/p/12706576.html

Activation functions (sigmoid, softmax, tanh, relu, leakyrelu, elu, selu, softplus): https://www.cnblogs.com/xiximayou/p/12713081.html

Loss functions (mean squared error, cross-entropy): https://www.cnblogs.com/xiximayou/p/12713198.html

Optimizers (SGD, Nesterov, Adagrad, Adadelta, RMSprop, Adam): https://www.cnblogs.com/xiximayou/p/12713594.html

Backpropagation through the convolutional layer: https://www.cnblogs.com/xiximayou/p/12713930.html

Fully connected layer: https://www.cnblogs.com/xiximayou/p/12720017.html

Batch normalization layer: https://www.cnblogs.com/xiximayou/p/12720211.html

Pooling layer: https://www.cnblogs.com/xiximayou/p/12720324.html

padding2D: https://www.cnblogs.com/xiximayou/p/12720454.html

Flatten layer: https://www.cnblogs.com/xiximayou/p/12720518.html

UpSampling2D layer: https://www.cnblogs.com/xiximayou/p/12720558.html

Dropout layer: https://www.cnblogs.com/xiximayou/p/12720589.html

Activation layer: https://www.cnblogs.com/xiximayou/p/12720622.html

First, here is the complete code:

from __future__ import print_function, division
from terminaltables import AsciiTable
import numpy as np
import progressbar
from mlfromscratch.utils import batch_iterator
from mlfromscratch.utils.misc import bar_widgets


class NeuralNetwork():
    """Neural Network. Deep Learning base model.

    Parameters:
    -----------
    optimizer: class
        The weight optimizer that will be used to tune the weights in order to minimize
        the loss.
    loss: class
        Loss function used to measure the model's performance. SquareLoss or CrossEntropy.
    validation_data: tuple
        A tuple containing validation data and labels (X, y)
    """
    def __init__(self, optimizer, loss, validation_data=None):
        self.optimizer = optimizer
        self.layers = []
        # Mean loss per epoch, for the training set and the validation set
        self.errors = {"training": [], "validation": []}
        self.loss_function = loss()
        self.progressbar = progressbar.ProgressBar(widgets=bar_widgets)

        self.val_set = None
        if validation_data:
            X, y = validation_data
            self.val_set = {"X": X, "y": y}

    def set_trainable(self, trainable):
        """ Method which enables freezing of the weights of the network's layers. """
        for layer in self.layers:
            layer.trainable = trainable

    def add(self, layer):
        """ Method which adds a layer to the neural network """
        # If this is not the first layer added then set the input shape
        # to the output shape of the last added layer
        if self.layers:
            layer.set_input_shape(shape=self.layers[-1].output_shape())

        # If the layer has weights that need to be initialized
        if hasattr(layer, 'initialize'):
            layer.initialize(optimizer=self.optimizer)

        # Add layer to the network
        self.layers.append(layer)

    def test_on_batch(self, X, y):
        """ Evaluates the model over a single batch of samples """
        y_pred = self._forward_pass(X, training=False)
        loss = np.mean(self.loss_function.loss(y, y_pred))
        acc = self.loss_function.acc(y, y_pred)

        return loss, acc

    def train_on_batch(self, X, y):
        """ Single gradient update over one batch of samples """
        y_pred = self._forward_pass(X)
        loss = np.mean(self.loss_function.loss(y, y_pred))
        acc = self.loss_function.acc(y, y_pred)
        # Calculate the gradient of the loss function wrt y_pred
        loss_grad = self.loss_function.gradient(y, y_pred)
        # Backpropagate. Update weights
        self._backward_pass(loss_grad=loss_grad)

        return loss, acc

    def fit(self, X, y, n_epochs, batch_size):
        """ Trains the model for a fixed number of epochs """
        for _ in self.progressbar(range(n_epochs)):

            batch_error = []
            for X_batch, y_batch in batch_iterator(X, y, batch_size=batch_size):
                loss, _ = self.train_on_batch(X_batch, y_batch)
                batch_error.append(loss)

            self.errors["training"].append(np.mean(batch_error))

            if self.val_set is not None:
                val_loss, _ = self.test_on_batch(self.val_set["X"], self.val_set["y"])
                self.errors["validation"].append(val_loss)

        return self.errors["training"], self.errors["validation"]

    def _forward_pass(self, X, training=True):
        """ Calculate the output of the NN """
        layer_output = X
        for layer in self.layers:
            layer_output = layer.forward_pass(layer_output, training)

        return layer_output

    def _backward_pass(self, loss_grad):
        """ Propagate the gradient 'backwards' and update the weights in each layer """
        for layer in reversed(self.layers):
            loss_grad = layer.backward_pass(loss_grad)

    def summary(self, name="Model Summary"):
        # Print model name
        print(AsciiTable([[name]]).table)
        # Network input shape (first layer's input shape)
        print("Input Shape: %s" % str(self.layers[0].input_shape))
        # Iterate through network and get each layer's configuration
        table_data = [["Layer Type", "Parameters", "Output Shape"]]
        tot_params = 0
        for layer in self.layers:
            layer_name = layer.layer_name()
            params = layer.parameters()
            out_shape = layer.output_shape()
            table_data.append([layer_name, str(params), str(out_shape)])
            tot_params += params
        # Print network configuration table
        print(AsciiTable(table_data).table)
        print("Total Parameters: %d\n" % tot_params)

    def predict(self, X):
        """ Use the trained model to predict labels of X """
        return self._forward_pass(X, training=False)

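Next, let's go through the methods one by one:

1. Initialization, __init__(): this sets up the optimizer (optimizer), the list of layers (layers), the per-epoch training and validation errors (errors), the loss function (loss_function), and the progress bar (progressbar). If validation data is passed in, it is stored in val_set. The progress bar is built from bar_widgets, imported from mlfromscratch.utils.misc. Here is its definition: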

bar_widgets = [
    'Training: ', progressbar.Percentage(), ' ', progressbar.Bar(marker="-", left="[", right="]"),
    ' ', progressbar.ETA()
]

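2. set_trainable(): controls whether the layers update their parameters. It sets the same trainable flag on every layer, so it freezes or unfreezes the whole network at once rather than selecting individual layers.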
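A minimal sketch of what freezing means in practice. ToyLayer and the fixed step size 0.1 are hypothetical stand-ins, not the library's classes, and the sketch assumes a layer checks its trainable flag before applying an update in backward_pass:

```python
import numpy as np


class ToyLayer:
    """Hypothetical layer: holds a weight matrix and a trainable flag."""

    def __init__(self):
        self.W = np.ones((2, 2))
        self.trainable = True

    def backward_pass(self, accum_grad):
        # Assumption: the weight update is skipped when the layer is frozen.
        if self.trainable:
            self.W -= 0.1 * accum_grad
        return accum_grad


def set_trainable(layers, trainable):
    # Same loop as NeuralNetwork.set_trainable: one flag for all layers.
    for layer in layers:
        layer.trainable = trainable


layers = [ToyLayer(), ToyLayer()]
grad = np.ones((2, 2))

# Frozen: the backward pass runs, but no weights change.
set_trainable(layers, False)
for layer in reversed(layers):
    layer.backward_pass(grad)
frozen_unchanged = all(np.array_equal(l.W, np.ones((2, 2))) for l in layers)

# Unfrozen: the same backward pass now moves the weights.
set_trainable(layers, True)
for layer in reversed(layers):
    layer.backward_pass(grad)
updated = all(np.allclose(l.W, 0.9) for l in layers)

print(frozen_unchanged, updated)
```

Note that the gradient is still passed on to earlier layers when a layer is frozen; only the weight update is skipped.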

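3. add(): appends a module to the network, for example a convolutional layer, a pooling layer, or an activation layer. For every layer after the first, the input shape is set to the output shape of the previously added layer, and layers that have weights are initialized with the optimizer.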
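The shape chaining done by add() can be sketched with stand-in classes. ToyDense and ToyNetwork below are hypothetical and only mimic the three methods that add() calls (set_input_shape, initialize, output_shape); they are not the library's layers:

```python
class ToyDense:
    """Hypothetical stand-in for a layer with n_units outputs."""

    def __init__(self, n_units, input_shape=None):
        self.n_units = n_units
        self.input_shape = input_shape
        self.initialized = False

    def set_input_shape(self, shape):
        self.input_shape = shape

    def initialize(self, optimizer):
        # A real layer would allocate its weights here; we only record the call.
        self.initialized = True

    def output_shape(self):
        return (self.n_units,)


class ToyNetwork:
    def __init__(self, optimizer=None):
        self.optimizer = optimizer
        self.layers = []

    def add(self, layer):
        # Same logic as NeuralNetwork.add
        if self.layers:
            layer.set_input_shape(shape=self.layers[-1].output_shape())
        if hasattr(layer, "initialize"):
            layer.initialize(optimizer=self.optimizer)
        self.layers.append(layer)


net = ToyNetwork()
net.add(ToyDense(16, input_shape=(8,)))  # first layer: input shape given explicitly
net.add(ToyDense(4))                     # later layers: input shape is inferred

shapes = [(layer.input_shape, layer.output_shape()) for layer in net.layers]
print(shapes)  # [((8,), (16,)), ((16,), (4,))]
```

Only the first layer needs an explicit input shape; every later layer inherits it from its predecessor.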

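4. test_on_batch(): evaluates the model on a single batch. The forward pass runs with training=False and no backpropagation is performed.

5. train_on_batch(): trains on a single batch. A forward pass computes the loss and accuracy, then the gradient of the loss with respect to the predictions is backpropagated to update the parameters.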

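6. fit(): feeds in the data for training and, if validation data was supplied, evaluates on the validation set after each epoch. You need to specify n_epochs and batch_size. Batches are read by batch_iterator(), which lives in data_manipulation.py under mlfromscratch.utils: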

def batch_iterator(X, y=None, batch_size=64):
    """ Simple batch generator """
    n_samples = X.shape[0]
    for i in np.arange(0, n_samples, batch_size):
        begin, end = i, min(i+batch_size, n_samples)
        if y is not None:
            yield X[begin:end], y[begin:end]
        else:
            yield X[begin:end]
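A short usage sketch of the generator, with the function repeated so the snippet runs on its own. The array sizes are arbitrary illustration values:

```python
import numpy as np


def batch_iterator(X, y=None, batch_size=64):
    """Same generator as quoted above, repeated so the snippet runs alone."""
    n_samples = X.shape[0]
    for i in np.arange(0, n_samples, batch_size):
        begin, end = i, min(i + batch_size, n_samples)
        if y is not None:
            yield X[begin:end], y[begin:end]
        else:
            yield X[begin:end]


X = np.arange(20).reshape(10, 2)  # 10 samples, 2 features
y = np.arange(10)

batch_sizes = [len(X_batch) for X_batch, _ in batch_iterator(X, y, batch_size=4)]
print(batch_sizes)  # [4, 4, 2]
```

The last batch is simply shorter when the number of samples is not a multiple of batch_size. The generator also walks the data in order without shuffling, so any shuffling has to happen before the data is passed to fit().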

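7. _forward_pass(): forward propagation. The input is passed through the layers in order, each layer's output becoming the next layer's input.

8. _backward_pass(): backpropagation. The loss gradient is passed through the layers in reverse order, and each layer updates its weights along the way.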
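The order in which the layers are visited can be made visible with a layer that does nothing but log its calls. RecordingLayer and the layer names are hypothetical and exist only for this illustration:

```python
class RecordingLayer:
    """Hypothetical layer that passes data through and logs each call."""

    def __init__(self, name, log):
        self.name = name
        self.log = log

    def forward_pass(self, X, training=True):
        self.log.append(("forward", self.name))
        return X

    def backward_pass(self, accum_grad):
        self.log.append(("backward", self.name))
        return accum_grad


log = []
layers = [RecordingLayer(name, log) for name in ("conv", "relu", "dense")]

# Same loops as _forward_pass and _backward_pass
out = 1.0
for layer in layers:
    out = layer.forward_pass(out, training=True)

grad = 1.0
for layer in reversed(layers):
    grad = layer.backward_pass(grad)

order = [name for _, name in log]
print(order)  # ['conv', 'relu', 'dense', 'dense', 'relu', 'conv']
```

Each backward_pass returns the gradient with respect to its own input, which is exactly what the preceding layer needs as its incoming gradient.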

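9. summary(): prints the network's input shape, then each layer's type, parameter count, and output shape, followed by the total number of parameters.

10. predict(): returns the model's predictions for X, using a forward pass with training=False.

It is easy to see that this code borrows design ideas from TensorFlow, in particular its Keras-style model interface (add, fit, train_on_batch, test_on_batch, summary, predict).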
