softmax及python实现

相对于自适应神经网络、感知器，softmax巧妙低使用简单的方法来实现多分类问题。

功能上，完成从N维向量到M维向量的映射
输出的结果范围是[0, 1]，对于一个sample的结果所有输出总和等于1
输出结果，可以隐含地表达该类别的概率

softmax的损失函数是采用了多分类问题中常见的交叉熵，注意经常有2个表达的形式

经典的交叉熵形式：L=-sum(y_right * log(y_pred))，具体见https://blog.csdn.net/bqw18744018044/article/details/83120425
简单版本是: L = -Log(y_pred),具体见https://blog.csdn.net/red_stone1/article/details/80687921

这两个版本在求导过程有点不同，但是结果都是一样的，同时损失表达的意思也是相同的，因为在第一种表达形式中，当y不是正确分类时,y_right等于0，当y是正确分类时,y_right等于1。

下面基于mnist数据做了一个多分类的实验，整体能达到85%的精度。

'''

softmax classifier for mnist  

created on 2019.9.28

author: vince

'''

import math

import logging

import numpy

import random

import matplotlib.pyplot as plt

from tensorflow.contrib.learn.python.learn.datasets.mnist import read_data_sets

from sklearn.metrics import accuracy_score

def loss_max_right_class_prob(predictions, y):

	return -predictions[numpy.argmax(y)];

def loss_cross_entropy(predictions, y):

	return -numpy.dot(y, numpy.log(predictions));

'''

Softmax classifier

linear classifier

'''

class Softmax:

	def __init__(self, iter_num = 100000, batch_size = 1):

		self.__iter_num = iter_num;

		self.__batch_size = batch_size;

	def train(self, train_X, train_Y):

		X = numpy.c_[train_X, numpy.ones(train_X.shape[0])];

		Y = numpy.copy(train_Y);

		self.L = [];

		#initialize parameters

		self.__weight = numpy.random.rand(X.shape[1], 10) * 2 - 1.0;

		self.__step_len = 1e-3; 

		logging.info("weight:%s" % (self.__weight));

		for iter_index in range(self.__iter_num):

			if iter_index % 1000 == 0:

				logging.info("-----iter:%s-----" % (iter_index));

			if iter_index % 100 == 0:

				l = 0;

				for i in range(0, len(X), 100):

					predictions = self.forward_pass(X[i]);

					#l += loss_max_right_class_prob(predictions, Y[i]);

					l += loss_cross_entropy(predictions, Y[i]);

				l /= len(X);

				self.L.append(l);

			sample_index = random.randint(0, len(X) - 1);

			logging.debug("-----select sample %s-----" % (sample_index));

			z = numpy.dot(X[sample_index], self.__weight);

			z = z - numpy.max(z);

			predictions = numpy.exp(z) / numpy.sum(numpy.exp(z));

			dw = self.__step_len * X[sample_index].reshape(-1, 1).dot((predictions - Y[sample_index]).reshape(1, -1));

#			dw = self.__step_len * X[sample_index].reshape(-1, 1).dot(predictions.reshape(1, -1));

#			dw[range(X.shape[1]), numpy.argmax(Y[sample_index])] -= X[sample_index] * self.__step_len;

			self.__weight -= dw;

			logging.debug("weight:%s" % (self.__weight));

			logging.debug("loss:%s" % (l));

		logging.info("weight:%s" % (self.__weight));

		logging.info("L:%s" % (self.L));

	def forward_pass(self, x):

		net = numpy.dot(x, self.__weight);

		net = net - numpy.max(net);

		net = numpy.exp(net) / numpy.sum(numpy.exp(net));

		return net;

	def predict(self, x):

		x = numpy.append(x, 1.0);

		return self.forward_pass(x);

def main():

	logging.basicConfig(level = logging.INFO,

			format = '%(asctime)s %(filename)s[line:%(lineno)d] %(levelname)s %(message)s',

			datefmt = '%a, %d %b %Y %H:%M:%S');

	logging.info("trainning begin.");

	mnist = read_data_sets('../data/MNIST',one_hot=True)    # MNIST_data指的是存放数据的文件夹路径，one_hot=True 为采用one_hot的编码方式编码标签

	#load data

	train_X = mnist.train.images                #训练集样本

	validation_X = mnist.validation.images      #验证集样本

	test_X = mnist.test.images                  #测试集样本

	#labels

	train_Y = mnist.train.labels                #训练集标签

	validation_Y = mnist.validation.labels      #验证集标签

	test_Y = mnist.test.labels                  #测试集标签

	classifier = Softmax();

	classifier.train(train_X, train_Y);

	logging.info("trainning end. predict begin.");

	test_predict = numpy.array([]);

	test_right = numpy.array([]);

	for i in range(len(test_X)):

		predict_label = numpy.argmax(classifier.predict(test_X[i]));

		test_predict = numpy.append(test_predict, predict_label);

		right_label = numpy.argmax(test_Y[i]);

		test_right = numpy.append(test_right, right_label);

	logging.info("right:%s, predict:%s" % (test_right, test_predict));

	score = accuracy_score(test_right, test_predict);

	logging.info("The accruacy score is: %s "% (str(score)));

	plt.plot(classifier.L)

	plt.show();

if __name__ == "__main__":

	main();

损失函数收敛情况

Sun, 29 Sep 2019 18:08:08 softmax.py[line:104] INFO trainning end. predict begin.

Sun, 29 Sep 2019 18:08:08 softmax.py[line:114] INFO right:[7. 2. 1. ... 4. 5. 6.], predict:[7. 2. 1. ... 4. 8. 6.]

Sun, 29 Sep 2019 18:08:08 softmax.py[line:116] INFO The accruacy score is: 0.8486

softmax及python实现的更多相关文章

机器学习-softmax回归 python实现
---恢复内容开始--- Softmax Regression 可以看做是 LR 算法在多分类上的推广,即类标签 y 的取值大于或者等于 2. 假设数据样本集为:$\left \{ \left ( X ...
softmax函数python实现
import numpy as np def softmax(x): """ 对输入x的每一行计算softmax. 该函数对于输入是向量(将向量视为单独的行)或者矩阵(M ...
TensorFlow(2)Softmax Regression
Softmax Regression Chapter Basics generate random Tensors Three usual activation function in Neural ...
logistic regression model
logistic regression model LR softmax classification Fly logistic regression model loss fuction softm ...
[C2W3] Improving Deep Neural Networks : Hyperparameter tuning, Batch Normalization and Programming Frameworks
第三周:Hyperparameter tuning, Batch Normalization and Programming Frameworks 调试处理(Tuning process) 目前为止, ...
softmax分类算法原理(用python实现)
逻辑回归神经网络实现手写数字识别如果更习惯看Jupyter的形式,请戳Gitthub_逻辑回归softmax神经网络实现手写数字识别.ipynb 1 - 导入模块 import numpy as n ...
手写数字识别 ----Softmax回归模型官方案例注释（基于Tensorflow,Python）
# 手写数字识别 ----Softmax回归模型 # regression import os import tensorflow as tf from tensorflow.examples.tut ...
如何用Python计算Softmax？
Softmax函数,或称归一化指数函数,它能将一个含任意实数的K维向量z"压缩"到另一个K维实向量$\sigma{(z)}$中,使得每一个元素的范围都在(0,1)之间,并且所有 ...
使用python计算softmax函数
softmax计算公式: Softmax是机器学习中一个非常重要的工具,他可以兼容 logistics 算法.可以独立作为机器学习的模型进行建模训练.还可 ...

随机推荐

PDF 相关操作
去年一年偷了下懒, 博客写了一点就没写了, 还好一些大的flag完成了. 花了半年的空余时间, 培养了一门兴趣爱好. 自己在为人处世上还是不够圆滑啊, 也难怪. 自己当初选择走技术这条路的初 ...
js实现box(2)(3)这种调用方式的方法
box(2)(3)函数的调用方法有两种: 第一种: var box = function(num1){ return function(num2){ return num1+num2; }; }; a ...
Git将文件上传至Github过程
1.安装Git工具(在这里就不多说了) 2.我们需要先创建一个本地的版本库(其实也就是一个文件夹). 你可以直接在桌面右击新建文件夹,也可以右击打开Git bash命令行窗口通过命令来创建. 现在我通 ...
学习Java技术哪家强
https://github.com/CyC2018/CS-Notes https://github.com/Snailclimb/JavaGuide SpringBoot 之配置文件优先级 htt ...
利用iTunes给MP3添加专辑插图
利用iTunes给MP3添加专辑插图打开iTunes 准备好没有专辑插图的mp3文件和插图将准备好的mp3文件拖入iTunes 右键菜单选择专辑信息选项在专辑信息里面选择插图点击左下角的添加插 ...
第一篇：解析Linux是什么？能干什么？它的应用领域！
不得不说的前言(不看完睡觉会尿床):饿货们~!你说你们上学都学了点啥?这不懂那也不懂,快毕业了啥也不会.专业课程不学好毕业了也找不到好工作.爸妈给你养大,投资了多少钱.你毕业后随便找了个什么鸡毛工作开 ...
RStudio终端操作
转于:https://support.rstudio.com/hc/en-us/articles/115010737148-Using-the-RStudio-Terminal#send 原文是英文版 ...
Silence主题美观清爽的cnblog第三方主题
为什么推荐? 才开通cnblog,但苦于官方主题都不是很好看,翻找Github的时候发现了这个项目Silence 这是预览地址官方展示图片安装中的坑不显示公共模块.博文目录.博文签名.博文赞赏. ...
Angular介绍
Angulay介绍 1.介绍:是一个用于Html和TypeScript构建客户端应用平台与框架.Angular 本身就是用 TypeScript 写成的.基本构造块是 NgModule,它为组件提供了 ...
《自拍教程46》Python_adb自动拍照100张
Android手机测试, 涉及照相机(Camera)应用程序的稳定性测试的用例, 需要涉及100张照片的拍照自动化测试. 准备阶段先清理老照片,照片一般存放在/scard/DCIM目录下 adb s ...

softmax及python实现

softmax及python实现的更多相关文章

随机推荐

热门专题