Digit Recognizer

Handwritten digit recognition on the MNIST dataset.

This competition provides 42000 training samples and 28000 test samples; the original MNIST has 60000 training and 10000 test samples.

I took a pass at it with One-vs-All Logistic Regression, a 784-200-200-10 Sparse AutoEncoder, and a Convolutional AutoEncoder.
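All of the scripts below load a pre-built digitData.mat containing train_x, train_y, test_x, and a KNN-derived test_y (the KNN step is not shown in this post). As a hypothetical sketch of how such a file could be built from Kaggle's train.csv/test.csv, assuming one row per sample, pixels scaled to [0,1], and one-hot labels with digit 0 mapped to class 10 (consistent with the pred(pred==10) = 0 lines further down):

% Hypothetical builder for digitData.mat; the exact preprocessing used in the
% post is not shown (test_y, the KNN pseudo-labels, would be added separately).
raw     = csvread('train.csv', 1, 0);        % skip the header row
labels  = raw(:, 1);
labels(labels == 0) = 10;                    % map digit 0 to class 10
train_x = raw(:, 2:end) / 255;               % 42000 x 784, scaled to [0,1]
train_y = full(sparse(1:numel(labels), labels, 1, numel(labels), 10));  % one-hot
test_x  = csvread('test.csv', 1, 0) / 255;   % 28000 x 784
save digitData train_x train_y test_x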

=============== Method 1: One-vs-All Logistic Regression ===================

%%
ccc                  % shortcut script, presumably clear; close all; clc
load digitData

%%
input_layer_size = 28*28;
num_ys = 10;
X = train_x;
[~, y] = max(train_y, [], 2);
lambda = 0.1;
lambda = 100;        % overrides the 0.1 above
[all_theta] = oneVsAll(X, y, num_ys, lambda);

%% ================ Part: Predict for One-Vs-All ================
pred = predictOneVsAll(all_theta, X);
fprintf('\nTraining Set Accuracy: %f\n', mean(double(pred == y)) * 100);

%% Compute test accuracy (test_y is KNN-based and only a rough reference)
[~, test_y] = max(test_y, [], 2);
pred = predictOneVsAll(all_theta, test_x);
fprintf('\nTest Set Accuracy: %f\n', mean(double(pred == test_y)) * 100);

%% write csv file
pred(pred==10) = 0;          % class 10 encodes digit 0
M = [(1:length(pred))' pred(:)];
csvwrite('LiFeiteng0824.csv', M)
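oneVsAll and predictOneVsAll are not reproduced in the post (they follow the Coursera ML ex3 interface). A minimal sketch under that assumption, using the Optimization Toolbox's fminunc in place of the course's fmincg; each function would live in its own .m file:

function all_theta = oneVsAll(X, y, num_labels, lambda)
% Train one regularized logistic-regression classifier per class (one-vs-all).
[m, n] = size(X);
all_theta = zeros(num_labels, n + 1);
Xb = [ones(m, 1) X];                          % prepend a bias column
options = optimset('GradObj', 'on', 'MaxIter', 50);
for c = 1:num_labels
    all_theta(c, :) = fminunc(@(t) lrCost(t, Xb, double(y == c), lambda), ...
                              zeros(n + 1, 1), options)';
end
end

function [J, grad] = lrCost(theta, Xb, t, lambda)
% Regularized logistic-regression cost and gradient (bias term not regularized).
m = size(Xb, 1);
h = 1 ./ (1 + exp(-Xb * theta));
J = -mean(t .* log(h) + (1 - t) .* log(1 - h)) ...
    + lambda/(2*m) * sum(theta(2:end).^2);
grad = Xb' * (h - t) / m + lambda/m * [0; theta(2:end)];
end

function pred = predictOneVsAll(all_theta, X)
% For each example, pick the class whose classifier scores highest.
[~, pred] = max([ones(size(X,1), 1) X] * all_theta', [], 2);
end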

=============== Method 2: 784-200-200-10 Sparse AutoEncoder ===================

%% STEP 0: Set the relevant parameter values
tic
inputDim = 28;
inputSize = 28 * 28;
numClasses = 10;
hiddenSizeL1 = 200;     % Layer 1 Hidden Size
hiddenSizeL2 = 200;     % Layer 2 Hidden Size
sparsityParam = 0.1;    % desired average activation of the hidden units
                        % (denoted by rho in the lecture notes)
lambda = 3e-3;          % weight decay parameter
beta = 3;               % weight of sparsity penalty term
maxIter = 100;

%% STEP 1: Load data
load digitData
trainData = train_x';
[~, trainLabels] = max(train_y, [], 2);
%%% augment the data?  %%% ZCA whitening changes the pixel value range []
% trainData = ZCAWhite(trainData);

%% STEP 2: Train the first sparse autoencoder
sae1Theta = initializeParameters(hiddenSizeL1, inputSize);
options.Method = 'lbfgs';
options.maxIter = 200;    % Maximum number of iterations of L-BFGS to run
options.display = 'on';
[sae1OptTheta, cost] = minFunc( @(p) sparseAutoencoderCost(p, ...
                                inputSize, hiddenSizeL1, ...
                                lambda, sparsityParam, ...
                                beta, trainData), ...
                                sae1Theta, options);
% -------------------------------------------------------------------------
W1 = reshape(sae1OptTheta(1:hiddenSizeL1*inputSize), hiddenSizeL1, inputSize);
display_network(W1', 12);

%% STEP 3: Train the second sparse autoencoder
[sae1Features] = feedForwardAutoencoder(sae1OptTheta, hiddenSizeL1, ...
                                        inputSize, trainData);
% Randomly initialize the parameters
sae2Theta = initializeParameters(hiddenSizeL2, hiddenSizeL1);
options.Method = 'lbfgs';
options.maxIter = 100;    % Maximum number of iterations of L-BFGS to run
options.display = 'on';
[sae2OptTheta, cost] = minFunc( @(p) sparseAutoencoderCost(p, ...
                                size(sae1Features,1), hiddenSizeL2, ...
                                lambda, sparsityParam, ...
                                beta, sae1Features), ...
                                sae2Theta, options);

%% STEP 4: Train the softmax classifier
[sae2Features] = feedForwardAutoencoder(sae2OptTheta, hiddenSizeL2, ...
                                        hiddenSizeL1, sae1Features);
% Randomly initialize the parameters
saeSoftmaxTheta = 0.005 * randn(hiddenSizeL2 * numClasses, 1);
lambda = 1e-4;
options.maxIter = 200;
softmaxModel = softmaxTrain(hiddenSizeL2, numClasses, lambda, ...
                            sae2Features, trainLabels, options);
% -------------------------------------------------------------------------
saeSoftmaxOptTheta = softmaxModel.optTheta(:);

%% STEP 5: Finetune the whole stacked model
% Initialize the stack using the parameters learned above
stack = cell(2,1);
stack{1}.w = reshape(sae1OptTheta(1:hiddenSizeL1*inputSize), ...
                     hiddenSizeL1, inputSize);
stack{1}.b = sae1OptTheta(2*hiddenSizeL1*inputSize+1:2*hiddenSizeL1*inputSize+hiddenSizeL1);
stack{2}.w = reshape(sae2OptTheta(1:hiddenSizeL2*hiddenSizeL1), ...
                     hiddenSizeL2, hiddenSizeL1);
stack{2}.b = sae2OptTheta(2*hiddenSizeL2*hiddenSizeL1+1:2*hiddenSizeL2*hiddenSizeL1+hiddenSizeL2);
% Initialize the parameters for the deep model
[stackparams, netconfig] = stack2params(stack);
stackedAETheta = [ saeSoftmaxOptTheta ; stackparams ];
options.Method = 'lbfgs';
options.maxIter = 400;    % Maximum number of iterations of L-BFGS to run
options.display = 'on';
[stackedAEOptTheta, cost] = minFunc( @(p) stackedAECost(p, ...
                                     inputSize, hiddenSizeL2, ...
                                     numClasses, netconfig, ...
                                     lambda, trainData, trainLabels), ...
                                     stackedAETheta, options);
% -------------------------------------------------------------------------

%% STEP 6: Test
testData = test_x';
[~, testLabels] = max(test_y, [], 2);
[pred] = stackedAEPredict(stackedAETheta, inputSize, hiddenSizeL2, ...
                          numClasses, netconfig, testData);
acc = mean(testLabels(:) == pred(:));
fprintf('Before Finetuning Test Accuracy: %0.3f%%\n', acc * 100);
[pred] = stackedAEPredict(stackedAEOptTheta, inputSize, hiddenSizeL2, ...
                          numClasses, netconfig, testData);
acc = mean(testLabels(:) == pred(:));
fprintf('After Finetuning Test Accuracy: %0.3f%%\n', acc * 100);
toc

%% write csv file
pred(pred==10) = 0;       % class 10 encodes digit 0
tmp = [(1:length(pred))' pred(:)];
csvwrite('LiFeiteng0824.csv', tmp)

The test accuracy reported here is measured against KNN-based pseudo-labels (pred-labels), not ground truth.
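The helpers used above (initializeParameters, sparseAutoencoderCost, feedForwardAutoencoder, softmaxTrain, stack2params, stackedAECost, stackedAEPredict, minFunc, display_network) come from the UFLDL stacked-autoencoder exercise and are not reproduced in the post. For orientation, here are minimal sketches of two of them, assuming the [W1(:); W2(:); b1(:); b2(:)] parameter packing that the indexing above implies; the finished exercise code should be preferred over these sketches:

function [cost, grad] = sparseAutoencoderCost(theta, visibleSize, hiddenSize, ...
                                              lambda, sparsityParam, beta, data)
% Squared-error reconstruction cost + weight decay + KL sparsity penalty,
% with gradients computed by backpropagation.
W1 = reshape(theta(1:hiddenSize*visibleSize), hiddenSize, visibleSize);
W2 = reshape(theta(hiddenSize*visibleSize+1:2*hiddenSize*visibleSize), visibleSize, hiddenSize);
b1 = theta(2*hiddenSize*visibleSize+1:2*hiddenSize*visibleSize+hiddenSize);
b2 = theta(2*hiddenSize*visibleSize+hiddenSize+1:end);

m = size(data, 2);
sigmoid = @(z) 1 ./ (1 + exp(-z));
a2 = sigmoid(bsxfun(@plus, W1*data, b1));      % hidden activations
a3 = sigmoid(bsxfun(@plus, W2*a2,  b2));       % reconstruction

rho    = sparsityParam;
rhoHat = mean(a2, 2);                          % average activation of each hidden unit
klTerm = sum(rho*log(rho./rhoHat) + (1-rho)*log((1-rho)./(1-rhoHat)));

cost = 0.5/m * sum(sum((a3 - data).^2)) ...
     + lambda/2 * (sum(W1(:).^2) + sum(W2(:).^2)) ...
     + beta * klTerm;

d3 = -(data - a3) .* a3 .* (1 - a3);
sparsityDelta = beta * (-rho./rhoHat + (1-rho)./(1-rhoHat));
d2 = (W2'*d3 + repmat(sparsityDelta, 1, m)) .* a2 .* (1 - a2);

W1grad = d2*data'/m + lambda*W1;
W2grad = d3*a2'/m   + lambda*W2;
grad = [W1grad(:); W2grad(:); mean(d2, 2); mean(d3, 2)];
end

function activation = feedForwardAutoencoder(theta, hiddenSize, visibleSize, data)
% Hidden-layer activations of a trained autoencoder (used as features above).
W1 = reshape(theta(1:hiddenSize*visibleSize), hiddenSize, visibleSize);
b1 = theta(2*hiddenSize*visibleSize+1:2*hiddenSize*visibleSize+hiddenSize);
activation = 1 ./ (1 + exp(-bsxfun(@plus, W1*data, b1)));
end

The beta term is what drives the average hidden activation rhoHat toward sparsityParam; with beta = 0 this reduces to a plain weight-decayed autoencoder.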

=============== Method 3: 784-200-200-10 Stacked Denoising AutoEncoder ===================

Using DeepLearnToolbox.
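DeepLearnToolbox is Rasmus Berg Palm's MATLAB toolbox (https://github.com/rasmusbergpalm/DeepLearnToolbox); it only needs to be on the MATLAB path. The folder name below is an assumption about where it was cloned:

% Put DeepLearnToolbox (saesetup, saetrain, nnsetup, nntrain, nntest, visualize)
% on the path; adjust the folder to your local clone.
addpath(genpath('DeepLearnToolbox'));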

%%
clear
close all
clc

%% load data & labels
load digitData
%%% pre-processing

%% ex2: train a X-X hidden unit SDAE and use it to initialize a FFNN
% Setup and train a stacked denoising autoencoder (SDAE)
rng(0);
nDim = [784 200 200];
sae = saesetup(nDim);
sae.ae{1}.activation_function = 'sigm';
sae.ae{1}.learningRate = 1;
sae.ae{1}.inputZeroMaskedFraction = 0.5;
sae.ae{2}.activation_function = 'sigm';
sae.ae{2}.learningRate = 1;
sae.ae{2}.inputZeroMaskedFraction = 0.5;
% sae.ae{3}.activation_function = 'sigm';
% sae.ae{3}.learningRate = 0.8;
% sae.ae{3}.inputZeroMaskedFraction = 0.5;
opts.numepochs = 30;
opts.batchsize = 100;
% opts.sparsityTarget = 0.05;   %$LiFeiteng
% opts.nonSparsityPenalty = 1;
opts.dropoutFraction = 0.5;
sae = saetrain(sae, train_x, opts);
visualize(sae.ae{1}.W{1}(:,2:end)')

%% Use the SDAE to initialize a FFNN
nn = nnsetup([nDim 10]);
nn.activation_function = 'sigm';
nn.learningRate = 1;
% add pretrained weights
nn.W{1} = sae.ae{1}.W{1};
nn.W{2} = sae.ae{2}.W{1};
% nn.W{3} = sae.ae{3}.W{1};

% Train the FFNN
fprintf('\n')
opts.numepochs = 40;
opts.batchsize = 100;
nn = nntrain(nn, train_x, train_y, opts);

%% test
[er, bad, pred] = nntest(nn, test_x, test_y);
pred(pred==10) = 0;
tmp = [(1:length(pred))' pred(:)];
csvwrite('LiFeiteng0824.csv', tmp)
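Note that csvwrite cannot write a header row, while the Digit Recognizer submission format expects an ImageId,Label header, so a small fprintf-based writer (a sketch, reusing the same file name) can be used instead:

% Write the submission with the header row Kaggle expects.
fid = fopen('LiFeiteng0824.csv', 'w');
fprintf(fid, 'ImageId,Label\n');
fprintf(fid, '%d,%d\n', [1:length(pred); pred(:)']);
fclose(fid);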

State of the art!

==================================================================

Ranked 200-something, so disheartening!!!

There are plenty of 100% scores on the Leaderboard. I could get there too, by cheating: go through the misclassified cases one by one by eye and fix the test labels. But that would be pointless.

The best published results are listed on THE MNIST DATABASE page maintained by Y. LeCun (http://yann.lecun.com/exdb/mnist/).

==============================

Ways to improve accuracy:

1. Increase the number of training samples by constructing new images from the originals via translation, rotation, etc. (a sketch follows this list);

2. Preprocess the images; note that applying PCA or ZCA whitening directly changes the pixel value range (a ZCA sketch also follows the list);

3. Bring convolution and pooling into the deep network;

4. New models...
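For item 1, a minimal augmentation sketch: imrotate assumes the Image Processing Toolbox, imgs is assumed to be N x 784 with pixel values in [0,1], and augmentDigits is a name made up here:

function aug = augmentDigits(imgs)
% Build one randomly shifted and rotated copy of each 28x28 image.
n = size(imgs, 1);
aug = zeros(n, 784);
for i = 1:n
    im = reshape(imgs(i, :), 28, 28);
    im = circshift(im, [randi([-2 2]) randi([-2 2])]);        % small translation
    im = imrotate(im, 10*(2*rand - 1), 'bilinear', 'crop');   % rotation in [-10, 10] degrees
    aug(i, :) = reshape(im, 1, 784);
end
end

For item 2, one common formulation of the ZCAWhite helper that Method 2 references but does not show; epsilon is a regularization constant, and whitened values are no longer confined to [0,1], which is the range change mentioned above:

function xZCA = ZCAWhite(x)
% ZCA whitening of x (inputSize x numExamples).
epsilon = 0.1;
x = bsxfun(@minus, x, mean(x, 2));            % remove the per-pixel mean
sigma = x * x' / size(x, 2);                  % pixel covariance matrix
[U, S, ~] = svd(sigma);
xZCA = U * diag(1 ./ sqrt(diag(S) + epsilon)) * U' * x;
end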
