CheeseZH: Stanford University: Machine Learning Ex3: Multiclass Logistic Regression and Neural Network Prediction

Handwritten digits recognition (0-9)

Multi-class Logistic Regression

1. Vectorizing Logistic Regression

(1) Vectorizing the cost function

(2) Vectorizing the gradient

(3) Vectorizing the regularized cost function

(4) Vectorizing the regularized gradient

All above 4 formulas can be found in the previous blog: click here.

lrCostFunction.m

 function [J, grad] = lrCostFunction(theta, X, y, lambda)

 %LRCOSTFUNCTION Compute cost and gradient for logistic regression with

 %regularization

 %   J = LRCOSTFUNCTION(theta, X, y, lambda) computes the cost of using

 %   theta as the parameter for regularized logistic regression and the

 %   gradient of the cost w.r.t. to the parameters. 

 % Initialize some useful values

 m = length(y); % number of training examples

 % You need to return the following variables correctly

 J = ;

 grad = zeros(size(theta));

 % ====================== YOUR CODE HERE ======================

 % Instructions: Compute the cost of a particular choice of theta.

 %               You should set J to the cost.

 %               Compute the partial derivatives and set grad to the partial

 %               derivatives of the cost w.r.t. each parameter in theta

 %

 % Hint: The computation of the cost function and gradients can be

 %       efficiently vectorized. For example, consider the computation

 %

 %           sigmoid(X * theta)

 %

 %       Each row of the resulting matrix will contain the value of the

 %       prediction for that example. You can make use of this to vectorize

 %       the cost function and gradient computations.

 %

 % Hint: When computing the gradient of the regularized cost function,

 %       there're many possible vectorized solutions, but one solution

 %       looks like:

 %           grad = (unregularized gradient for logistic regression)

 %           temp = theta;

 %           temp() = ;   % because we don't add anything for j = 0

 %           grad = grad + YOUR_CODE_HERE (using the temp variable)

 %

 hx = sigmoid(X*theta);

 reg = lambda/(*m)*sum(theta(:size(theta),:).^);

 J = -/m*(y'*log(hx)+(1-y)'*log(-hx)) + reg;

 theta() = ;

 grad = /m*X'*(hx-y)+lambda/m*theta;

 % =============================================================

 grad = grad(:);

 end

2. One-vs-all Classification (Training)

Return all the classifier parameters in a matrix Θ (a K x N+1 matrix, K is the num_labels and N is the num_features ), where each row of Θ corresponds to the learned logistic regression parameters for one class. You can do this with a 'for'-loop from 1 to K, training each classifier independently.

oneVsAll.m

 function [all_theta] = oneVsAll(X, y, num_labels, lambda)

 %ONEVSALL trains multiple logistic regression classifiers and returns all

 %the classifiers in a matrix all_theta, where the i-th row of all_theta

 %corresponds to the classifier for label i

 %   [all_theta] = ONEVSALL(X, y, num_labels, lambda) trains num_labels

 %   logisitc regression classifiers and returns each of these classifiers

 %   in a matrix all_theta, where the i-th row of all_theta corresponds

 %   to the classifier for label i

 % Some useful variables

 m = size(X, );

 n = size(X, );

 % You need to return the following variables correctly

 all_theta = zeros(num_labels, n + );

 % Add ones to the X data matrix

 X = [ones(m, ) X];

 % ====================== YOUR CODE HERE ======================

 % Instructions: You should complete the following code to train num_labels

 %               logistic regression classifiers with regularization

 %               parameter lambda.

 %

 % Hint: theta(:) will return a column vector.

 %

 % Hint: You can use y == c to obtain a vector of 's and 0's that tell use

 %       whether the ground truth is true/false for this class.

 %

 % Note: For this assignment, we recommend using fmincg to optimize the cost

 %       function. It is okay to use a for-loop (for c = :num_labels) to

 %       loop over the different classes.

 %

 %       fmincg works similarly to fminunc, but is more efficient when we

 %       are dealing with large number of parameters.

 %

 % Example Code for fmincg:

 %

 %     % Set Initial theta

 %     initial_theta = zeros(n + , );

 %

 %     % Set options for fminunc

 %     options = optimset('GradObj', 'on', 'MaxIter', );

 %

 %     % Run fmincg to obtain the optimal theta

 %     % This function will return theta and the cost

 %     [theta] = ...

 %         fmincg (@(t)(lrCostFunction(t, X, (y == c), lambda)), ...

 %                 initial_theta, options);

 %

 for c=:num_labels,

   initial_theta = all_theta(c,:)';

   options = optimset('GradObj','on','MaxIter',);

   theta = fmincg(@(t)(lrCostFunction(t,X,(y==c),lambda)),initial_theta,options);

   all_theta(c,:) = theta';

 end;

 % =========================================================================

 end

3. One-vs-all Classification (Prediction)

predictOneVsAll.m

Neural Network Prediction

Feedword Propagation and Prediction

predict.m

 function p = predict(Theta1, Theta2, X)

 %PREDICT Predict the label of an input given a trained neural network

 %   p = PREDICT(Theta1, Theta2, X) outputs the predicted label of X given the

 %   trained weights of a neural network (Theta1, Theta2)

 % Useful values

 m = size(X, );

 num_labels = size(Theta2, );

 % You need to return the following variables correctly

 p = zeros(size(X, ), );

 % ====================== YOUR CODE HERE ======================

 % Instructions: Complete the following code to make predictions using

 %               your learned neural network. You should set p to a

 %               vector containing labels between  to num_labels.

 %

 % Hint: The max function might come in useful. In particular, the max

 %       function can also return the index of the max element, for more

 %       information see 'help max'. If your examples are in rows, then, you

 %       can use max(A, [], ) to obtain the max for each row.

 %

 a1 = X; %*

 a1 = [ones(size(X,), ),X]; %*

 a2 = sigmoid(a1*Theta1');%5000*25

 a2 = [ones(size(a2,),),a2]; %*

 a3 = sigmoid(a2*Theta2');%5000*10

 [tmp,p] = max(a3,[],);

 % =========================================================================

 end

Other files and dataset can be download in Coursera.

CheeseZH: Stanford University: Machine Learning Ex3: Multiclass Logistic Regression and Neural Network Prediction的更多相关文章

CheeseZH: Stanford University: Machine Learning Ex5:Regularized Linear Regression and Bias v.s. Variance
源码:https://github.com/cheesezhe/Coursera-Machine-Learning-Exercise/tree/master/ex5 Introduction: In ...
CheeseZH: Stanford University: Machine Learning Ex2:Logistic Regression
1. Sigmoid Function In Logisttic Regression, the hypothesis is defined as: where function g is the s ...
CheeseZH: Stanford University: Machine Learning Ex1:Linear Regression
(1) How to comput the Cost function in Univirate/Multivariate Linear Regression; (2) How to comput t ...
CheeseZH: Stanford University: Machine Learning Ex4:Training Neural Network(Backpropagation Algorithm)
1. Feedforward and cost function; 2.Regularized cost function: 3.Sigmoid gradient The gradient for t ...
[Machine Learning]学习笔记-Logistic Regression
[Machine Learning]学习笔记-Logistic Regression 模型-二分类任务 Logistic regression,亦称logtic regression,翻译为" ...
Andrew Ng Machine Learning 专题【Logistic Regression & Regularization】
此文是斯坦福大学,机器学习界 superstar - Andrew Ng 所开设的 Coursera 课程:Machine Learning 的课程笔记. 力求简洁,仅代表本人观点,不足之处希望大家探 ...
机器学习---朴素贝叶斯与逻辑回归的区别（Machine Learning Naive Bayes Logistic Regression Difference）
朴素贝叶斯与逻辑回归的区别: 朴素贝叶斯逻辑回归生成模型(Generative model) 判别模型(Discriminative model) 对特征x和目标y的联合分布P(x,y)建模,使用 ...
machine learning(10) -- classification:logistic regression cost function 和使用 gradient descent to minimize cost function
logistic regression cost function(single example) 图像分布 logistic regression cost function(m examples) ...
Machine Learning in Action -- Logistic regression
这个系列,重点关注如何实现,至于算法基础,参考Andrew的公开课相较于线性回归,logistic回归更适合用于分类因为他使用Sigmoid函数,因为分类的取值是0,1 对于分类,最完美和自然的函 ...

随机推荐

word-ladder总结
title: word ladder总结 categories: LeetCode tags: 算法 LeetCode comments: true date: 2016-10-16 09:42:30 ...
[OpenGL]纹理贴图实现总结
实现步骤第一步:设置所需要的OpenGL环境设置上下文环境删除已经存在的渲染的缓存设置颜色缓存设置帧缓存清除缓存设置窗口大小开启功能编译shander 使用program 获取sha ...
hihocoder #1015 KMP
#include<stdio.h> #include<iostream> #include<math.h> #include<string.h> usi ...
网络服务器搭建的那些事（PV QPS Throughput）转载
一.前言: 从事后台sever开发的同学,代码开发完成之后,上线之前,总会进行各种黑盒白盒测试,压测.正确性测试... 而测试同学,会给开发同学一份测试报告,需要开发同学进行确认...问题来了,里面好 ...
Lucene_索引(域)的查询
package cn.tz.lucene; import java.io.File; import org.apache.lucene.analysis.Analyzer; import org.ap ...
所谓jQuery.append()、jQuery.html()存在的XSS漏洞
使用jQuery.append().jQuery.html()方法时,如果其中内容包含<script>脚本而没有经过任何处理的话,会执行它. 简单的示例代码如下: var xssStr = ...
BZOJ 2002: [Hnoi2010]Bounce 弹飞绵羊（动态树LCT）
2002: [Hnoi2010]Bounce 弹飞绵羊 Time Limit: 10 Sec Memory Limit: 259 MBSubmit: 2843 Solved: 1519[Submi ...
虚拟信用卡全球付, 工商银行国际E卡, Bancore, Entropay, Payoneer
虚拟信用卡海外网购.购买国外域名空间.ebay等一些国外网站账号的激活这些情况都需要用到国际信用卡, 如果没有信用卡或者有信用卡但是对于安全性有所顾虑怎么办? 其实有一种东西叫做虚拟信用卡,正规银行 ...
sqlite - Sqlite Wrappers - Delphi
http://www.sqlite.org/cvstrac/wiki?p=SqliteWrappers Aducom's SQLite: Open source (NewBSD) Delphi (4. ...
jquery easyui combobox设置默认选中第一项
combobox的内容是从后台获取的json, js截取: var data = $('#id').combobox('getData'); $("#id ").combobox( ...

CheeseZH: Stanford University: Machine Learning Ex3: Multiclass Logistic Regression and Neural Network Prediction

CheeseZH: Stanford University: Machine Learning Ex3: Multiclass Logistic Regression and Neural Network Prediction的更多相关文章

随机推荐

热门专题