从零单排入门机器学习：线性回归（linear regression）实践篇

线性回归（linear regression）实践篇

之前一段时间在coursera看了Andrew ng的机器学习的课程，感觉还不错，算是入门了。

这次打算以该课程的作业为主线，对机器学习基本知识做一下总结。小弟才学疏浅，如有错误。敬请指导。

问题原描写叙述：

you will implement linear regression with one

variable to predict prots for a food truck. Suppose you are the CEO of a

restaurant franchise and are considering dierent cities for opening a new

outlet. The chain already has trucks in various cities and you have data for

prots and populations from the cities.

简单来说，就是依据一个城市的人口数量，来预測一辆快餐车能获得的利益。

数据集大概是这样子的：

watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvbGluZ2VybGFubGFu/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/SouthEast" alt="">

一行数据为一个样本。第一列表示人口，第二列表示利益。

首先。先把数据可视化。

%% ======================= Part 2: Plotting =======================

fprintf('Plotting Data ...\n')

data = load('ex1data1.txt');

X = data(:, 1); y = data(:, 2);

m = length(y); % number of training examples

% Plot Data

% Note: You have to complete the code in plotData.m

plotData(X, y);

fprintf('Program paused. Press enter to continue.\n');

pause;

function plotData(x, y)

%PLOTDATA Plots the data points x and y into a new figure

%   PLOTDATA(x,y) plots the data points and gives the figure axes labels of

%   population and profit.

% ====================== YOUR CODE HERE ======================

% Instructions: Plot the training data into a figure using the

%               "figure" and "plot" commands. Set the axes labels using

%               the "xlabel" and "ylabel" commands. Assume the

%               population and revenue data have been passed in

%               as the x and y arguments of this function.

%

% Hint: You can use the 'rx' option with plot to have the markers

%       appear as red crosses. Furthermore, you can make the

%       markers larger by using plot(..., 'rx', 'MarkerSize', 10);

figure; % open a new figure window

plot(x, y, 'rx', 'MarkerSize', 10); % Plot the data

ylabel('Profit in $10,000s'); % Set the y label

xlabel('Population of City in 10,000s'); % Set the x label

% ============================================================

end

watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvbGluZ2VybGFubGFu/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/SouthEast" alt="">

计算cost function

function J = computeCost(X, y, theta)

%COMPUTECOST Compute cost for linear regression

%   J = COMPUTECOST(X, y, theta) computes the cost of using theta as the

%   parameter for linear regression to fit the data points in X and y

% Initialize some useful values

m = length(y); % number of training examples

% You need to return the following variables correctly

% ====================== YOUR CODE HERE ======================

% Instructions: Compute the cost of a particular choice of theta

%               You should set J to the cost.

H = X*theta;

diff = H - y;

%J = sum(diff.^2)/(2*m);

J = sum(diff.*diff)/(2*m);

% =========================================================================

end

为了方便理解上面代码，看看各变量大概长什么样子的。

watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvbGluZ2VybGFubGFu/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/SouthEast" alt="">

梯度下降法计算參数theta

function [theta, J_history] = gradientDescent(X, y, theta, alpha, num_iters)

%GRADIENTDESCENT Performs gradient descent to learn theta

%   theta = GRADIENTDESENT(X, y, theta, alpha, num_iters) updates theta by

%   taking num_iters gradient steps with learning rate alpha

% Initialize some useful values

m = length(y); % number of training examples

J_history = zeros(num_iters, 1);

for iter = 1:num_iters

    % ====================== YOUR CODE HERE ======================

    % Instructions: Perform a single gradient step on the parameter vector

    %               theta.

    %

    % Hint: While debugging, it can be useful to print out the values

    %       of the cost function (computeCost) and gradient here.

    %

    H = X*theta-y;

    theta(1) = theta(1) - sum(H.* X(:,1))*alpha/m;%感觉这样写挺搓的

    theta(2) = theta(2) - sum(H.* X(:,2))*alpha/m;

    %theta = theta - alpha * (X' * (X * theta - y)) / m; 

    % ============================================================

    % Save the cost J in every iteration

    J_history(iter) = computeCost(X, y, theta);

end

end

难以理解的是theta = theta - alpha * (X' * (X * theta - y)) / m; 这样的向量化算法。

先看看theta本质是怎么计算的

再看看各变量长什么样子的

算出theta之后，就能够画出拟合直线了。

watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvbGluZ2VybGFubGFu/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/SouthEast" alt="">

注：本文作者linger，如有转载。请标明转载于http://blog.csdn.net/lingerlanlan。

本文链接:http://blog.csdn.net/lingerlanlan/article/details/32162559

从零单排入门机器学习：线性回归（linear regression）实践篇的更多相关文章

从零单排入门机器学习：Octave/matlab的经常使用知识之矩阵和向量
Octave/matlab的经常使用知识之矩阵和向量之前一段时间在coursera看了Andrew ng的机器学习的课程,感觉还不错.算是入门了.这次打算以该课程的作业为主线,对机器学习基本知识做一 ...
Stanford机器学习---第二讲. 多变量线性回归 Linear Regression with multiple variable
原文:http://blog.csdn.net/abcjennifer/article/details/7700772 本栏目(Machine learning)包括单参数的线性回归.多参数的线性回归 ...
机器学习（三）--------多变量线性回归(Linear Regression with Multiple Variables)
机器学习(三)--------多变量线性回归(Linear Regression with Multiple Variables) 同样是预测房价问题如果有多个特征值那么这种情况下假设h表示 ...
斯坦福CS229机器学习课程笔记 Part1：线性回归 Linear Regression
机器学习三要素机器学习的三要素为:模型.策略.算法. 模型:就是所要学习的条件概率分布或决策函数.线性回归模型策略:按照什么样的准则学习或选择最优的模型.最小化均方误差,即所谓的 least-sq ...
机器学习 (一) 单变量线性回归 Linear Regression with One Variable
文章内容均来自斯坦福大学的Andrew Ng教授讲解的Machine Learning课程,本文是针对该课程的个人学习笔记,如有疏漏,请以原课程所讲述内容为准.感谢博主Rachel Zhang的个人笔 ...
机器学习 (二) 多变量线性回归 Linear Regression with Multiple Variables
文章内容均来自斯坦福大学的Andrew Ng教授讲解的Machine Learning课程,本文是针对该课程的个人学习笔记,如有疏漏,请以原课程所讲述内容为准.感谢博主Rachel Zhang 的个人 ...
TensorFlow 学习笔记(1)----线性回归(linear regression)的TensorFlow实现
此系列将会每日持续更新,欢迎关注线性回归(linear regression)的TensorFlow实现 #这里是基于python 3.7版本的TensorFlow TensorFlow是一个机器学 ...
Ng第二课：单变量线性回归(Linear Regression with One Variable)
二.单变量线性回归(Linear Regression with One Variable) 2.1 模型表示 2.2 代价函数 2.3 代价函数的直观理解 2.4 梯度下降 2.5 梯度下 ...
斯坦福第二课：单变量线性回归(Linear Regression with One Variable)
二.单变量线性回归(Linear Regression with One Variable) 2.1 模型表示 2.2 代价函数 2.3 代价函数的直观理解 I 2.4 代价函数的直观理解 I ...

随机推荐

修改android手机文件权限
修改android手机文件权限默认情况下,一个应用肯定是读取不了另外一个应用的数据的,因为权限不够.但是我们一定要读,怎么办? 修改我们要读取文件的权限. Android是基于Linux的,所以修改 ...
AVL树、splay树(伸展树)和红黑树比较
AVL树.splay树(伸展树)和红黑树比较一.AVL树: 优点:查找.插入和删除,最坏复杂度均为O(logN).实现操作简单如过是随机插入或者删除,其理论上可以得到O(logN)的复杂度,但是实 ...
React-Native Android开发沉思录
在runServer.js中有声明如何启动http服务: 查看端口占用情况而且在系统管理器中,根本找不到PID为7956的应用,那能更改端口吗?在server.js中有声明: module.expo ...
ROS-TF-广播
前言:将海龟的坐标系变换广播到TF. URDF文件的描述是在相对坐标上进行的,运动起来就需要考虑机器人各个连杆的相对位置关系.TF的诞生就是为了自动管理这些相对关系下的坐标变换的,而我们需要做的就是给 ...
NOIP 2010 关押罪犯并查集二分+二分图染色
题目描述: S 城现有两座监狱,一共关押着N 名罪犯,编号分别为1~N.他们之间的关系自然也极不和谐.很多罪犯之间甚至积怨已久,如果客观条件具备则随时可能爆发冲突.我们用"怨气值" ...
Solr.NET快速入门(七)【覆盖默认映射器,NHibernate集成】
覆盖默认映射器默认情况下,SolrNet使用属性映射Solr字段. 但是,您可能需要使用另一个映射程序. 替换默认映射器取决于您如何设置库: 内置容器如果使用默认的内置容器,可以在调用Startu ...
showdialog
在C#中窗口的显示有两种方式:模态显示(showdialog)和非模态显示(show). 区别: 模态与非模态窗体的主要区别是窗体显示的时候是否可以操作其他窗体.模态窗体不允许操作其他窗体,非模态窗体 ...
Java语言基础(数组)
Java语言基础(数组概述和定义格式说明) A:为什么要有数组(容器) 为了存储同种数据类型的多个值 B:数组概念数组是存储同一种数据类型多个元素的集合.也可以看成是一个容器. 数组既可以存储基本数 ...
Walking on the path of Redis --- Introduction and Installation
废话开篇以前从来没听说过有Redis这么个玩意,无意间看到一位仁兄的博客,才对其有所了解,所以决定对其深入了解下.有不对的地方还请各位指正. Redis介绍下面是官方的介绍,不喜欢english的 ...
（转）shiro权限框架详解06-shiro与web项目整合(下)
http://blog.csdn.net/facekbook/article/details/54962975 shiro和web项目整合,实现类似真实项目的应用 web项目中认证 web项目中授权 ...

从零单排入门机器学习：线性回归（linear regression）实践篇

从零单排入门机器学习：线性回归（linear regression）实践篇的更多相关文章

随机推荐

热门专题