线性回归和批量梯度下降法python

通过学习斯坦福公开课的线性规划和梯度下降，参考他人代码自己做了测试，写了个类以后有时间再去扩展，代码注释以后再加，作业好多：

import numpy as np

import matplotlib.pyplot as plt

import random

class dataMinning:

    datasets = []

    labelsets = []

    addressD = ''  #Data folder

    addressL = ''  #Label folder

    npDatasets = np.zeros(1)

    npLabelsets = np.zeros(1)

    cost = []

    numIterations = 0

    alpha = 0

    theta = np.ones(2)

    #pCols = 0

    #dRows = 0

    def __init__(self,addressD,addressL,theta,numIterations,alpha,datasets=None):

        if datasets is None:

            self.datasets = []

        else:

            self.datasets = datasets

        self.addressD = addressD

        self.addressL = addressL

        self.theta = theta

        self.numIterations = numIterations

        self.alpha = alpha

    def readFrom(self):

        fd = open(self.addressD,'r')

        for line in fd:

            tmp = line[:-1].split()

            self.datasets.append([int(i) for i in tmp])

        fd.close()

        self.npDatasets = np.array(self.datasets)

        fl = open(self.addressL,'r')

        for line in fl:

            tmp = line[:-1].split()

            self.labelsets.append([int(i) for i in tmp])

        fl.close()

        tm = []

        for item in self.labelsets:

            tm = tm + item

        self.npLabelsets = np.array(tm)

    def genData(self,numPoints,bias,variance):

        self.genx = np.zeros(shape = (numPoints,2))

        self.geny = np.zeros(shape = numPoints)

        for i in range(0,numPoints):

            self.genx[i][0] = 1

            self.genx[i][1] = i

            self.geny[i] = (i + bias) + random.uniform(0,1) * variance

    def gradientDescent(self):

        xTrans = self.genx.transpose() #

        i = 0

        while i < self.numIterations:

            hypothesis = np.dot(self.genx,self.theta)

            loss = hypothesis - self.geny

            #record the cost

            self.cost.append(np.sum(loss ** 2))

            #calculate the gradient

            gradient = np.dot(xTrans,loss)

            #updata, gradientDescent

            self.theta = self.theta - self.alpha * gradient

            i = i + 1

    def show(self):

        print 'yes'

if __name__ == "__main__":

    c = dataMinning('c:\\city.txt','c:\\st.txt',np.ones(2),100000,0.000005)

    c.genData(100,25,10)

    c.gradientDescent()

    cx = range(len(c.cost))

    plt.figure(1)

    plt.plot(cx,c.cost)

    plt.ylim(0,25000)

    plt.figure(2)

    plt.plot(c.genx[:,1],c.geny,'b.')

    x = np.arange(0,100,0.1)

    y = x * c.theta[1] + c.theta[0]

    plt.plot(x,y)

    plt.margins(0.2)

    plt.show()

　　　　　　　　　　图1. 迭代过程中的误差cost

　　　　　　　　　　图2. 数据散点图和解直线

参考资料：

1.python编写类：http://blog.csdn.net/wklken/article/details/6313265

2.python中if __name__ == __main__的用法：http://www.cnblogs.com/herbert/archive/2011/09/27/2193482.html

3.matplotlab gallery:http://matplotlib.org/gallery.html

4.python批量梯度下降参考代码：http://www.91r.net/ask/17784587.html

线性回归和批量梯度下降法python的更多相关文章

【Python】机器学习之单变量线性回归利用批量梯度下降找到合适的参数值
[Python]机器学习之单变量线性回归利用批量梯度下降找到合适的参数值本题目来自吴恩达机器学习视频. 题目: 你是一个餐厅的老板,你想在其他城市开分店,所以你得到了一些数据(数据在本文最下方), ...
线性回归（最小二乘法、批量梯度下降法、随机梯度下降法、局部加权线性回归） C++
We turn next to the task of finding a weight vector w which minimizes the chosen function E(w). Beca ...
1. 批量梯度下降法BGD 2. 随机梯度下降法SGD 3. 小批量梯度下降法MBGD
排版也是醉了见原文:http://www.cnblogs.com/maybe2030/p/5089753.html 在应用机器学习算法时,我们通常采用梯度下降法来对采用的算法进行训练.其实,常用的梯度 ...
梯度下降法VS随机梯度下降法 (Python的实现)
# -*- coding: cp936 -*- import numpy as np from scipy import stats import matplotlib.pyplot as plt # ...
matlib实现梯度下降法
样本文件下载:ex2Data.zip ex2x.dat文件中是一些2-8岁孩子的年龄. ex2y.dat文件中是这些孩子相对应的体重. 我们尝试用批量梯度下降法,随机梯度下降法和小批量梯度下降法来对这 ...
梯度下降法的python代码实现（多元线性回归）
梯度下降法的python代码实现(多元线性回归最小化损失函数) 1.梯度下降法主要用来最小化损失函数,是一种比较常用的最优化方法,其具体包含了以下两种不同的方式:批量梯度下降法(沿着梯度变化最快的方向 ...
梯度下降法实现（Python语言描述）
原文地址:传送门 import numpy as np import matplotlib.pyplot as plt %matplotlib inline plt.style.use(['ggplo ...
梯度下降法原理与python实现
梯度下降法(Gradient descent)是一个一阶最优化算法,通常也称为最速下降法. 要使用梯度下降法找到一个函数的局部极小值,必须向函数上当前点对应梯度(或者是近似梯度)的反方向的规定步长距离 ...
机器学习中梯度下降法原理及用其解决线性回归问题的C语言实现
本文讲梯度下降(Gradient Descent)前先看看利用梯度下降法进行监督学习(例如分类.回归等)的一般步骤: 1, 定义损失函数(Loss Function) 2, 信息流forward pr ...

随机推荐

redis入门配置
简介: Redis是Nosql中比较出名的,分布式数据库缓存,提升相应的速度,降低对数据库的访问! Redis是一种高级key-value数据库.它跟memcached类似,不过数据可以持久化,(永久 ...
Vim块注释
如何在VIM下快速注释块代码添加块注释 01.进入视图模式 v进入视图模式,控制方向键选中注释的代码 02.进入列模式并插入# ctrl+v进入列,I插入注释# 03.全部注释 esc两次自动全部注 ...
LINQ日常使用记录
1.公司一位美女程序媛写的 2.技术总监提供(来自互联网) var query = from f in db.TField join fw in db.TFieldWel on f.emp_no eq ...
js和jQuery的日常
让当前页面显示整个屏幕 - (iframe 从后台跳转到前台的时候容易多层嵌套)$(document).ready(function(){ if(window.top != window.self) ...
“CEPH浅析”系列之八——小结
最初决定写这些文章的时候,本打算大致记录一下,几千字也就了事了.可是越写越觉得东西多,不说明白总有些不甘心,于是就越写越长,到这儿为止貌似已经有一万七千多字了.除了博士论文之外,应该是没有写过更长的东 ...
51nod 1422(强行YY)
1422 沙拉酱前缀题目来源: CodeForces 基准时间限制:1 秒空间限制:131072 KB 分值: 40 难度:4级算法题收藏关注沙拉酱非常喜欢数字序列.这正是他要弄一个关于 ...
一起学HTML基础-CSS样式表-基本概念、分类、选择器
一.基本概念: CSS (Cascading Style Sheets)层叠样式表,是一种用来表现HTML(标准通用标记语言的一个应用)或XML(标准通用标记语言的一个子集)等文件样式的计算机语言. ...
BZOJ 1853: [Scoi2010]幸运数字
1853: [Scoi2010]幸运数字 Time Limit: 2 Sec Memory Limit: 64 MBSubmit: 2117 Solved: 779[Submit][Status] ...
linux命令语法格式
一.命令的一般格式 command [option]... [argument]... command [options] [arguments] 具体说明: 1.command: 表示命令的名称,如 ...
K-means之matlab实现
引入作为练手,不妨用matlab实现K-means 要解决的问题:n个D维数据进行聚类(无监督),找到合适的簇心. 这里仅考虑最简单的情况,数据维度D=2,预先知道簇心数目K(K=4) 理论步骤关 ...

线性回归和批量梯度下降法python

线性回归和批量梯度下降法python的更多相关文章

随机推荐

热门专题