Regression 手动实现Gradient Descent

import numpy as np

import matplotlib.pyplot as plt

x_data = [338.,333.,328.,207.,226.,25.,179.,60.,208.,606.]

y_data = [640.,633.,619.,393.,428.,27.,193.,66.,226.,1591.]

#y_data = w*x_data + b

x = np.arange(-200,-100,1)#bias

y = np.arange(-5,5,0.1)#weight

Z = np.zeros((len(x),len(y)))

X,Y = np.meshgrid(x,y)

for i in range(len(x)):

    for j in range(len(y)):

        b = x[i]

        w = y[j]

        Z[j][i] = 0

        for n in range(len(x_data)):

            Z[j][i] = Z[j][i] + (y_data[n] - b - w*x_data[n])**2

        Z[j][i] = Z[j][i]/len(x_data)

b = -120 #初始化b

w = -4 #初始化w

lr = 0.0000001 #learning rate

iteration = 100000

#作图保留

b_history = [b]

w_history = [w]

for i in range(iteration):

    b_grad = 0.0

    w_grad = 0.0

    for n in range(len(x_data)):#求导的和

        b_grad = b_grad - 2.0*(y_data[n] - b - w*x_data[n])*1.0

        w_grad = w_grad - 2.0*(y_data[n] - b - w*x_data[n])*x_data[n]

    #update

    b = b - lr*b_grad

    w = w - lr*w_grad

    #store for plotting

    b_history.append(b)

    w_history.append(w)

#plot

plt.contourf(x,y,Z,50,alpha=0.5,cmap=plt.get_cmap('jet'))

plt.plot([-188.4],[2.67],'x',ms=12,markeredgewidth=3,color='orange')

plt.plot(b_history, w_history, 'o-', ms=3, lw=1.5, color='black')

plt.xlim(-200,-100)

plt.ylim(-5,5)

plt.xlabel(r'$b$', fontsize=16)

plt.ylabel(r'$w$', fontsize=16)

plt.show()

显然没有搞好

用adaGrad

import numpy as np

import matplotlib.pyplot as plt

x_data = [338.,333.,328.,207.,226.,25.,179.,60.,208.,606.]

y_data = [640.,633.,619.,393.,428.,27.,193.,66.,226.,1591.]

#y_data = w*x_data + b

#Z是整个data的loss值

x = np.arange(-200,-100,1)#bias

y = np.arange(-5,5,0.1)#weight

Z = np.zeros((len(x),len(y)))

X,Y = np.meshgrid(x,y)

for i in range(len(x)):

    for j in range(len(y)):

        b = x[i]

        w = y[j]

        Z[j][i] = 0

        for n in range(len(x_data)):

            Z[j][i] = Z[j][i] + (y_data[n] - b - w*x_data[n])**2

        Z[j][i] = Z[j][i]/len(x_data)

b = -120 #初始化b

w = -4 #初始化w

lr = 1 #learning rate

iteration = 100000

#作图保留

b_history = [b]

w_history = [w]

lr_b = 0

lr_w = 0

for i in range(iteration):

    b_grad = 0.0

    w_grad = 0.0

    for n in range(len(x_data)):#求导的和

        b_grad = b_grad - 2.0*(y_data[n] - b - w*x_data[n])*1.0

        w_grad = w_grad - 2.0*(y_data[n] - b - w*x_data[n])*x_data[n]

    lr_b = lr_b + b_grad ** 2

    lr_w = lr_w + w_grad ** 2

    #update

    b = b - lr/np.sqrt(lr_b)*b_grad

    w = w - lr/np.sqrt(lr_w)*w_grad

    #store for plotting

    b_history.append(b)

    w_history.append(w)

#plot

plt.contourf(x,y,Z,50,alpha=0.5,cmap=plt.get_cmap('jet'))

plt.plot([-188.4],[2.67],'x',ms=12,markeredgewidth=3,color='orange')

plt.plot(b_history, w_history, 'o-', ms=3, lw=1.5, color='black')

plt.xlim(-200,-100)

plt.ylim(-5,5)

plt.xlabel(r'$b$', fontsize=16)

plt.ylabel(r'$w$', fontsize=16)

plt.show()

Regression 手动实现Gradient Descent的更多相关文章

线性回归、梯度下降（Linear Regression、Gradient Descent）
转载请注明出自BYRans博客:http://www.cnblogs.com/BYRans/ 实例首先举个例子,假设我们有一个二手房交易记录的数据集,已知房屋面积.卧室数量和房屋的交易价格,如下表: ...
Logistic Regression and Gradient Descent
Logistic Regression and Gradient Descent Logistic regression is an excellent tool to know for classi ...
斯坦福机器学习视频笔记 Week1 Linear Regression and Gradient Descent
最近开始学习Coursera上的斯坦福机器学习视频,我是刚刚接触机器学习,对此比较感兴趣:准备将我的学习笔记写下来, 作为我每天学习的签到吧,也希望和各位朋友交流学习. 这一系列的博客,我会不定期的更 ...
Logistic Regression Using Gradient Descent -- Binary Classification 代码实现
1. 原理 Cost function Theta 2. Python # -*- coding:utf8 -*- import numpy as np import matplotlib.pyplo ...
Linear Regression Using Gradient Descent 代码实现
参考吴恩达<机器学习>, 进行 Octave, Python(Numpy), C++(Eigen) 的原理实现, 同时用 scikit-learn, TensorFlow, dlib 进行 ...
斯坦福机器学习视频笔记 Week1 线性回归和梯度下降 Linear Regression and Gradient Descent
最近开始学习Coursera上的斯坦福机器学习视频,我是刚刚接触机器学习,对此比较感兴趣:准备将我的学习笔记写下来, 作为我每天学习的签到吧,也希望和各位朋友交流学习. 这一系列的博客,我会不定期的更 ...
flink 批量梯度下降算法线性回归参数求解（Linear Regression with BGD(batch gradient descent) ）
1.线性回归假设线性函数如下: 假设我们有10个样本x1,y1),(x2,y2).....(x10,y10),求解目标就是根据多个样本求解theta0和theta1的最优值. 什么样的θ最好的呢?最 ...
machine learning (7)---normal equation相对于gradient descent而言求解linear regression问题的另一种方式
Normal equation: 一种用来linear regression问题的求解Θ的方法,另一种可以是gradient descent 仅适用于linear regression问题的求解,对其 ...
machine learning(10) -- classification:logistic regression cost function 和使用 gradient descent to minimize cost function
logistic regression cost function(single example) 图像分布 logistic regression cost function(m examples) ...

随机推荐

百度NLP一面
C++ : 1.拷贝构造函数和重载=符分别在什么情况下被调用,实现有什么区别 2.虚函数的目的,虚函数和模板类的区别,如何找到虚函数常规算法: 1. 如何输出一个集合的所有真子集,递归和非递 ...
Java基础知识陷阱系列
Java基础知识陷阱系列今天抽空把Java基础知识陷阱有关的文章汇总于此,便于大家查看. Java基础知识陷阱(一) Java基础知识陷阱(二) Java基础知识陷阱(三) Java基础知识陷阱(四 ...
两种ajax的方法
两种Ajax方法 Ajax是一种用于快速创建动态网页的技术,他通过在后台与服务器进行少量的数据交换,可以实现网页的异步更新,不需要像传统网页那样重新加载页面也可以做到对网页的某部分作出更新,现在这项技 ...
将std::array转换成std::tuple
template<typename Array, std::size_t... Index> decltype(auto) array2tuple_impl(const Array& ...
hadoop yarn HA集群搭建
可先完成hadoop namenode HA的搭建:http://www.cnblogs.com/kisf/p/7458519.html 搭建yarnde HA只需要在namenode HA配置基础上 ...
Web服务器对比介绍
1.Apache Apache是非常强大的老牌Web服务器,具有模块化结构,拥有众多非常成熟稳定的模块,目前仍是使用非常广泛的服务器,但它是基于多进程HTTPServer,需要对每个用户请求创建一个子 ...
20145328 《Java程序设计》第10周学习总结
20145328 <Java程序设计>第10周学习总结资料学习内容总结网络编程 13.1 网络概述网络编程技术是当前一种主流的编程技术,随着联网趋势的逐步增强以及网络应用程序的大量出 ...
linux块设备读写流程
在学习块设备原理的时候,我最关系块设备的数据流程,从应用程序调用Read或者Write开始,数据在内核中到底是如何流通.处理的呢?然后又如何抵达具体的物理设备的呢?下面对一个带Cache功能的块设备数 ...
Python Matplotlib简易教程【转】
本文转载自:https://blog.csdn.net/Notzuonotdied/article/details/77876080 详情请见:Matplotlib python 数据可视化神器简单 ...
log4j:ERROR A "org.apache.log4j.DailyRollingFileAppender" object is not assignable to a "org.apache.log4j.Appender" variable.
多个classloader加载log4j时需要设置当前Thread的classloader为你自己的classloader Thread.currentThread().setContextClass ...

Regression 手动实现Gradient Descent

Regression 手动实现Gradient Descent的更多相关文章

随机推荐

热门专题