K nearest neighbor cs229

vectorized code 带来的好处。

 import numpy as np

 from sklearn.datasets import fetch_mldata

 import time

 import matplotlib.pyplot as plt

 mnist = fetch_mldata('MNIST original')

 X = mnist.data.astype(float)

 Y = mnist.target.astype(float) 

 mask = np.random.permutation(range(np.shape(X)[0]))

 num_train = 10000

 num_test = 500

 K = 10

 X_train = X[mask[:num_train]]

 Y_train = Y[mask[:num_train]]

 X_mean = np.mean(X_train,axis = 0)

 X_train = (X_train-X_mean)/255

 X_test = X[mask[num_train:num_train+num_test]]

 X_test = (X_test - X_mean)/255

 Y_test = Y[mask[num_train:num_train+num_test]]

 print('X_train',X_train.shape)

 print('Y_train',Y_train.shape)

 print('X_test',X_test.shape)

 print('Y_test',Y_test.shape)

 ex_image = (np.reshape(X_train[10,:]*255 + X_mean, (28, 28))).astype(np.uint8)

 plt.imshow(ex_image, interpolation='nearest')

 # **Computing the distance matrix (num_test x num_train)**

 # Version 1 (Naive implementation using two for loops)

 start = time.time()

 dists_1 = np.zeros((num_test,num_train))

 for i in xrange(num_test):

     for j in xrange(num_train):

           dists_1[i,j] = np.sqrt(np.square(np.sum(X_test[i,:]-X_train[j,:])))

 stop = time.time()

 time_taken = stop-start

 print('Time taken with two for loops: {}s'.format(time_taken))

 # Version 2(Somewhat better implementation using one for loop)

 start = time.time()

 dists_2 = np.zeros((num_test,num_train))

 for i in xrange(num_test):

           dists_2[i,:] = np.sqrt(np.square(np.sum(X_test[i,:]-X_train,axis = 1)))

 stop = time.time()

 time_taken = stop-start

 print('Time taken with just one for loop: {}s'.format(time_taken))

 # Version 3 (Fully vectorized implementation with no for loop)

 start = time.time()

 dists_3 = np.zeros((num_test,num_train))

 A = np.sum(np.square(X_test),axis = 1)

 B = np.sum(np.square(X_train),axis = 1)

 C = np.dot(X_test,X_train.T)

 dists_3 = np.sqrt(A[:,np.newaxis]+B[np.newaxis,:]-2*C)

 stop = time.time()

 time_taken = stop-start

 print('Time taken with no for loops: {}s'.format(time_taken))

 sorted_dist_indices = np.argsort(dists_3,axis = 1)

 closest_k = Y_train[sorted_dist_indices][:,:K].astype(int)

 Y_pred = np.zeros_like(Y_test)

 for i in xrange(num_test):

       Y_pred[i] = np.argmax(np.bincount(closest_k[i,:]))

 accuracy = (np.where(Y_test-Y_pred == 0)[0].size)/float(num_test)

 print('Prediction accuracy: {}%'.format(accuracy*100))

K nearest neighbor cs229的更多相关文章

K Nearest Neighbor 算法
文章出处:http://coolshell.cn/articles/8052.html K Nearest Neighbor算法又叫KNN算法,这个算法是机器学习里面一个比较经典的算法, 总体来说KN ...
K NEAREST NEIGHBOR 算法(knn)
K Nearest Neighbor算法又叫KNN算法,这个算法是机器学习里面一个比较经典的算法, 总体来说KNN算法是相对比较容易理解的算法.其中的K表示最接近自己的K个数据样本.KNN算法和K-M ...
K-Means和K Nearest Neighbor
来自酷壳: http://coolshell.cn/articles/7779.html http://coolshell.cn/articles/8052.html
Nearest neighbor graph | 近邻图
最近在开发一套自己的单细胞分析方法,所以copy paste事业有所停顿. 实例: R eNetIt v0.1-1 data(ralu.site) # Saturated spatial graph ...
【cs231n】图像分类-Nearest Neighbor Classifier(最近邻分类器)【python3实现】
[学习自CS231n课程] 转载请注明出处:http://www.cnblogs.com/GraceSkyer/p/8735908.html 图像分类: 一张图像的表示:长度.宽度.通道(3个颜色通道 ...
[机器学习系列] k-近邻算法（K–nearest neighbors）
C++ with Machine Learning -K–nearest neighbors 我本想写C++与人工智能,但是转念一想,人工智能范围太大了,我根本介绍不完也没能力介绍完,所以还是取了他的 ...
Visualizing MNIST with t-SNE, MDS, Sammon’s Mapping and Nearest neighbor graph
MNIST 可视化 Visualizing MNIST: An Exploration of Dimensionality Reduction At some fundamental level, n ...
Nearest Neighbor Search
## Nearest Neighbor Search ## Input file: standard input Output file: standard output Time limit: 1 ...
K近邻（K Nearest Neighbor-KNN）原理讲解及实现
算法原理 K最近邻(k-Nearest Neighbor)算法是比较简单的机器学习算法.它采用测量不同特征值之间的距离方法进行分类.它的思想很简单:如果一个样本在特征空间中的k个最近邻(最相似)的样本 ...

随机推荐

安装gnocchi
在控制节点上执行 #!/bin/bash MYSQL_ROOT_PASSWD='m4r!adbOP' GNOCCHI_PASSWD='gnocchi1234!' CEILOMETER_PASSWD=' ...
thinkPHP 出现route不起作用提示No input file specified.
修改.htaccess文件原因在于使用的PHP是fast_cgi模式,而在某些情况下,不能正确识别path_info所造成的错误. 打开.htaccess 在RewriteRule 后面的index ...
【C/C++开发】try-cache-finnally捕获异常
在c++中,可以直接抛出异常之后自己进行捕捉处理,如:(这样就可以在任何自己得到不想要的结果的时候进行中断,比如在进行数据库事务操作的时候,如果某一个语句返回SQL_ERROR则直接抛出异常,在cat ...
【C/C++开发】【Java开发】JNI的替代者—使用JNA访问Java外部功能接口
JNI的替代者-使用JNA访问Java外部功能接口 1. JNA简单介绍先说JNI(Java Native Interface)吧,有过不同语言间通信经历的一般都知道,它允许Java代码和其他语言( ...
Mac下使用sshpass让iterm2支持多ssh登录信息保存
mac下没有xshell等连接linux服务器的工具,当需要管理的服务器越来越多之后,密码管理就成了一个很头疼的问题,每次都需要去复制粘贴密码,浪费了很多时间,在网上查了不少资料,发现mac下可以使用 ...
python pip换源方法
以下资料来源于网络: pip国内的一些镜像阿里云 http://mirrors.aliyun.com/pypi/simple/ 中国科技大学 https://pypi.mirrors.ust ...
快速排序的js实现
该方法的基本思想是: 1．先从数列中取出一个数作为基准数. 2．分区过程,将比这个数大的数全放到它的右边,小于或等于它的数全放到它的左边. 3．再对左右区间重复第二步,直到各区间只有一个数. var ...
docker容器创建MariaDB镜像
基于commit命令方式创建 docker的安装 [root@test01 ~]# yum install docker [root@test01 ~]# systemctl enable docke ...
oracle -- 查询执行计划，判读查询语句优劣
以oracle的scott账户:找到员工表中薪水大于本部门平均薪水的员工为例多表查询方式: select e.empno, e.ename, e.sal, d.avgsal from emp e, ...
Ruby Rails学习中：添加安全密码
接上篇一. 添加安全密码我们已经为 name 和 email 字段添加了验证规则, 现在要加入用户所需的最后一个常规属性: 安全密码.每个用户都要设置一个密码(还要二次确认), 数据库中则存储经过 ...

K nearest neighbor cs229

K nearest neighbor cs229的更多相关文章

随机推荐

热门专题