Sparse PCA: reproduction of the synthetic example

The paper:

Hui Zou, Trevor Hastie, and Robert Tibshirani,

Sparse Principal Component Analysis,

Journal of computational and Graphical Statistics, 15(2): 265-286, 2006.

Reproduction of the Synthetic Example in Section 5.2 using R programming:

 library(elasticnet)

 ## sample version of SPCA

 n =

 v1 = rnorm(n,,sqrt())

 v2 = rnorm(n,,sqrt())

 v3 = -.*v1 + 0.925*v2 + rnorm(n)

 x1 = v1 + rnorm(n)

 x2 = v1 + rnorm(n)

 x3 = v1 + rnorm(n)

 x4 = v1 + rnorm(n)

 x5 = v2 + rnorm(n)

 x6 = v2 + rnorm(n)

 x7 = v2 + rnorm(n)

 x8 = v2 + rnorm(n)

 x9 = v3 + rnorm(n)

 x10 = v3 + rnorm(n)

 x = cbind(x1,x2,x3,x4,x5,x6,x7,x8,x9,x10)

 x.cov = t(x) %*% x/n; head(x.cov)

 a = spca(x, , type='predictor', sparse='varnum', para=c(,), lambda=)

 a

 ## population version of SPCA

 g1 = matrix(, , )

 diag(g1) = 

 g2 = matrix(, , )

 diag(g2) = 

 g3 = matrix(283.7875, , )

 diag(g3) = diag(g3)+

 g1g3 = matrix(-, , )

 g2g3 = matrix(277.5, , )

 # construct the exact covariance matrix

 x.cov = matrix(, , )

 x.cov[:,:] = g1

 x.cov[:,:] = g2

 x.cov[:,:] = g3

 x.cov[:,:] = g1g3

 x.cov[:,:] = t(g1g3)

 x.cov[:,:] = g2g3

 x.cov[:,:] = t(g2g3)

 b = spca(x.cov, , type='Gram', sparse='varnum', para=c(,), lambda=)

 b

The results of the population version using exact covariance matrix are exactly as in the paper:

> b

Call:

spca(x = x.cov, K = , para = c(, ), type = "Gram", sparse = "varnum",

    lambda = )

 sparse PCs

Pct. of exp. var. : 40.9 39.5

Num. of non-zero loadings :

Sparse loadings

      PC1 PC2

 [,] 0.0 0.5

 [,] 0.0 0.5

 [,] 0.0 0.5

 [,] 0.0 0.5

 [,] 0.5 0.0

 [,] 0.5 0.0

 [,] 0.5 0.0

 [,] 0.5 0.0

 [,] 0.0 0.0

[,] 0.0 0.0

But the sample version may randomly vary a little.

> a

Call:

spca(x = x, K = , para = c(, ), type = "predictor", sparse = "varnum",

    lambda = )

 sparse PCs

Pct. of exp. var. : 37.9 37.6

Num. of non-zero loadings :

Sparse loadings

       PC1    PC2

x1   0.000 -0.303

x2   0.000 -0.533

x3   0.000 -0.576

x4   0.000 -0.540

x5  -0.492  0.000

x6  -0.287  0.000

x7  -0.481  0.000

x8  -0.666  0.000

x9   0.000  0.000

x10  0.000  0.000

Having fun learning sparse PCA!

Sparse PCA: reproduction of the synthetic example的更多相关文章

Deflation Methods for Sparse PCA
目录背景总括 Hotelling's deflation 公式特点 Projection deflation 公式特点 Schur complement deflation Orthogona ...
Spectral Bounds for Sparse PCA: Exact and Greedy Algorithms[贪婪算法选特征]
目录概括 Sparse PCA Formulation 非常普遍的问题 Optimality Conditions Eigenvalue Bounds 算法代码概括这篇论文,不像以往的那些论文 ...
Sparse PCA 稀疏主成分分析
Sparse PCA 稀疏主成分分析 2016-12-06 16:58:38 qilin2016 阅读数 15677 文章标签: 统计学习算法更多分类专栏: Machine Learning ...
A direct formulation for sparse PCA using semidefinite programming
目录背景 Sparse eigenvectors(单个向量的稀疏化) 初始问题(low-rank的思想?) 等价问题最小化$\lambda$ 得到下列问题(易推) 再来一个等价问题条件放松( ...
主成分分析（PCA）原理总结
主成分分析(Principal components analysis,以下简称PCA)是最重要的降维方法之一.在数据压缩消除冗余和数据噪音消除等领域都有广泛的应用.一般我们提到降维最容易想到的算法就 ...
Python机器学习笔记使用scikit-learn工具进行PCA降维
之前总结过关于PCA的知识:深入学习主成分分析(PCA)算法原理.这里打算再写一篇笔记,总结一下如何使用scikit-learn工具来进行PCA降维. 在数据处理中,经常会遇到特征维度比样本数量多得多 ...
深入学习主成分分析（PCA）算法原理（Python实现）
一:引入问题首先看一个表格,下表是某些学生的语文,数学,物理,化学成绩统计: 首先,假设这些科目成绩不相关,也就是说某一科目考多少分与其他科目没有关系,那么如何判断三个学生的优秀程度呢?首先我们一眼 ...
Sparse Principal Component Analysis
目录背景: 部分符号创新点文章梗概 The LASSO AND THE ELASTIC NET 将PCA改造为回归问题定理二单个向量(无需进行SVD版本) 定理三多个向量(无需进行SVD, ...
Full Regularization Path for Sparse Principal Component Analysis
目录背景 Notation Sparse PCA Semidefinite Relaxation Low Rank Optimization Sorting and Thresholding 背景 ...

随机推荐

python3.5+django2.0快速入门(一)
因为这篇教程需要用到anaconda的一些操作,如果还不懂anaconda的操作的同学可以看下这篇文章python 入门学习之anaconda篇. 创建python3+的开发环境直接在终端输入:co ...
事件驱动体系结构(EDA)
redis两种持久化的方法
Redis是一种高级key-value数据库.它跟memcached类似,不过数据可以持久化,而且支持的数据类型很丰富.有字符串,链表,集合和有序集合.支持在服务器端计算集合的并,交和补集(diff ...
python之callable
callback是python的内置函数英文说明: callable(object) Return True If the object argument appears callable,Fals ...
[Bzoj2004][Hnoi2010]Bus 公交线路(状压dp&&矩阵加速)
题目链接:https://www.lydsy.com/JudgeOnline/problem.php?id=2004 看了很多大佬的博客才理解了这道题,菜到安详QAQ 在不考虑优化的情况下,先推$dp ...
Codeforces 1047C (线性筛+因数分解)
题面传送门分析 1.暴力做法首先先把每个数除以gcd(a1,a2-,an)gcd(a_1,a_2 \dots,a_n )gcd(a1,a2-,an) 可以O(namax)O(n\sqrt ...
P2586 [ZJOI2008]杀蚂蚁（模拟）
P2586 [ZJOI2008]杀蚂蚁大模拟. 什么都不想补了. 看变量名感性理解吧 #include<iostream> #include<cstdio> #include ...
HDU 1009 FatMouse' Trade题解
版权声明:本文作者靖心,靖空间地址:http://blog.csdn.net/kenden23/.未经本作者同意不得转载. https://blog.csdn.net/kenden23/article ...
log.info（）传入多个参数的方法
不知道项目里用的是啥 ** 版本的 log4j 居然不能传入变长参数 logger.info(String.format("%s %s %s", username, feature ...
2018-10-2-win10-uwp-win2d-特效
title author date CreateTime categories win10 uwp win2d 特效 lindexi 2018-10-02 21:20:46 +0800 2018-6- ...

Sparse PCA: reproduction of the synthetic example

Sparse PCA: reproduction of the synthetic example的更多相关文章

随机推荐

热门专题