2016-09-13 98 views
-1

我想要生成从多变量高斯分布中绘制的数据点集群。随机集群生成

我希望所有的聚类具有相同的协方差矩阵。我正在使用clusterGeneration包中的命令genRandomClust。

但不幸的是,上述命令会生成具有不同协方差矩阵的群集。有人能指出我能实现这一目标的一个命令或包吗?

回答

0

这个怎么样?这里我们从双变量正态随机分布中抽取样本。

library(MASS) 
Sigma <- matrix(c(10,3,3,2),2,2) # start with a fixed 2x2 covariance matrix, or generate it randomly 
Sigma 
datapoints <- mvrnorm(n = 1000, rep(0, 2), Sigma) # draw 1000 MVN samples with cov matrix Sigma, with mean c(0, 0) or use some random mean vector 
var(datapoints) # notice that the covariance matrix for the datapoint generated is slightly different from Sigma 
datapoints <- mvrnorm(n = 1000, rep(0, 2), Sigma, empirical = TRUE) 
var(datapoints) # here the covariance matrix for the datapoint generated is exactly equal to Sigma