2013-08-27 42 views
0

我有8个变量(X1,X2 ... X8)的数据帧列表离群值在数据帧

我想使用获得的箱线图的离群值:

boxplot(dataframe, plot=FALSE)$out 

我的期望输出是让数据框列出每个变量的异常值。具体如下:

variable outlier 
x1  outlier1 from x1 
x1  outlier2 from x1 
x1  outlier3 from x1 
x1  outlier4 from x1 
x2  outlier1 from x2 
x2  outlier2 from x2 
x2  outlier3 from x2 
. 
. 
. 
x8  outliern from x8 

感谢您的支持,

回答

1

这是你想要的吗?

> testdata <- data.frame(x1=runif(1e3),x2=rnorm(1e3),x3=rnorm(1e3)) 
> temp <- boxplot(testdata,plot=F) 
> cbind(temp$group,temp$out) 
     [,1]  [,2] 
[1,] 2 2.765277 
[2,] 2 2.754730 
[3,] 2 -2.714811 
[4,] 2 3.257889 
[5,] 3 2.605549 
[6,] 3 -3.261950 
[7,] 3 -3.057532 
[8,] 3 2.820352 
[9,] 3 2.602933 
[10,] 3 2.580897 
[11,] 3 2.899350