2016-07-16 35 views
2

我想生成一个2列data.frame,但我想一定比每个二项式COL1 - col2上是独一无二的......[R data.frame与非重复二项式

n <- 1000 
w <- seq(from = 0, to = n-1, by = 1) 
x <- sample(0:99, n, replace=T) 
y <- sample(0:99, n, replace=T) 
input.df <- data.frame(xcor = sample(0:99, n, replace=T), y = sample(0:99, n, replace=T)) 

此时这显然行不通

summary(duplicated(input.df)) 

Mode FALSE TRUE NA's 
logical  972  28  0 

而且我想

summary(duplicated(input.df)) 

Mode FALSE TRUE NA's 
logical  1000  0  0 

任何诀窍是欢迎!

回答

2

你太亲近了!您只需在括号内使用duplicated并使用您使用!指定而不是重复。

n <- 1000 
w <- seq(from = 0, to = n-1, by = 1) 
x <- sample(0:99, n, replace=T) 
y <- sample(0:99, n, replace=T) 
input.df <- data.frame(xcor = sample(0:99, n, replace=T), y = sample(0:99, n, replace=T)) 

summary(duplicated(input.df[!duplicated(input.df),])) 
Mode FALSE NA's 
logical  958  0 

这相当于unique()宋哲元如提及。

final_data <- input.df[!duplicated(input.df),] 

final_data <- unique(input.df) 
+0

谢谢你,但它没有完成...也许我还没有指定比我想有1000点意见:-S – delaye