2012-07-31 63 views
26

我正在尝试向我的data.table添加列,其中名称是动态的。另外,我需要在添加这些列时使用by参数。例如:data.table中的动态列名

test_dtb <- data.table(a = sample(1:100, 100), b = sample(1:100, 100), id = rep(1:10,10)) 
cn <- parse(text = "blah") 
test_dtb[ , eval(cn) := mean(a), by = id] 

# Error in `[.data.table`(test_dtb, , `:=`(eval(cn), mean(a)), by = id) : 
# LHS of := must be a single column name when with=TRUE. When with=FALSE the LHS may be a vector of column names or positions. 

的另一种尝试:从马修

cn <- "blah" 
test_dtb[ , cn := mean(a), by = id, with = FALSE] 
# Error in `[.data.table`(test_dtb, , `:=`(cn, mean(a)), by = id, with = FALSE) : 'with' must be TRUE when 'by' or 'keyby' is provided 

更新:

这现在在v1.8.3的R-伪造。感谢您的突出!
见新的实例中,这类似的问题:

Assign multiple columns using data.table, by group

回答

22

data.table 1.9.4,你可以这样做:

## A parenthesized symbol, `(cn)`, gets evaluated to "blah" before `:=` is carried out 
test_dtb[, (cn) := mean(a), by = id] 
head(test_dtb, 4) 
#  a b id blah 
# 1: 41 19 1 54.2 
# 2: 4 99 2 50.0 
# 3: 49 85 3 46.7 
# 4: 61 4 4 57.1 

详细?:=

DT[i, (colvector) := val]

[...] NOW PREFERRED语法。这些包具足以阻止LHS成为一种象征;同c(colvector)


原来的答复:

你是在完全正确的轨道:构建表达呼叫内进行评估,以[.data.table是做data.table方式这种事情。进一步说,为什么不构建一个表达式,其计算结果为整个参数(而不仅仅是其左手侧)?

像这样的东西应该做的伎俩:

## Your code so far 
library(data.table) 
test_dtb <- data.table(a=sample(1:100, 100),b=sample(1:100, 100),id=rep(1:10,10)) 
cn <- "blah" 

## One solution 
expr <- parse(text = paste0(cn, ":=mean(a)")) 
test_dtb[,eval(expr), by=id] 

## Checking the result 
head(test_dtb, 4) 
#  a b id blah 
# 1: 30 26 1 38.4 
# 2: 83 82 2 47.4 
# 3: 47 66 3 39.5 
# 4: 87 23 4 65.2 
+0

太棒了,谢谢。我可以发誓我尝试了这种变化,但显然我没有。非常感谢。 – Alex 2012-07-31 17:52:09

+0

+1将此问题的链接添加到了[FR#2120](https://r-forge.r-project.org/tracker/index.php?func=detail&aid=2120&group_id=240&atid=978)。似乎会出现很多。 – 2012-08-09 13:26:34

15

表达可以用bquote构造。

cn <- "blah" 
expr <- bquote(.(as.name(cn)):=mean(a)) 
test_dtb[,eval(expr), by=id] 
+1

比做“动态data.tabling”好得多 – Juancentro 2013-09-12 18:34:47

+0

很好的答案,非常有用和灵活的方法。 +1! – marbel 2017-04-19 15:49:57