我列出了大约100,000次出现的项目被排列在一起,我已粘贴到一列中,因此我可以统计每个组合出现的次数。
R中列表的出现次数
4845 Curly Fries California Burger 1
4846 French Fries California Burger 1
4847 Hamburger California Burger 1
4848 $1 Fountain Drinks Curly Fries 1
4849 $1 Fountain Drinks Curly Fries 1
4850 California Burger Curly Fries 1
4851 Curly Fries Curly Fries 1
我已经探索了聚合函数使我有以下错误:
aggregate(t1$count,list(t1$pc), sum) <br>
Error in sort.list(y) : 'x' must be atomic for 'sort.list'
Have you called 'sort' on a list? <br>
我也曾尝试ddply的变化:
ddply(t1,t1$pc,transform,occurances=sum(t1$count))
但我得到这个错误
Error in UseMethod("as.quoted") :
no applicable method for 'as.quoted' applied to an object of class "c('matrix', 'list')"
我假设我得到这个,因为我试图通过一个字符值本质上“组”。我也根据对类似问题的回答探讨了tapply
和recast
,但无济于事。
我怎样才能得到这个组合数?
的考虑,单独列出的项目(再次,道歉的格式问题)的一个样本:
Var1 Var2 Var3
>2 Onion Rings Onion Rings 1
>3 Pineapple Cheddar Burger Onion Rings 1
>4 Onion Rings Pineapple Cheddar Burger 1
>5 Pineapple Cheddar Burger Pineapple Cheddar Burger 1
>5 Onion Rings Onion Rings 1
>6 Pineapple Cheddar Burger Onion Rings 1
>7 Onion Rings Pineapple Cheddar Burger 1
>8 Pineapple Cheddar Burger Pineapple Cheddar Burger 1
>9 Fountain Soda Fountain Soda 1
>10 French Fries Fountain Soda 1
你可以使用'table'函数:'table(t1 $ pc)'。 – Justin 2013-02-26 19:51:36