我有大约3000行的数据集。该数据可以经由https://pastebin.com/i4dYCUQX总和()在dplyr和汇总:NA值
问题进行访问:NA导致输出,虽然似乎没有NA中的数据。这是当我尝试通过dplyr或聚集总结一列中的每个类别总价值会发生什么:
example <- read.csv("https://pastebin.com/raw/i4dYCUQX", header=TRUE, sep=",")
example
# dplyr
example %>% group_by(size) %>% summarize_at(vars(volume), funs(sum))
Out:
# A tibble: 4 x 2
size volume
<fctr> <int>
1 Extra Large NA
2 Large NA
3 Medium 937581572
4 Small NA
# aggregate
aggregate(volume ~ size, data=example, FUN=sum)
Out:
size volume
1 Extra Large NA
2 Large NA
3 Medium 937581572
4 Small NA
当试图通过colSums
访问的价值,它似乎工作:
# Colsums
small <- example %>% filter(size == "Small")
colSums(small["volume"], na.rm = FALSE, dims = 1)
Out:
volume
3869267348
谁能想象的问题可能是什么?
好了,我相信了_Warning messages_相当翔实:'[...]整数溢出 - 使用和' – Henrik