2017-01-27 95 views
0

我目前正在通过“R数据科学”一书。在R中输出作为数据帧的输出循环

我正在尝试解决这个练习题(21.2.1 Q1.4),但在开始for循环之前无法确定正确的输出。

写一个for循环: 生成每个的μ= -10,0,10和100

就像在书前面的问题,我一直在试图插入10层随机法线到一个向量输出,但对于这个例子,它似乎我需要输出是一个数据帧?

这是我到目前为止的代码:

values <- c(-10,0,10,100) 
output <- vector("double", 10) 

for (i in seq_along(values)) { 
    output[[i]] <- rnorm(10, mean = values[[i]]) 
} 

我知道输出是错误的,但我不能确定如何创建我需要这里的格式。任何帮助非常感谢。谢谢!

+0

可以使用'lapply'或'sapply'即'D1 < - data.frame(lapply(值,函数(x)的RNORM(10 ,mean = x))); colnames(d1)< - paste0(“V”,seq_along(values))' – akrun

+0

嗨,感谢您的解决方案。如果能够获得for循环解决方案也是一件好事,因为这是我目前旨在改进的一项技能。谢谢! – George

+0

您首先创建一个空矩阵或列表,然后为其分配值。即'output < - vector(“list”,4)';并在必要时使用你的代码和'cbind'或'rbind',即'do.call(cbind,output)' – akrun

回答

2

由于这是一个学习问题,我不会直接提供解决方案。

> values <- c(-10,0,10,100) 
> for (i in seq_along(values)) {print(i)} # Checking we iterate by position 
[1] 1 
[1] 2 
[1] 3 
[1] 4 
> output <- vector("double", 10) 
> output # Checking the place where the output will be 
[1] 0 0 0 0 0 0 0 0 0 0 
> for (i in seq_along(values)) { # Testing the full code 
+  output[[i]] <- rnorm(10, mean = values[[i]]) 
+ } 
Error in output[[i]] <- rnorm(10, mean = values[[i]]) : 
    more elements supplied than there are to replace 

正如你所看到的错误说,有更多的要素投入比空间(每个迭代产生10个随机数字,(在总共40个),你只有10位。考虑使用的数据格式,允许存储几个值对于每次迭代 这样:

> output <- ?? 
> for (i in seq_along(values)) { # Testing the full code 
+  output[[i]] <- rnorm(10, mean = values[[i]]) 
+ } 
> output # Should have length 4 and each element all the 10 values you created in the loop 
+0

使用上面的akrun的答案working:output < - vector(“list”,4)。这是你会做什么?感谢所涉及的思考过程的彻底解释!很有帮助! – George

+0

是或者简单地使用'output < - list()'。你可以通过点击勾号选择一个最有用的答案吗?这样的问题被标记为解决:D – Llopis

+0

非常感谢你! – George

2

有很多方法可以做到这一点。这是一个。请参阅行内评论。

set.seed(357) # to make things reproducible, set random seed 

N <- 10 # number of loops 

xy <- vector("list", N) # create an empty list into which values are to be filled 

# run the loop N times and on each loop... 
for (i in 1:N) { 
    # generate a data.frame with 4 columns, and add a random number into each one 
    # random number depends on the mean specified 
    xy[[i]] <- data.frame(um10 = rnorm(1, mean = -10), 
         u0 = rnorm(1, mean = 0), 
         u10 = rnorm(1, mean = 10), 
         u100 = rnorm(1, mean = 100)) 
} 

# result is a list of data.frames with 1 row and 4 columns 

# you can bind them together into one data.frame using do.call 
# rbind means they will be merged row-wise 
xy <- do.call(rbind, xy) 

     um10   u0  u10  u100 
1 -11.241117 -0.5832050 10.394747 101.50421 
2 -9.233200 0.3174604 9.900024 100.22703 
3 -10.469015 0.4765213 9.088352 99.65822 
4 -9.453259 -0.3272080 10.041090 99.72397 
5 -10.593497 0.1764618 10.505760 101.00852 
6 -10.935463 0.3845648 9.981747 100.05564 
7 -11.447720 0.8477938 9.726617 99.12918 
8 -11.373889 -0.3550321 9.806823 99.52711 
9 -7.950092 0.5711058 10.162878 101.38218 
10 -9.408727 0.5885065 9.471274 100.69328 

另一种方法是预先分配矩阵,添加值并将其强制为data.frame。

xy <- matrix(NA, nrow = N, ncol = 4) 

for (i in 1:N) { 
    xy[i, ] <- rnorm(4, mean = c(-10, 0, 10, 100)) 
} 

# notice that i name the column names post festum 
colnames(xy) <- c("um10", "u0", "u10", "u100") 
xy <- as.data.frame(xy) 
0
# set the number of rows 
rows <- 10 

# vector with the values 
means <- c(-10,0,10,100) 

# generating output matrix 
output <- matrix(nrow = rows, 
      ncol = 4) 

# setting seed and looping through the number of rows 
set.seed(222) 
for (i in 1:rows){ 
    output[i,] <- rnorm(length(means), 
        mean=means) 
} 

#printing the output 
output