2011-12-28 102 views
0

我有一个由时间组成的因子变量。当我试图把它变成一个因素时,R按从低到高的顺序排列它们。 Unforunately,这意味着我最终像下面这样:按时间排序矢量

1:48:55 1:53:11 1:58:18 10:04:01 10:12:45 10:14:16 
10:20:24 10:21:59 10:23:11 10:23:40 10:30:40 10:34:13 
10:39:47 10:54:33 10:58:43 3:21:59 3:27:38 3:37:58 
3:49:00 4:20:42 4:30:52 4:37:39 4:40:08 4:40:55 
4:42:25 5:03:25 5:10:30 5:10:49 5:12:39 5:14:20 

因此,而不是打印为了与时俱进,与10点00分时间7点之前来了。

有没有一种快速的方法来按实际时间排序?

time = c("1:48:55", "1:53:11", "1:58:18", "10:04:01", "10:12:45", "10:14:16", "10:20:24", "10:21:59", "10:23:11", "10:23:40", "10:30:40", "10:34:13", "10:39:47", "10:54:33", "10:58:43", "3:21:59", "3:27:38", "3:37:58", "3:49:00", "4:20:42", "4:30:52", "4:37:39", "4:40:08", "4:40:55", "4:42:25", "5:03:25", "5:10:30", "5:10:49", "5:12:39", "5:14:20") 

time = factor(time) 

回答

3

试试这个:

library(chron) 
sort(times(time)) 

或者这个,如果你想排序的载体仍然是一个因素:

time[order(times(time))] 
2

的项目都在他们的顺序,所以你用factor,他们将留在该订单,但你可以设置的方式水平,将让回归报告这些“正常”。如果你想要的时间值,你需要选择一个时间类的工作他们得到整理

require(gtools) 
mixedsort(time) 
[1] "1:48:55" "1:53:11" "1:58:18" "3:21:59" "3:27:38" "3:37:58" "3:49:00" "4:20:42" "4:30:52" 
[10] "4:37:39" "4:40:08" "4:40:55" "4:42:25" "5:03:25" "5:10:30" "5:10:49" "5:12:39" "5:14:20" 
[19] "10:04:01" "10:12:45" "10:14:16" "10:20:24" "10:21:59" "10:23:11" "10:23:40" "10:30:40" "10:34:13" 
[28] "10:39:47" "10:54:33" "10:58:43" 

> time=factor(time, levels= mixedsort(time)) 
> levels(time) 
[1] "1:48:55" "1:53:11" "1:58:18" "3:21:59" "3:27:38" "3:37:58" "3:49:00" "4:20:42" "4:30:52" 
[10] "4:37:39" "4:40:08" "4:40:55" "4:42:25" "5:03:25" "5:10:30" "5:10:49" "5:12:39" "5:14:20" 
[19] "10:04:01" "10:12:45" "10:14:16" "10:20:24" "10:21:59" "10:23:11" "10:23:40" "10:30:40" "10:34:13" 
[28] "10:39:47" "10:54:33" "10:58:43" 
> time 
[1] 1:48:55 1:53:11 1:58:18 10:04:01 10:12:45 10:14:16 10:20:24 10:21:59 10:23:11 10:23:40 10:30:40 
[12] 10:34:13 10:39:47 10:54:33 10:58:43 3:21:59 3:27:38 3:37:58 3:49:00 4:20:42 4:30:52 4:37:39 
[23] 4:40:08 4:40:55 4:42:25 5:03:25 5:10:30 5:10:49 5:12:39 5:14:20 
30 Levels: 1:48:55 1:53:11 1:58:18 3:21:59 3:27:38 3:37:58 3:49:00 4:20:42 4:30:52 4:37:39 ... 10:58:43 

:如预期gtools的mixedsort功能将这样的特征向量排序。该代下包是一个可能的选择:

time = c("1:48:55", "1:53:11", "1:58:18", "10:04:01", "10:12:45", "10:14:16", "10:20:24", "10:21:59", "10:23:11", "10:23:40", "10:30:40", "10:34:13", "10:39:47", "10:54:33", "10:58:43", "3:21:59", "3:27:38", "3:37:58", "3:49:00", "4:20:42", "4:30:52", "4:37:39", "4:40:08", "4:40:55", "4:42:25", "5:03:25", "5:10:30", "5:10:49", "5:12:39", "5:14:20") 

> chron(times=time) 
[1] 01:48:55 01:53:11 01:58:18 10:04:01 10:12:45 10:14:16 10:20:24 10:21:59 10:23:11 10:23:40 10:30:40 
[12] 10:34:13 10:39:47 10:54:33 10:58:43 03:21:59 03:27:38 03:37:58 03:49:00 04:20:42 04:30:52 04:37:39 
[23] 04:40:08 04:40:55 04:42:25 05:03:25 05:10:30 05:10:49 05:12:39 05:14:20 
> sort(chron(times=time)) 
[1] 01:48:55 01:53:11 01:58:18 03:21:59 03:27:38 03:37:58 03:49:00 04:20:42 04:30:52 04:37:39 04:40:08 
[12] 04:40:55 04:42:25 05:03:25 05:10:30 05:10:49 05:12:39 05:14:20 10:04:01 10:12:45 10:14:16 10:20:24 
[23] 10:21:59 10:23:11 10:23:40 10:30:40 10:34:13 10:39:47 10:54:33 10:58:43 
+0

短另外:注意因素无定义!但是,由于R具有您需要的所有内容,因此数据类型为“ordered”,这只是定义顺序的一个因素。如果你在DWINS的答案中用'ordered'替换'factor',你的数据类型将会更有意义(但是,如果你只显示这个因子,那么它们之间没有明显的差别,但是在某些统计方法中它会。 – Thilo 2011-12-28 19:44:37

+1

我对有序因素的经验是,他们在任何地方播下了混乱,所以我不推荐他们去新手。 – 2011-12-28 19:47:35