2017-04-15

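Subsetting a data set based on a date range and a variable match

I have two different data sets, and I want to subset the first based on a combination of variables in the second. Specifically, I want to combine a date-range match with a direct variable match.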

I created the following data frames:

options(stringsAsFactors = FALSE) 


Loc<-rep("A",10) 
InEvent<-rep("IN",10) 
InDate<-c("2016-05-10","2016-05-20","2016-05-25","2016-06-10","2016-06-20","2016-07-05","2016-07-17","2016-07-27","2016-08-10","2016-08-20") 
InSN<-c("H1","H1","H1","H1","H1","H2","H2","H2","H2","H2") 
OutEvent<-rep("OUT",10) 
OutDate<-c("2016-05-15","2016-05-23","2016-06-02","2016-06-14","2016-06-26","2016-07-09","2016-07-26","2016-08-09","2016-08-19","2016-08-26") 
OutSN<-c("H1","H1","H1","H1","H1","H2","H2","H2","H2","H2") 

Cali<-data.frame(Loc,InEvent,InDate,InSN,OutEvent,OutDate,OutSN) 

Cali$InDate<-as.POSIXct(strptime(Cali$InDate,format="%Y-%m-%d", tz="UTC")) 

Cali$OutDate<-as.POSIXct(strptime(Cali$OutDate,format="%Y-%m-%d", tz="UTC")) 
Cali 



Sen<-rep("CL",20) 
Date<-c("2016-04-10","2016-05-11","2016-05-12","2016-05-13","2016-05-17","2016-05-26","2016-06-17","2016-06-27","2016-07-08","2016-07-20","2016-07-27","2016-08-01","2016-08-05","2016-08-07","2016-08-12","2016-08-15","2016-08-19","2016-08-20","2016-08-23","2016-09-20") 
SN<-c("H1","H1","H2","H5","H1","H1","H7","H2","H2","H2","H1","H2","H1","H5","H2","H5","H3","H1","5","H2") 


Data<-data.frame(Sen,Date,SN) 


Data$Date<-as.POSIXct(strptime(Data$Date,format="%Y-%m-%d", tz="UTC")) 
Data 

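In the final result, I only want the rows of the "Data" data frame that fall within a date range from "Cali" and that also match the H value in "InSN" and "OutSN".

For example, the first row of Cali has the range 2016-05-10 to 2016-05-15 and the SN value H1. So I only want the rows of "Data" that fall within this date range and have "H1" in the "SN" column.

The resulting data frame should include only the rows that satisfy both matching conditions (rows 2, 6, 9, 10, 12, 15).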

Answer

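library(dplyr)  # provides left_join(), filter() and the %>% pipe
library(magrittr)
# Pair each Cali window with the Data rows that share its serial number,
# then keep only the pairs whose Date lies inside [InDate, OutDate].
Data = left_join(Cali, Data, by = c("InSN" = "SN")) %>%
    filter(Date >= InDate, Date <= OutDate)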
This gives the subset of "Data" that satisfies both conditions, with the columns of "Cali" joined on.
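As a cross-check on the join above, here is a minimal base R sketch that flags each row of "Data" directly, so the result keeps only the original columns and row numbers of "Data". It is not part of the original answer. It assumes that matching on "InSN" alone is enough (in the sample data "OutSN" is always identical to "InSN"), and it stores the dates as `Date` rather than `POSIXct` for brevity. The vector `keep` and the data frame `Result` are names introduced here for illustration.

```r
# Windows from the question: one row per IN/OUT interval and serial number.
Cali <- data.frame(
  InSN    = rep(c("H1", "H2"), each = 5),
  InDate  = as.Date(c("2016-05-10", "2016-05-20", "2016-05-25", "2016-06-10",
                      "2016-06-20", "2016-07-05", "2016-07-17", "2016-07-27",
                      "2016-08-10", "2016-08-20")),
  OutDate = as.Date(c("2016-05-15", "2016-05-23", "2016-06-02", "2016-06-14",
                      "2016-06-26", "2016-07-09", "2016-07-26", "2016-08-09",
                      "2016-08-19", "2016-08-26")),
  stringsAsFactors = FALSE
)

# Observations from the question.
Data <- data.frame(
  Sen  = "CL",
  Date = as.Date(c("2016-04-10", "2016-05-11", "2016-05-12", "2016-05-13",
                   "2016-05-17", "2016-05-26", "2016-06-17", "2016-06-27",
                   "2016-07-08", "2016-07-20", "2016-07-27", "2016-08-01",
                   "2016-08-05", "2016-08-07", "2016-08-12", "2016-08-15",
                   "2016-08-19", "2016-08-20", "2016-08-23", "2016-09-20")),
  SN   = c("H1", "H1", "H2", "H5", "H1", "H1", "H7", "H2", "H2", "H2",
           "H1", "H2", "H1", "H5", "H2", "H5", "H3", "H1", "5", "H2"),
  stringsAsFactors = FALSE
)

# For each row of Data, check whether any Cali window has the same serial
# number and contains the row's date (both bounds inclusive).
keep <- vapply(seq_len(nrow(Data)), function(i) {
  any(Cali$InSN == Data$SN[i] &
      Cali$InDate <= Data$Date[i] &
      Data$Date[i] <= Cali$OutDate)
}, logical(1))

Result <- Data[keep, ]
which(keep)  # 2 6 9 10 12 15, the rows listed in the question
```

Because this loops over rows, it is only meant for small tables like the one here; the join-then-filter approach is likely the better fit for larger data.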