2013-04-18 101 views
2

我有一个产生数据struicture看起来像这样的代码:过滤列表的列表python,如何创建最终列表?

{'AttributeId': '4192', 
'AttributeList': '', 
'ClassId': '1014 (AP)', 
'InstanceId': '0', 
'MessageType': '81 (GetAttributesResponse)', 
'ObjectInstance': '', 
'Protocol': 'BSMIS Rx', 
'RDN': '', 
'TransactionId': '66', 
'Sequences': [[], 
       [1,'2013-02-26T15:01:11Z'], 
       [], 
       [10564,13,388,0,-321,83,'272','05',67,67,708,896,31,128,-12,-109,0,-20,-111,-1,-1,0], 
       [10564,13,108,0,-11,83,'272','05',67,67,708,1796,31,128,-12,-109,0,-20,-111,-1,-1,0], 
       [10589,16,388,0,-15,79,'272','05',67,67,708,8680,31,125,-16,-110,0,-20,-111,-1,-1,0], 
       [10589,15,108,0,-16,81,'272','05',67,67,708,8105,31,126,-14,-109,0,-20,-111,-1,-1,0], 
       [10637,40,233,0,-11,89,'272','03',30052,1,5,54013,33,103,-6,-76,1,-20,-111,-1,-1,0], 
       [10662,46,234,0,-15,85,'272','03',30052,1,5,54016,33,97,-10,-74,1,-20,-111,-1,-1,0], 
       [10712,51,12,0,-24,91,'272','01',4013,254,200,2973,3,62,-4,-63,0,-20,-111,-1,-1,0], 
       [10737,15,224,0,-16,82,'272','01',3020,21,21,40770,33,128,-13,-108,0,-20,-111,-1,-1,0], 
       [10762,14,450,0,-7,78,'272','01',3020,21,21,53215,29,125,-17,-113,0,-20,-111,-1,-1,0], 
       [10762,15,224,0,-7,85,'272','01',3020,21,21,50770,33,128,-10,-105,0,-20,-111,-1,-1,0], 
       [10762,14,124,0,-7,78,'272','01',3020,10,10,56880,32,128,-17,-113,0,-20,-111,-1,-1,0], 
       [10812,11,135,0,-14,81,'272','02',36002,1,11,43159,31,130,-14,-113,1,-20,-111,-1,-1,0], 
       [10837,42,23,0,-9,89,'272','02',36002,1,11,53529,31,99,-6,-74,1,-20,-111,-1,-1,0,54], 
       [13,'2013-02-26T15:02:09Z'], 
       [], 
       [2,12,7,0,9,70,'272','02',20003,0,0,15535,0,0,0,0,1,100,100,-1,-1,0], 
       [5,15,44,0,-205,77,'272','02',20003,0,0,15632,0,0,0,0,1,100,100,-1,-1,0], 
       [7,25,9,0,0,84,'272','02',20002,0,0,50883,0,0,0,0,1,100,100,-1,-1,0]] 
} 

我然后过滤下来,使相关值的列表,我只想要的序列的第一个2个元素;如果长度为> = 22。我这样做如下:

len22seqs = filter(lambda s: len(s)>=22, data['Sequences']) 
UARFCNRSSI = [] 
for i in range(len(len22seqs)): 
    UARFCNRSSI.append([len22seqs[i][0], len22seqs[i][1]]) 

过滤列表中的一个例子是:

[[10564, 15], [10564, 13], [10589, 18], [10637, 39], [10662, 38], [10712, 50], [10737, 15], [10762, 14], [10787, 9], [10812, 12], [10837, 45], [3, 17], [7, 21], [46, 26], [48, 12], [49, 24], [64, 14], [66, 17], [976, 27], [981, 22], [982, 22], [983, 17], [985, 13], [517, 9], [521, 15], [525, 11], [526, 13], [528, 14], [698, 14], [788, 24], [792, 19]] 

但是现在我注意到,我需要在每个子列表的第三个元素。 即是这样的:

[1,'2013-02-26T15:01:11Z'], 

我需要的2长度每列表的第一个元件被附加到这个过滤的列表作为第三元件,用于跟随的元素。但是当有一个长度为2的新列表时,我需要将这个新值附加到后续条目上。

所以我的最终名单例子可能看起来像,当发现另一个列表长度为2注意,第三个元素的变化,以13:

[[10564, 15, 1], [10564, 13, 1], [10589, 18, 1], [10637, 39, 1], [10662, 38, 1], [10837, 45, 1], [3, 17, 13], [7, 21, 13], [46, 26, 13], etc] 

我该怎么办呢?我是否必须用len> = 22和len = 2过滤两次,并且为len> = 22单独过滤,因为我不想将元素0或1追加到长度为2的列表的最终列表中。

回答

4

我会尽量做到可读性:

UARFCNRSSI = [] 
x = None  # future "third element"; please choose a better name 
for item in data["Sequences"]: 
    if len(item) == 2: 
     x = item[0] 
    elif len(item) >= 22: 
     UARFCNRSSI.append([item[0], item[1], x]) 
+0

我想访问它..在for循环之前定义'x'不是r equired? – namit

+2

@namit:只是一个预防措施。你确定在第一个22元素列表之前总会有一个两元素列表吗? –

+0

谢谢,比我计划好得多的方式,更清洁和更简单,更易于阅读。首先总是会有一个2元素列表是,数据是cli中命令的输出,并且在生成另一个列表之前,2元素列表必须在那里。然而,你是非常正确的,如果它不在那里,由于产出我没有预见到它最好准备! – Paul

3

我会与发电机去筛选数据:

def filterdata(sequences): 
    add = [] 
    for item in sequences:   
     if len(item) == 2: 
      add = [item[0]] 
     elif len(item) >= 22: 
      yield [item[0], item[1]] + add 

您可以像data = list(filterdata(data['Sequences']))

+0

谢谢,我喜欢它! – Paul