2017-09-14 19 views
-1

我被困了尝试使用OpenPyxl和xlsxwriter在Excel扩展行数过了一段时间。追加行的数据和相应的细胞

现在我的文件看起来像:

enter image description here

,我需要追加另一个数据,对应于形状和移动都记录下来,所以它看起来像:

enter image description here

我有我的数据['a','b','c']作为列表存储。我需要使用字典吗?或者任何简单的方法?

现在我的代码如下所示:

from openpyxl import load_workbook 
from openpyxl import Workbook 
import xlsxwriter 
import collections 

INPUT_FILE = 'output.xlsx' 



wb = load_workbook(filename=INPUT_FILE) 
ws = wb['cond'] 

workbook = xlsxwriter.Workbook('data.xlsx') 
worksheet = workbook.add_worksheet() 


def get_data(start, stop, col): 
    mylist = [ws.cell(row=i,column= col).value for i in range(start,stop)] 
    findings = [v for i,v in enumerate(mylist) if mylist.index(v) == i] 
    return findings 

def get_data_row(start, stop, row): 
    mylist = [ws.cell(row = row, column = i).value for i in range(start, stop)] 
    values = [v for i,v in enumerate(mylist) if mylist.index(v) == i] 
    return values 

def write_from_dict(row, col, dict): 
    row = row 
    col = col 
    for key in dict.keys(): 
     worksheet.write(row, col, key) 
     for item in dict[key]: 
      worksheet.write(row, col + 1, item) 
      row += 1 



if __name__ == '__main__': 

    mammo_findings_list = get_data(2,10, 11) 
    us_findings_list = get_data(10, 17, 11) 
    mri_findings_list = get_data(17, 27, 11) 

    mammo_params_list = [get_data(2, 5, 14), get_data(5,8,14), 
         get_data(8, 9, 14), get_data(9,10, 14)] 
    mammo_data = dict(zip(mammo_findings_list,mammo_params_list)) 


    mammo = write_from_dict(1, 1, mammo_data) 


    shape_val = get_data_row(15, 19, 2) 
    margin_val = get_data_row(15, 19, 3) 
    density_val = get_data_row(15, 19, 4) 

    mass_dict = dict(zip(mammo_params_list[0],shape_val,margin_val,density_val])) 
    print(mass_dict) 

    test = write_from_dict(1,2,mass_dict) #eventually didn't work for me now 

    workbook.close() 

词典输出:

{'Shape': ['Oval, Round', None, 'Irregular'], 'Margin': ['Circumscribed', None, 'Obscured, Microlobulated, Indistinct, Spiculated'], 'Density': ['High density', 'Equal density, Low density', None]} 
{'Mass': ['Shape', 'Margin', 'Density'], 'Calcifications': ['Typically benign', 'Suspicious morphology', 'Distribution'], 'Assymetry': ['Assymetry'], 'Lymph nodes': ['Lymph nodes']} 

而我只需要追加/莫名其妙地合并这些数据,并同时写入一切到Excel。

谢谢!

+2

并您的代码看起来像在看什么? –

+0

@CharlieClark编辑后。 –

+0

谢谢。你为什么使用这两个库的任何原因? openpyxl应该能够做你需要的一切。 –

回答

1

如果我们限制在工作表术语词典转换为相应的列。

values_dict = { 
'Shape': ['Oval, Round', None, 'Irregular'], 
'Margin': ['Circumscribed', None, 'Obscured, Microlobulated, Indistinct, Spiculated'], 
'Density': ['High density', 'Equal density, Low density', None] 
} 

如果情况并非如此,那么包含在字符串中的列表首先需要扩展。

最简单的方法是与每个键工作,分别对值。对于openpyxl这将是

for col, (key, values) in enumerate(values_dict.items(), start=1): 
    ws.cell(row=1, column=col, value=key) # column title 
    for row, v in enumerate(values, start=2): 
     ws.cell(row=2, column=col, value=v) # column values 

但是,这涉及到很多行和单元格的坐标,我们尽量避免在openpyxl计数。另一种方法是使用zip将值列转换为我们可以添加的行。

from itertools import zip_longest 
headers = [] 
values = [] 
for k, v in values_dict.items(): 
    headers.append(k) 
    values.append(v) 
ws.append(headers) 
for row in zip_longest(*values): 
    ws.append(row) 

当然,这将是更好,如果中间结构可以被删除。