我有一个文件夹全是用相同方式命名的图片。如何使用Python将带图像的文件夹转换为Excel文件
文件名: .. \ name_ID。
我想创建一个电子表格,并将图片的名称,ID和链接放入单独的列中。
应该使用openpyxl,xlsxwriter还是别的?
我有一个文件夹全是用相同方式命名的图片。如何使用Python将带图像的文件夹转换为Excel文件
文件名: .. \ name_ID。
我想创建一个电子表格,并将图片的名称,ID和链接放入单独的列中。
应该使用openpyxl,xlsxwriter还是别的?
我提供一个答案,告诉您如何实现这使用xlsxwriter。它创建一个电子表格,其名称和ID以及三个单独列中关联图片的链接。
答案使用urllib.request使其具有可重现性(该模块不是必需的,我只是把它放在那里下载三个测试图像)。我还将目录设置为当前目录,您可以根据需要修改该目录。另外,在我的回答中,我已将其设置为仅查找.png文件,但您也可以调整以查找其他文件格式。
import urllib.request
import xlsxwriter
import os
#comment out the next 4 lines if you don't want to download 3 pictures
url = 'https://upload.wikimedia.org/wikipedia/en/thumb/4/43/Ipswich_Town.svg/255px-Ipswich_Town.svg.png'
urllib.request.urlretrieve(url, "pica_1.png")
urllib.request.urlretrieve(url, "picb_2.png")
urllib.request.urlretrieve(url, "picc_3.png")
dir_wanted = os.getcwd()
#uncomment the following line if you don't want the current directory
#dir_wanted = "C:\\users\\doe_j"
file_list = [file for file in os.listdir(dir_wanted) if file.endswith('.png')]
full_path_list = [dir_wanted + '\\' + file for file in file_list]
name_list = []
num_list = []
for item in file_list:
temp_list = item.rpartition('_')
name = str(temp_list[0])
num = str(temp_list[2].rpartition('.')[0])
name_list.append(name)
num_list.append(num)
workbook = xlsxwriter.Workbook('pics_and_links.xlsx')
ws = workbook.add_worksheet('Links')
#adding column titles and making them bold
bold = workbook.add_format({'bold': True})
ws.write('A1', "Name", bold)
ws.write('B1', "Number", bold)
ws.write('C1', "Link", bold)
#putting the three lists we made into the workbook
for i in range (0, len(full_path_list)):
row_num = i + 2
ws.write('A%d' % row_num, name_list[i])
ws.write('B%d' % row_num, int(num_list[i]))
ws.write_url('C%d' % row_num, full_path_list[i])
#Set the width of the column with the links in it
ws.set_column(2, 2, 40)
workbook.close()
我有openpyxl或xlsxwriter没有经验,但如果我看着openpyxl的文档,我想象中的计划将是这样的
from openpyxl import Workbook
from openpyxl.styles import PatternFill
from scipy.misc import imread
wb = Workboo()
ws = wb.active
img = imread('image.jpg', mode='RGB')
for i in range(len(img)):
for j in range(len(img[0])):
# TODO a method to set turn (3, 1) into 'D2'
index = excel_coordinate(i, j)
# TODO a method to change RGB in a hex value, perhaps imread also support hex, not sure
hexval = RGB2hex(img[i][j])
cel = ws[index]
cel.PatternFill("Solid", fgColor=hexval)
对不起,回答晚了,真的apreciate的帮助! – Grantler
可以使用pandas包装它:
import glob
import os
import pandas as pd
files_dir = '/home/username/files_dir' # here should be path to your directory with images
files = glob.glob(os.path.join(files_dir, '*'))
df = pd.DataFrame(columns=['name', 'id', 'hyperlink'])
for i, full_filename in enumerate(files):
filename = os.path.basename(full_filename)
name, id_ = filename.split('_')
id_ = os.path.splitext(id_)[0] # remove file extension from id_
hyperlink = '=HYPERLINK("file:///{}")'.format(full_filename)
df.loc[i] = [name, id_, hyperlink]
df.to_excel('output_file.xlsx', index=False)
非常感谢你,你真的帮我解决了这个问题! – Grantler
太好了。很高兴帮助。 – patrickjlong1