0
我刚刚拿起了python,我试图使用模块fuzzwuzzy与pandas一起协助匹配来自PLACEMENT和CREATIVE_NAME列的名称。Python循环通过单个单元格和csv中的列
我已经想出了如何针对所有CREATE_NAME行测试PLACEMENT的第一行;但是,我无法弄清楚如何移动到PLACEMENT的下一行,并针对CREATIVE_NAME列进行测试。
该项目的最终目标是打印每个PLACEMENT值的最佳匹配以便进一步分析。
df = pd.read_csv(filepath)
fp = df["PLACEMENT"]
tp = df["CREATIVE_NAME"]
score = 0
x=0
y=0
import csv
with open(filepath, 'r') as f:
reader = csv.DictReader(f)
for column in reader:
if score == 0:
score += fuzz.ratio(fp[x],tp[y])
if score > 95:
print "The score is %d"", We have a match!" %(score)
elif score > 70:
print "The score is %d"", We have a high likelihood of a match!" %(score)
elif score > 50:
print "The score is %d"", The match is not likely!" %(score)
else:
print "The score is only %d"", This is not a match!" %(score)
y += 1
score = 0
谢谢,代码工作完美!非常感谢您的反馈。是的,每列大约有15,000行。我将需要找到一种方法来首先对数据进行分组。 5排花了近20分钟。 –