我必须找到单词是否在列表中,如果它在列表中找到,那么文件将用标记“1”写入该列表,否则文件将写入标签为“0”的列表。我的Python代码是低于遇到类型错误的错误:只能串联列表(不是 “STR”),列出如何在python中找到列表中的特定单词
f2 = open("C:/Python26/Semantics.txt",'w')
sem = ["cells", "gene","factor","alpha", "receptor", "t","promotor"];
with open("C:/Python26/trigram.txt") as f:
contents = f.readlines()
for lines in contents:
tokens = lines.split('$')
for t in tokens:
if t.strip() in sem:
f2.write(tokens+"\t"+"1 \n");
else:
f2.write(tokens+"\t"+"0 \n");
f2.close()
我的文件看起来像这样:
IL-2$gene$expression$and
IL-2$gene$expression$and$NF-kappa
IL-2$gene$expression$and$NF-kappa$B
IL-2$gene$expression$and$NF-kappa$B$activation
gene$expression$and$NF-kappa$B$activation$through
expression$and$NF-kappa$B$activation$through$CD28
我所需的输出
IL-2 gene expression and 1
IL-2 gene expression and NF-kappa 1
IL-2 gene expression and NF-kappa B 1
IL-2 gene expression and NF-kappa B activation 1
gene expression and NF-kappa B activation through 1
expression and NF-kappa B activation through CD28 0
的情况下,我想产生像
Token cells gene factor……. promoter
IL-2 gene expression and 0 1 0 ……… 0
IL-2 gene expression and NF-kappa 0 1 0 ……… 0
IL-2 gene expression and NF-kappa B 0 1 0 ……… 0
IL-2 gene expression and NF-kappa B activation 0 1 0 ……… 0
gene expression and NF-kappa B activation through 0 1 0 ……… 0
expression and NF-kappa B activation through CD28 0 0 0 ……… 0
我认为将需要在代码一点点变化
为什么你以semicolon结束sem,no需要在python中分号 – Hackaholic 2015-04-04 06:07:26
在代码中粘贴后,选择整个块并然后按Ctrl + K缩进所有**。您的程序需要按照显示运行,因为它有缩进错误。 Andy为什么在几行后没有一个分号后面有分号? – Anthon 2015-04-04 06:11:43