2016-11-06 137 views
0

我试着运行这段代码:NLTK错误加载模块

import nltk 


text = "Mrs. Hudson made a cup of tea. She is a wonderful woman." 
sentences = nltk.tokenize.sent_tokenize(text)#breaks statement into  
print sentences 
#print tokens 
tokens = [nltk.tokenize.word_tokenize(s) for s in sentences]#tokenizes sentences passes as list of lists 

PosTokens = [nltk.pos_tag(e) for e in tokens] 

当我运行它,我得到一个错误:

Traceback (most recent call last): 
    File "<stdin>", line 1, in <module> 
NameError: name 'averaged_perceptron_tagger' is not defined 

于是我运行下载得到的恶搞和每个这个问题我需要 'maxtent_treebank_pos_tagger' nltk pos_tag usage

,我得到以下几点:

nltk.download('maxtent_treebank_pos-tagger') 

NameError: name 'averaged_perceptron_tagger' is not defined 
>>> nltk.download('maxtent_treebank_pos-tagger') 
[nltk_data] Error loading maxtent_treebank_pos-tagger: Package 
[nltk_data]  'maxtent_treebank_pos-tagger' not found in index 
False 

因此,我非常感谢所有帮助!

+0

你的问题是一个错字:它的 “MAXENT”(最大熵),而不是 “maxtent”。 – alexis

+0

[nltk_data]加载maxent时出错:未找到包中的'maxent' False –

+1

@alvas这是一个关于拼写错误的问题,而不是关于如何使用标记的重复。 – alexis

回答

0

我想通了,我输入一个错字

其nltk.download(maxent_treebank_pos_tagger)