0
我试着运行这段代码:NLTK错误加载模块
import nltk
text = "Mrs. Hudson made a cup of tea. She is a wonderful woman."
sentences = nltk.tokenize.sent_tokenize(text)#breaks statement into
print sentences
#print tokens
tokens = [nltk.tokenize.word_tokenize(s) for s in sentences]#tokenizes sentences passes as list of lists
PosTokens = [nltk.pos_tag(e) for e in tokens]
当我运行它,我得到一个错误:
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
NameError: name 'averaged_perceptron_tagger' is not defined
于是我运行下载得到的恶搞和每个这个问题我需要 'maxtent_treebank_pos_tagger' nltk pos_tag usage
,我得到以下几点:
nltk.download('maxtent_treebank_pos-tagger')
NameError: name 'averaged_perceptron_tagger' is not defined
>>> nltk.download('maxtent_treebank_pos-tagger')
[nltk_data] Error loading maxtent_treebank_pos-tagger: Package
[nltk_data] 'maxtent_treebank_pos-tagger' not found in index
False
因此,我非常感谢所有帮助!
你的问题是一个错字:它的 “MAXENT”(最大熵),而不是 “maxtent”。 – alexis
[nltk_data]加载maxent时出错:未找到包中的'maxent' False –
@alvas这是一个关于拼写错误的问题,而不是关于如何使用标记的重复。 – alexis