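How do I print the full distribution of words in a gensim LDA topic?
The lda.show_topics method in the code below prints only the distribution of the top 10 words for each topic. How do I print the full distribution over all the words in the corpus?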
from gensim import corpora, models

documents = ["Human machine interface for lab abc computer applications",
             "A survey of user opinion of computer system response time",
             "The EPS user interface management system",
             "System and human system engineering testing of EPS",
             "Relation of user perceived response time to error measurement",
             "The generation of random binary unordered trees",
             "The intersection graph of paths in trees",
             "Graph minors IV Widths of trees and well quasi ordering",
             "Graph minors A survey"]

# Lowercase, split on whitespace, and drop common stop words
stoplist = set('for a of the and to in'.split())
texts = [[word for word in document.lower().split() if word not in stoplist]
         for document in documents]

# Map each word to an integer id and convert documents to bag-of-words vectors
dictionary = corpora.Dictionary(texts)
corpus = [dictionary.doc2bow(text) for text in texts]

# corpus_tfidf was never defined in the snippet as posted; presumably it is
# the TF-IDF-weighted version of the bag-of-words corpus
tfidf = models.TfidfModel(corpus)
corpus_tfidf = tfidf[corpus]

lda = models.ldamodel.LdaModel(corpus_tfidf, id2word=dictionary, num_topics=2)

# Prints only the top 10 words per topic by default
for i in lda.show_topics():
    print(i)
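You could do something hacky: edit the LDA package in site-packages (or wherever it lives on your machine) so that it prints all of them, or copy its code into your program and change it to print everything instead of just 10. – debianplebian
Just found the answer, it was hidden in the API =). Answered my own question. – alvas
Good job finding it. – debianplebian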
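For anyone landing here later, the API route mentioned in the comments can be sketched roughly as follows. This is only a minimal sketch, not necessarily the exact answer alvas found: it assumes a gensim version where LdaModel.show_topic accepts a topn argument and the model exposes num_topics. The helper name full_topic_distributions and the demo function are made up for illustration, and the demo trains on a tiny made-up corpus rather than the documents from the question.

```python
def full_topic_distributions(lda, vocab_size):
    """Hypothetical helper: for every topic, return the entries for all
    vocab_size words instead of only the default top 10."""
    # show_topic caps its output at topn entries (10 by default), so
    # passing the vocabulary size lifts that cap.
    return [lda.show_topic(topic_id, topn=vocab_size)
            for topic_id in range(lda.num_topics)]


def demo():
    # gensim is imported here so the helper above stays usable on its own.
    from gensim import corpora, models

    # Tiny illustrative corpus (an assumption, not the question's data).
    texts = [["human", "computer", "interface"],
             ["graph", "trees", "minors"],
             ["user", "computer", "system"]]
    dictionary = corpora.Dictionary(texts)
    corpus = [dictionary.doc2bow(text) for text in texts]
    lda = models.LdaModel(corpus, id2word=dictionary, num_topics=2)

    # One line per topic, each listing every word in the dictionary.
    distributions = full_topic_distributions(lda, len(dictionary))
    for topic_id, pairs in enumerate(distributions):
        print(topic_id, pairs)
```

Calling demo() prints one list per topic covering the whole vocabulary. The order of each pair (word first or probability first) has differed between gensim releases, so check what your version returns. Recent releases also provide lda.get_topics(), which returns the full topic-by-word matrix as a NumPy array, if you would rather work with raw numbers.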