2017-07-26 34 views
New error with the tm package and DTM/wordclouds

I am using R (3.2.5) with the following packages loaded: 'SnowballC', 'tm', 'NLP', 'RWeka', 'RTextTools', 'wordcloud', 'fpc'.

carmenCorpus <- Corpus(VectorSource(feedback$Description)) 
carmenCorpus <- tm_map(carmenCorpus, PlainTextDocument) 
carmenCorpus <- tm_map(carmenCorpus, removePunctuation) 
carmenCorpus <- tm_map(carmenCorpus, removeWords, stopwords('english')) 
carmenCorpus <- tm_map(carmenCorpus, stemDocument) 

When I try to create the wordcloud, I get the following error. This error is new; the same code ran without problems a few months ago:

wordcloud(carmenCorpus, max.words = 100, random.order = FALSE) 

# Error in simple_triplet_matrix(i, j, v, nrow = length(terms), ncol = length(corpus), : 
# 'i, j' invalid 

Please advise on this issue.

Answer

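wordcloud can't just take a corpus and magically produce a word cloud.

You have to do the hard work of converting the corpus to a TermDocumentMatrix and then summing the word frequencies: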

# convert to TDM 
tdm <- TermDocumentMatrix(carmenCorpus, control=list(stemming=TRUE)) 

# calculate word frequencies 
freqs = sort(rowSums(as.matrix(tdm)), decreasing=TRUE) 

# plot wordcloud 
wordcloud(names(freqs), freqs, 
    max.words = 100, 
    random.order = FALSE 
    # add any other wordcloud params above, separated by commas 
    )
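
To make the frequency step concrete, here is a minimal base R sketch of what rowSums and sort do to a term-document matrix. The matrix `m`, its terms, and its counts are made up for illustration and stand in for `as.matrix(tdm)`; they are not taken from the question's data.

```r
# Toy term-document matrix: rows are terms, columns are documents.
# The terms and counts are hypothetical, standing in for as.matrix(tdm).
m <- matrix(
  c(2, 0, 1,
    0, 3, 1,
    1, 1, 0),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("servic", "bill", "agent"),
                  c("doc1", "doc2", "doc3"))
)

# Sum each term's counts across all documents, most frequent first.
freqs <- sort(rowSums(m), decreasing = TRUE)

# names(freqs) holds the words, freqs holds their counts: the two
# arguments passed to wordcloud() in the answer above.
print(freqs)
```

The result is a named numeric vector, so the words and their counts stay aligned when they are passed to wordcloud separately.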