2012-08-28 18 views
0

我需要使用Weka的LibSVM实现RSS源中关键字频率上的SVM分类器,以将这些源分类为目标类别。但是我不确定给出我的数据要运行哪个版本。在Weka中运行哪个版本的SVM?

我.arff文件通常包含以下数据:

@attribute Keyword_1_nasa_Frequency numeric 
@attribute Keyword_2_fish_Frequency numeric 
@attribute Keyword_3_kill_Frequency numeric 
@attribute Keyword_4_show_Frequency numeric 
… 
@attribute RSSFeedCategoryDescription {BFE,FCL,F,M, NCA, SNT,S} 

@data 
0,0,0,34,0,0,0,0,0,40,0,0,0,0,0,0,0,0,0,0,24,0,0,0,0,13,0,0,0,0,0,0,0,0,0,0,0,0, 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,BFE 
0,0,0,12,0,0,0,0,0,20,0,0,0,0,0,0,0,0,0,0,25,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0 
,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,BFE 
0,0,0,10,0,0,0,0,0,11,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,BFE 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,BFE 
… 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,FCL 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,F 
… 
20,0,64,19,0,162,0,0,36,72,179,24,24,47,24,40,0,48,0,0,0,97,24,0,48,205,143,62,7 
8,0,0,216,0,36,24,24,0,0,24,0,0,0,0,140,24,0,0,0,0,72,176,0,0,144,48,0,38,0,284, 
221,72,0,72,0,SNT 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 
0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,SNT 
0,0,0,0,0,0,11,0,0,0,0,0,0,0,19,0,0,0,0,0,0,0,0,0,0,10,0,0,0,0,0,0,0,0,0,0,0,0,0 
,0,0,0,0,0,0,0,0,0,17,0,0,0,0,0,0,0,0,0,0,0,0,0,20,0,S 

等等:总共有570行,其中每一个都是在每天的饲料用 频率的关键字中包含的。在这种情况下,共有57个记录供 10天共计570个记录进行分类。每个关键字的前缀为 ,并带有替代号码,后缀为“频率”。

但在其他情况下,我已经使用布尔值的频率,使上述第一行是:

假的,假的,假的,真的,假的......,BFE

而且依此类推,其中34是正确的,因为满足了阈值,其他因为阈值未达到而错误。

据我可以确定,有在Weka中三种类型的SVM的,但有谁能够告诉我,我应该用我上面的数据可以使用这类型?

回答

0

我建议所有三个核心类型的尝试,并确定哪一个是适合您的训练和验证数据的最好(做图),然后就继续使用该训练模型来预测新的投入。

在weka中,您可以保存模型以备将来使用。

相关问题