0
我想在J48中设置-C
参数并运行三个存储在哈希表中的特征选择算法。我想比较三者的表现,如精确度,真正积极,真正消极,F1等。但是,当我运行所有功能选择算法时,它们会返回相同的输出....我做错了什么?使用Weka进行参数优化和属性选择
Hashtable<String, ASEvaluation> search=new Hashtable<String, ASEvaluation>();
Instances training_data = new Instances(new BufferedReader(
new FileReader("test.arff")));
training_data.setClassIndex(training_data.numAttributes() - 1);
topAttributes = new int[training_data.numAttributes()];
AttributeSelectedClassifier classifier = new AttributeSelectedClassifier();
AttributeSelection attsel = new AttributeSelection();
search.put("Infogain", new InfoGainAttributeEval());
search.put("SymmetricalUncertAttribute",new SymmetricalUncertAttributeEval());
search.put("Chisquared",new ChiSquaredAttributeEval());
for(String key : search.keySet()) {
try{
Ranker attribute_search = new Ranker();
J48 base = new J48();
CVParameterSelection ps = new CVParameterSelection();
ps.setClassifier(base);
ps.setNumFolds(5);
ps.addCVParameter("C 0.1 0.5 5");
ps.buildClassifier(training_data);
System.out.println("---------------- " + search.get(key).toString() + " ----------------");
classifier.setClassifier(ps);
classifier.setEvaluator(search.get(key));
classifier.setSearch(attribute_search);
attsel.setEvaluator(search.get(key));
attsel.setSearch(attribute_search);
attsel.setInputFormat(training_data);
Evaluation evaluation = new Evaluation(training_data);
evaluation.crossValidateModel(ps, training_data, 10, new Random(1));
System.out.println("\nevaluation ->");
System.out.println(evaluation.toSummaryString());
System.out.println("MAE: " + evaluation.meanAbsoluteError());
} catch(Exception e) {
e.printStackTrace();
}
}
您是否检查过功能选择算法是否对您的数据集产生影响?他们可能真的会返回所有相同的功能集。即使他们不这样做,'J48'也可以简单地选择在生成的树中使用的属性的相同子集。 – 2013-03-02 15:52:18