文章基本信息

标题：Why Not Use an Oracle When You Got One?
本地全文：下载
作者：Ulf Johansson ; Tuve Löfström ; Rikard König 等
期刊名称：Neural Information Processing: Letters and Reviews
电子版ISSN：1738-2532
出版年度：2006
卷号：10
期号：8
页码：227-236
出版社：Neural Information Processing
摘要：The primary goal of predictive modeling is to achieve high accuracy when the model is applied to novel data. For certain problems this requires the use of complex techniques like neural networks or ensembles, resulting in opaque models that are hard or impossible to interpret. For some domains this is unacceptable, since models need to be comprehensible. To achieve comprehensibility, accuracy is often sacrificed by using simpler techniques; a tradeoff termed the accuracy vs. comprehensibility tradeoff. Another, frequently studied, alternative is rule extraction; i.e. the activity where another, transparent, model is generated from the opaque model. In this paper it is argued that existing rule extraction algorithms do not use all information available, and typically should benefit from also using oracle data; i.e. test set instances, together with corresponding predictions from the opaque model. The experiments, using fifteen publicly available data sets, clearly show that rules extracted using either just oracle data or training data augmented with oracle data, will explain the predictions significantly better than rules extracted in the standard way; i.e. using training data only. Rule extraction, neural networks, accuracy vs. comprehensibility tradeoff