文章基本信息

标题：Improving the Efficiency of ILP by Simultaneous Use of Most Specific Hypotheses
本地全文：下载
作者：Tomonobu Ozaki ; Koichi Furukawa
期刊名称：人工知能学会論文誌
印刷版ISSN：1346-0714
电子版ISSN：1346-8030
出版年度：2001
卷号：16
期号：6
页码：521-530
DOI：10.1527/tjsai.16.521
出版社：The Japanese Society for Artificial Intelligence
摘要：Inductive Logic Programming (ILP) employs expressive representation languages, i.e. Prolog Programs, so that ILP can handle structural data which other traditional inductive learner can not or hardly do. In this reason, ILP has been regarded as one of the most important technologies in the area of Data Mining and Knowledge Discovery in Databases recently. However, ILP usually needs enormous computational time to obtain the results from a huge amount of data appearing in such problems as Data Mining. To cope with this problem and to make ILP more practical, we need more efficient algorithms. Since learning by ILP can be regarded as the search problem, we need to consider the reduction of the number of hypotheses to be evaluated and the efficient evaluation algorithm in order to make ILP more efficient. One of the most effective ways in reducing the hypothesis space to be searched is to compute a Most Specific Hypothesis (MSH) from one positive example by Inverse Entailment. Since MSH bounds the hypothesis space, it is important to decide which example should be used to generate MSH. In this paper, in order to reduce the computational time, we propose the following three algorithms for ILP systems which employ coverset algorithm, inverse entailment and top-down search strategy: (1)selection of search space, (2)incremental search within one class, (3)integration of hypothesis spaces among the different classes. The main common feature of these three algorithms is that, instead of single MSH, plural MSHs are considered in the search process. Experiments were conducted to assess the effectiveness of the proposed algorithms. The results show that the proposed algorithms are useful for reducing the number of candidate hypotheses to be evaluated as well as the total computational time for induction.
关键词：inductive logic programming ; most specific hypothesis ; incremental search ; multiple class classification