文章基本信息

标题：A Feature Selection Based on Relevance and Redundancy
其他标题：A Feature Selection Based on Relevance and Redundancy
本地全文：下载
作者：Yonghe Lu ; Wenqiu Liu ; Yanfeng Li 等
期刊名称：Journal of Computers
印刷版ISSN：1796-203X
出版年度：2015
卷号：10
期号：4
页码：284-291
DOI：10.17706/jcp.10.4.284-291
出版社：Academy Publisher
摘要：At present, most of the researches on feature selection do not consider the relevance between a term and its own category, the redundancy among terms. In order to solve this problem efficiently, we propose a new feature selection based on analyzing how to measure the relevance and the redundancy, which use Euclidean distance as the similarity calculation method. R2, the new feature selection algorithm, can obtain the optimal feature subset which has considered the correlations between term and category and filtered the redundant terms. Finally, the validity of the new algorithm in feature selection is validated by the classification experiments on Chinese classification corpus by two classifiers, including KNN and Centroid-based classifier.
其他关键词：Text classification; feature selection; relevance; redundancy