期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2015
卷号:4
期号:11
页码:4020-4024
出版社:Shri Pannalal Research Institute of Technolgy
摘要:To guarantee the quality of relevance feature in text documents is a big challenge because of large scale terms and data patterns. All the existing approaches have suffered from the two major problems such as polysemy and synonmy .There are several hybrid approaches were proposed for text classification. Feature selection is the process of selecting a subset of feature used to represent the data. In text classification it focuses on identifying relevant information without affecting the accuracy of the classifier. This paper gives the surveys on several approaches of text classification and feature selection methods for text classification. Feature selection methods are discussed for reducing the dimensionality of the dataset by removing features that are irrelevant for the classification.
关键词:Feature selection; Text classification; Relevance feature.