期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2015
卷号:4
期号:2
页码:419-423
出版社:Shri Pannalal Research Institute of Technolgy
摘要:The goal of text classification is to classify the text documents into a certain number of pre-defined classes. As per demands for text is increased with the evolution of large amount of information available in internet, news, institutes. For this large amount of data we need a text classifier which will help in classifying text documents. In classification there are major issues such as handling large number of features, unstructured text documents, and selecting a machine learning technique suitable for the text classification application. The Automatic Text Classification System solves those problems by giving the text document to a set of pre-defined classes by using machine learning techniques. The appropriate machine learning technique for text classification is k-nearest neighbour. The classification is mainly done on the basis of significant words or features extracted from the text document. It also can be used for creating training documents and for creating databases for various texts.
关键词:Automatic Text Classification; Text Mining ; (Mining Algorithm); K-Nearest Neighbour algorithm; Text ; Data.