期刊名称:International Journal of Computer Science Issues
印刷版ISSN:1694-0784
电子版ISSN:1694-0814
出版年度:2012
卷号:9
期号:4
出版社:IJCSI Press
摘要:Text classification is a challenging field in the current scenario and has great importance in text categorization application. Documents may be classified or categorized according to their subjects or according to their attributes. There is need to categorize a collection of text document into mutually exclusive categories by extracting the concept or features using supervised learning paradigm and different classification algorithm. In this paper we present a nave based approach for the classification using semi-supervised text classification methodology with the help of Activation term sets. Such frequent term set can be discovered based on David Merrills First principles of instruction (FPI) techniques. The system uses a pre-defined category group by providing them with the proper training set based on the activation of FPI We made an attempt to classify the document using FPI methodology, the algorithm involves the text tokenization, text categorization and text analysis
关键词:Text mining; Text characterization; Text Classification; Text tokenization; FPI and Instructional phase