首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Semi Automated Text Categorization using Demonstration Based Term Set
  • 本地全文:下载
  • 作者:M. Pushpa ; K. Nirmala
  • 期刊名称:International Journal of Computer Science, Engineering and Applications (IJCSEA)
  • 印刷版ISSN:2231-0088
  • 电子版ISSN:2230-9616
  • 出版年度:2012
  • 卷号:2
  • 期号:4
  • DOI:10.5121/ijcsea.2012.2408
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Manual Analysis of huge amount of textual data requires a tremendous amount of processing time and effort in reading the text and organizing them in required format. In the current scenario, the major problem is with text categorization because of the high dimensionality of feature space. Now-a-days there are many methods available to deal with text feature selection. This paper aims at such semi automated text categorization feature selection methodology to deal with a massive data using one of the phases of David Merrill’s First principles of instruction (FPI). It uses a pre-defined category group by providing them with the proper training set based on the demonstration phase of FPI. The methodology involves the text tokenization, text categorization and text analysis.
  • 关键词:Text mining; Text characterization; Feature Selection; Text tokenization; FPI and Instructional phase
国家哲学社会科学文献中心版权所有