首页    期刊浏览 2025年06月15日 星期日
登录注册

文章基本信息

  • 标题:Fuzzy Rules for Document Classification to Improve Information Retrieval
  • 本地全文:下载
  • 作者:Tatiane M. Nogueira ; Heloisa A. Camargo ; Solange O. Rezende
  • 期刊名称:International Journal of Computer Information Systems and Industrial Management Applications
  • 印刷版ISSN:2150-7988
  • 电子版ISSN:2150-7988
  • 出版年度:2011
  • 卷号:3
  • 页码:210-217
  • 出版社:Machine Intelligence Research Labs (MIR Labs)
  • 摘要:In this work, we present a method to generate, from text documents, fuzzy rules used to classify documents and to improve the information retrieval. With this method, we face the issue of dimensionality in text documents for information retrieval. We also present a comparison analysis among the method that we proposed and well-known machine learning methods for classification. The aim of our work is to develop a mechanism to reduce the high dimensionality of the attribute-value matrix obtained from the documents and, consequently, scale up the proposed classifier. Some experiments have been run using different domains in order to validate the proposed approach and compare the results with the ones obtained with the OneR, K-Nearest Neighbor classifier, C4.5, Multi-variable Naive Bayes, and SVM methods. The experiments and the obtained results showed that this is a promising approach to deal with the dimensionality problem of document for information retrieval.
  • 关键词:fuzzy clustering; information retrieval; text mining; ; text categorization; uncertainty; imprecision
国家哲学社会科学文献中心版权所有