首页    期刊浏览 2025年07月12日 星期六
登录注册

文章基本信息

  • 标题:Creation of Text Document Matrices and Visualization by Self-Organizing Map
  • 其他标题:Creation of Text Document Matrices and Visualization by Self-Organizing Map
  • 作者:Stefanovič, P. ; Kurasova, O.
  • 期刊名称:Engineering Economics
  • 印刷版ISSN:2029-5839
  • 出版年度:2014
  • 卷号:43
  • 期号:1
  • 页码:37-46
  • DOI:10.5755/j01.itc.43.1.4299
  • 语种:English
  • 出版社:Kaunas University of Technology
  • 摘要:In the paper, text mining and visualization by self-organizing map (SOM) are investigated. At first, textual information must be converted into numerical one. The results of text mining and visualization depend on the conversion. So, the influence of some control factors (the common word list and usage of the stemming algorithm) on text mining results, when a document dictionary is created, is investigated. A self-organizing map is used for text clustering and graphical representation (visualization). A comparative analysis is made where a dataset consists of scientific papers about the optimization, based on Pareto, simplex, and genetic algorithms. Two new measures are also proposed to estimate the SOM quality when the classified data are analyzed: distances between SOM cells, corresponding to data items assigned to the same class, and the distance between centers of SOM cells, corresponding to different classes. The quantization error is measured to estimate the SOM quality, too. DOI: http://dx.doi.org/10.5755/j01.itc.43.1.4299
  • 关键词:self-organizing map; text mining; text document matrix; document dictionary; quantization error; SOM quality measures; common word list
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有