Article Information

  • Title: EIGENVECTOR SPACE MODEL TO CAPTURE FEATURES OF DOCUMENTS
  • Authors: Choi DONGJIN; Kim PANKOO
  • Journal: Annals of Spiru Haret University Economic Series
  • Electronic ISSN: 2393-1795
  • Year of publication: 2011
  • Volume: 11
  • Issue: 3
  • Language: English
  • Publisher: Editura Fundatiei Romania de Maine
  • Abstract: Eigenvectors are a special set of vectors associated with a linear system of equations. Because of their special properties, eigenvectors have been widely used in computer vision. When eigenvectors are applied to information retrieval, they make it possible to obtain properties of a document corpus. To capture the properties of given documents, this paper conducts simple experiments to show that eigenvectors can also be used in document analysis. The experiments use the short abstracts of Wikipedia articles provided by DBpedia as the document corpus. The original square matrix is built with tf-idf, the most popular term-weighting measure. After the eigenvectors of the original matrix are calculated, each vector is plotted in a 3D graph to find out what the eigenvectors mean in document processing.
  • Keywords: eigenvector; Vector Space Model; Natural Language Processing; document analyzing; Information Retrieval; text minin
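
The pipeline outlined in the abstract (tf-idf weighting, a square matrix, then eigenvectors) can be illustrated with a minimal sketch. The abstract does not say how the square matrix is constructed, so this sketch assumes a document-by-document matrix of dot products between tf-idf rows. It also swaps in plain power iteration, which approximates only the dominant eigenvector rather than the full set the paper plots. The toy corpus and all function names are hypothetical and are not taken from the paper.

```python
import math

def tfidf_matrix(docs):
    """Return (vocab, rows); rows[i][j] is the tf-idf weight of term j in document i."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = {t: sum(1 for toks in tokenized if t in toks) for t in vocab}
    rows = []
    for toks in tokenized:
        rows.append([(toks.count(t) / len(toks)) * math.log(n_docs / df[t])
                     for t in vocab])
    return vocab, rows

def gram_matrix(rows):
    """Assumed construction: square (documents x documents) matrix of dot products."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in rows] for r1 in rows]

def dominant_eigenpair(matrix, iterations=500):
    """Power iteration: approximate the largest eigenvalue and a unit eigenvector."""
    n = len(matrix)
    v = [1.0 / math.sqrt(n)] * n
    eigenvalue = 0.0
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # zero matrix: nothing to converge to
            break
        v = [x / norm for x in w]
        eigenvalue = norm
    return eigenvalue, v

# Hypothetical toy corpus standing in for the DBpedia short abstracts.
docs = [
    "eigenvector matrix linear algebra",
    "eigenvector matrix decomposition",
    "document retrieval text corpus",
]
vocab, rows = tfidf_matrix(docs)
gram = gram_matrix(rows)
eigenvalue, vector = dominant_eigenpair(gram)
print(eigenvalue, vector)
```

Each component of the resulting vector corresponds to one document, so a full decomposition would give three coordinates per document that could be plotted in 3D, as the abstract describes. Power iteration is used here only to keep the sketch dependency-free; a linear algebra library would normally compute every eigenvector at once.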