首页    期刊浏览 2025年02月22日 星期六
登录注册

文章基本信息

  • 标题:Using New Data Structure to Implement Documents Vectors in Vector Space Model in Information Retrieval System
  • 本地全文:下载
  • 作者:Dr. Khalaf Khatatneh ; M.Wedyan ; Dr.Mohamed Alham
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2010
  • 卷号:19
  • 期号:01
  • 出版社:Journal of Theoretical and Applied
  • 摘要:

    In this paper, we present how table memorized semiring structure contributes in the vector space model in information retrieval system. We implement this new structure by generating table for each document and the first row filled with key word and the second row filled with weight for each word. This new structure implements using 242 Arabic documents which were presented in the Saudi Arabian National Computer Conference. The new method (technique) shows a new result which is more efficient than traditional structure and can save space. The results also show that when we used traditional structure system it occupies 204248 × units to implement vectors, but in new data structure system it occupies 5388×unit which means that we saved more than 198860 space units.

  • 关键词:Stop Words; Memorized Semirings; Vector Space Model; Stopwords
国家哲学社会科学文献中心版权所有