首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:Automatic Tag Attachment Scheme based on Text Clustering for Efficient File Search in Unstructured Peer-to-Peer File Sharing Systems
  • 本地全文:下载
  • 作者:T.T. Qin ; S. Fujita
  • 期刊名称:Journal of Universal Computer Science
  • 印刷版ISSN:0948-6968
  • 出版年度:2012
  • 卷号:18
  • 期号:8
  • 页码:1032-1047
  • 出版社:Graz University of Technology and Know-Center
  • 摘要:In this paper, the authors address the issue of automatic tag attachment to the documents distributed over a P2P network aiming at improving the efficiency of file search in such networks. The proposed scheme combines text clustering with a modified tag extraction algorithm, and is executed in a fully distributed manner. Meanwhile, the optimal cluster number can also be fixed automatically through a distance cost function. We have conducted experiments to evaluate the accuracy of the proposed scheme. The result of experiments indicates that the proposed approach is capable of making effective and efficient tag attachment in real scenarios; i.e., for more than 90% of documents, it attaches the same tags as the ones attached by human reviewers. Moreover, it proofs by the experiments that the optimal cluster number is almost the same as the number of topics from the website.
  • 关键词:P2P system; text clustering; automatic tag attachment; K-DMeans; TFIDCF
国家哲学社会科学文献中心版权所有