Article Information

  • Title: Categorisation by Context
  • Authors: Giuseppe Attardi; Sergio Di Marco; Davide Salvi
  • Journal: Journal of Universal Computer Science
  • Print ISSN: 0948-6968
  • Publication year: 1998
  • Volume: 4
  • Issue: 9
  • Pages: 719-736
  • Publisher: Graz University of Technology and Know-Center
  • Abstract: Assistance in retrieving documents on the World Wide Web is provided either by search engines, through keyword-based queries, or by catalogues, which organise documents into hierarchical collections. Maintaining catalogues manually is becoming increasingly difficult due to the sheer amount of material on the Web, and it will therefore soon be necessary to resort to techniques for automatic classification of documents. Classification is traditionally performed by extracting information for indexing a document from the document itself. The paper describes the technique of categorisation by context, which exploits the context perceivable from the structure of HTML documents to extract useful information for classifying the documents they refer to. We present the results of experiments with a preliminary implementation of the technique.