首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:Finding Specification Pages from the Web
  • 本地全文:下载
  • 作者:Naoki Yoshinaga ; Kentaro Torisawa
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2006
  • 卷号:21
  • 期号:6
  • 页码:493-501
  • DOI:10.1527/tjsai.21.493
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:This paper presents a method of finding a specification page on the Web for a given object (e.g., ``Ch. d'Yquem'') and its class label (e.g., ``wine''). A specification page for an object is a Web page which gives concise attribute-value information about the object (e.g., ``county''-``Sauternes'') in well formatted structures. A simple unsupervised method using layout and symbolic decoration cues was applied to a large number of the Web pages to acquire candidate attributes for each class (e.g., ``county'' for a class ``wine''). We then filter out irrelevant words from the putative attributes through an author-aware scoring function that we called site frequency . We used the acquired attributes to select a representative specification page for a given object from the Web pages retrieved by a normal search engine. Experimental results revealed that our system greatly outperformed the normal search engine in terms of this specification retrieval.
  • 关键词:specification finding ; Web search ; attribute acquisition
国家哲学社会科学文献中心版权所有