首页    期刊浏览 2024年11月13日 星期三
登录注册

文章基本信息

  • 标题:Glossing User Search Results From Large Web Databases
  • 本地全文:下载
  • 作者:G.Baloji ; Punugoti Srikanth ; Janapati Venkata Krishna
  • 期刊名称:International Journal of Computer Trends and Technology
  • 电子版ISSN:2231-2803
  • 出版年度:2014
  • 卷号:16
  • 期号:4
  • 页码:137-140
  • 出版社:Seventh Sense Research Group
  • 摘要:An increasing number of databases have become web accessible through HTML formbased search interfaces. Hence the data units returned from the underlying database are usually encoded into the result pages dynamically for human browsing. For the encoded data units which is to be machine processable, which is necessary for many applications such as deep web data collection and Internet comparison shopping, they need to extracted out and assigned meaningful labels. In this paper, we are presenting an automatic annotation approach that first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Hence, for each group we annotate it from different aspects and aggregate the different annotations to predict a final annotation label for it. An annotation wrapper to the search site is automatically constructed and can be used to annotate new result pages from the same web database. The search technic is related to the fuzzy search technique and we also propose ranked keyword search. Our experiments indicate that the proposed approach is highly effective.
  • 关键词:Glossing User Search Results From Large Web Databases
国家哲学社会科学文献中心版权所有