首页    期刊浏览 2025年02月20日 星期四
登录注册

文章基本信息

  • 标题:Veracity Finding from Information Provide on the Web
  • 本地全文:下载
  • 作者:Vijay Kumar, B.Srinivasa Rao
  • 期刊名称:Computer Sciences and Telecommunications
  • 印刷版ISSN:1512-1232
  • 出版年度:2010
  • 卷号:28
  • 期号:05
  • 出版社:Internet Academy
  • 摘要:

    The world-wide web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the web. Moreover, different web sites often provide conflicting information on a subject, such as different specifications for the same product. In this paper we propose a new problem called Veracity, i.e., conformity to truth, which studies how to find true facts from a large amount of conflicting information on many subjects that is provided by various web sites. We design a general framework for the Veracity problem, and invent an algorithm called TruthFinder, which uti- lizes the relationships between web sites and their information, i.e., a web site is trustworthy if it provides true information, and a piece of information is likely to be true if it is provided by many trustworthy web sites. Our experiments show that TruthFinder successfully finds true facts among conflicting information, and identifies trustworthy web sites better than the popular search engines

  • 关键词:Data quality; Web mining; Link analysis.
国家哲学社会科学文献中心版权所有