首页    期刊浏览 2024年10月04日 星期五
登录注册

文章基本信息

  • 标题:GrandBase: generating actionable knowledge from Big Data
  • 本地全文:下载
  • 作者:Xiu Susie Fang ; Quan Z. Sheng ; Xianzhi Wang
  • 期刊名称:PSU Research Review
  • 印刷版ISSN:2399-1747
  • 出版年度:2017
  • 卷号:1
  • 期号:2
  • 页码:105-126
  • DOI:10.1108/PRR-01-2017-0005
  • 摘要:This paper aims to propose a system for generating actionable knowledge from Big Data and use this system to construct a comprehensive knowledge base (KB), called GrandBase.,In particular, this study extracts new predicates from four types of data sources, namely, Web texts, Document Object Model (DOM) trees, existing KBs and query stream to augment the ontology of the existing KB (i.e. Freebase). In addition, a graph-based approach to conduct better truth discovery for multi-valued predicates is also proposed.,Empirical studies demonstrate the effectiveness of the approaches presented in this study and the potential of GrandBase. The future research directions regarding GrandBase construction and extension has also been discussed.,To revolutionize our modern society by using the wisdom of Big Data, considerable KBs have been constructed to feed the massive knowledge-driven applications with Resource Description Framework triples. The important challenges for KB construction include extracting information from large-scale, possibly conflicting and different-structured data sources (i.e. the knowledge extraction problem) and reconciling the conflicts that reside in the sources (i.e. the truth discovery problem). Tremendous research efforts have been contributed on both problems. However, the existing KBs are far from being comprehensive and accurate: first, existing knowledge extraction systems retrieve data from limited types of Web sources; second, existing truth discovery approaches commonly assume each predicate has only one true value. In this paper, the focus is on the problem of generating actionable knowledge from Big Data. A system is proposed, which consists of two phases, namely, knowledge extraction and truth discovery, to construct a broader KB, called GrandBase.
  • 关键词:Big Data;Information extraction;DOM trees;Knowledge bases;Multi-valued predicates;Truth discovery
国家哲学社会科学文献中心版权所有