Journal: Bulletin of the Technical Committee on Data Engineering
Year: 2014
Volume: 37
Issue: 3
Publisher: IEEE Computer Society
Abstract: Modern knowledge bases such as Yago [14], DeepDive [19], and Google’s Knowledge Vault [6] are constructed from large corpora of text by using some form of supervised information extraction. The extracted data usually starts as a large probabilistic database; its accuracy is then improved by adding domain knowledge expressed as hard or soft constraints. Finally, the knowledge base can be queried using a general-purpose query language (SQL or SPARQL).
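
Illustration: the pipeline in the abstract (probabilistic facts, then a constraint, then a query) can be sketched on a toy example. The following is a minimal sketch and not the method of any system named above. It assumes a tuple-independent probabilistic database, a single hard constraint, and brute-force enumeration of possible worlds. The facts, their probabilities, and the names `facts`, `one_birthplace`, `born_in_london`, and `query_probability` are all hypothetical.

```python
from itertools import product

# Hypothetical extracted facts with independent marginal probabilities.
facts = {
    ("born_in", "Ada", "London"): 0.8,
    ("born_in", "Ada", "Paris"): 0.3,
}

def one_birthplace(world):
    """Assumed hard constraint: each person has at most one birthplace."""
    people = [person for (rel, person, _) in world if rel == "born_in"]
    return len(people) == len(set(people))

def born_in_london(world):
    """Boolean query: was Ada born in London?"""
    return ("born_in", "Ada", "London") in world

def query_probability(facts, constraint, query):
    """Probability of the query, conditioned on the hard constraint.

    Enumerates all 2^n possible worlds, so it only suits tiny examples.
    """
    keys = list(facts)
    total = 0.0    # probability mass of worlds that satisfy the constraint
    matched = 0.0  # mass of those worlds in which the query also holds
    for bits in product([False, True], repeat=len(keys)):
        world = {k for k, present in zip(keys, bits) if present}
        weight = 1.0
        for k, present in zip(keys, bits):
            weight *= facts[k] if present else 1.0 - facts[k]
        if not constraint(world):
            continue  # a hard constraint removes the world entirely
        total += weight
        if query(world):
            matched += weight
    if total == 0.0:
        raise ValueError("no possible world satisfies the constraint")
    return matched / total

unconstrained = query_probability(facts, lambda world: True, born_in_london)
constrained = query_probability(facts, one_birthplace, born_in_london)
print(unconstrained, constrained)
```

In this toy setting the constraint discards the world that contains both birthplaces and renormalizes the remaining probability mass. A soft constraint would instead down-weight such a world rather than remove it.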