首页    期刊浏览 2025年06月30日 星期一
登录注册

文章基本信息

  • 标题:RealText-lex: A Lexicalization Framework for RDF Triples
  • 本地全文:下载
  • 作者:Rivindu Perera ; Parma Nand ; Gisela Klette
  • 期刊名称:The Prague Bulletin of Mathematical Linguistics
  • 印刷版ISSN:0032-6585
  • 电子版ISSN:1804-0462
  • 出版年度:2016
  • 卷号:106
  • 期号:1
  • 页码:45-68
  • DOI:10.1515/pralin-2016-0011
  • 语种:English
  • 出版社:Walter de Gruyter GmbH
  • 摘要:The online era has made available almost cosmic amounts of information in the public and semi-restricted domains, prompting development of corresponding host of technologies to organize and navigate this information. One of these developing technologies deals with encoding information from free form natural language into a structured form as RDF triples. This representation enables machine processing of the data, however the processed information can not be directly converted back to human language. This has created a need to be able to lexicalize machine processed data existing as triples into a natural language, so that there is seamless transition between machine representation of information and information meant for human consumption. This paper presents a framework to lexicalize RDF triples extracted from DBpedia, a central interlinking hub for the emerging Web of Data. The framework comprises of four pattern mining modules which generate lexicalization patterns to transform triples to natural language sentences. Among these modules, three are based on lexicons and the other works on extracting relations by exploiting unstructured text to generate lexicalization patterns. A linguistic accuracy evaluation and a human evaluation on a sub-sample showed that the framework can produce patterns which are accurate and emanate human generated qualities.
国家哲学社会科学文献中心版权所有