首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Linking Datasets Using Semantic Textual Similarity
  • 本地全文:下载
  • 作者:John P. McCrae ; Paul Buitelaar
  • 期刊名称:Cybernetics and Information Technologies
  • 印刷版ISSN:1311-9702
  • 电子版ISSN:1314-4081
  • 出版年度:2018
  • 卷号:18
  • 期号:1
  • 页码:109-123
  • DOI:10.2478/cait-2018-0010
  • 出版社:Bulgarian Academy of Science
  • 摘要:Linked data has been widely recognized as an important paradigm for representing data and one of the most important aspects of supporting its use is discovery of links between datasets. For many datasets, there is a significant amount of textual information in the form of labels, descriptions and documentation about the elements of the dataset and the fundament of a precise linking is in the application of semantic textual similarity to link these datasets. However, most linking tools so far rely on only simple string similarity metrics such as Jaccard scores. We present an evaluation of some metrics that have performed well in recent semantic textual similarity evaluations and apply these to linking existing datasets.
  • 关键词:Linked data; link discovery; ontology alignment; semantic textual; similarity; structural similarity; NLP architectures.
国家哲学社会科学文献中心版权所有