首页    期刊浏览 2024年09月07日 星期六
登录注册

文章基本信息

  • 标题:Statistical String Similarity Model for Information Linkage
  • 本地全文:下载
  • 作者:Atsuhiro Takasu
  • 期刊名称:Progress in Informatics
  • 印刷版ISSN:1349-8614
  • 电子版ISSN:1349-8606
  • 出版年度:2009
  • 期号:06
  • DOI:10.2201/NiiPi.2009.6.7
  • 出版社:National Institute of Informatics
  • 摘要:

    This paper proposes a statistical string similarity model for approximate matching in information linkage. The proposed similarity model is an extension of hidden Markov model and its learnable ability realizes string matching function adaptable to various information sources. The main contribution of this paper is to develop an efficient learning algorithm for estimating parameters of the statistical similarity model. The proposed algorithm is based on the Expectation-Maximization (EM) technique where dynamic programing technique is used to update parameters in EM process.

  • 关键词:String similarity; statistical model; EM algorithm
国家哲学社会科学文献中心版权所有