Article information

  • Title: Correcting spelling errors by modelling their causes
  • Authors: Sebastian Deorowicz; Marcin G. Ciura
  • Journal: International Journal of Applied Mathematics and Computer Science
  • Electronic ISSN: 2083-8492
  • Year of publication: 2005
  • Volume: 15
  • Issue: 2
  • Publisher: De Gruyter Open
  • Abstract: This paper accounts for a new technique of correcting isolated words in typed texts. A language-dependent set of string substitutions reflects the surface form of errors that result from vocabulary incompetence, misspellings, or mistypings. Candidate corrections are formed by applying the substitutions to text words absent from the computer lexicon. A minimal acyclic deterministic finite automaton storing the lexicon allows quick rejection of nonsense corrections, while costs associated with the substitutions serve to rank the remaining ones. A comparison of the correction lists generated by several spellcheckers for two corpora of English spelling errors shows that our technique suggests the right words more accurately than the others.
  • Keywords: spelling correction; finite state automata; spelling errors
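
The pipeline outlined in the abstract (apply substitutions to an unknown word, reject candidates that are not in the lexicon, rank the rest by cost) can be illustrated with a minimal sketch. It is not the authors' implementation: the substitution rules, their costs, and the tiny lexicon are hypothetical, only one substitution is applied per candidate, and a plain Python set stands in for the minimal acyclic deterministic finite automaton.

```python
# Hypothetical rules: (erroneous fragment, intended fragment, cost).
# The paper uses a language-dependent rule set; these are made up for illustration.
SUBSTITUTIONS = [
    ("ie", "ei", 1.0),
    ("ei", "ie", 1.0),
    ("f", "ph", 1.5),
    ("e", "a", 2.0),
]

# A plain set stands in for the minimal acyclic DFA that stores the lexicon.
LEXICON = {"receive", "believe", "phone", "separate"}


def suggest(word, substitutions=SUBSTITUTIONS, lexicon=LEXICON):
    """Return (candidate, cost) pairs for a word absent from the lexicon.

    Each rule is tried at every position where its left-hand side occurs;
    candidates outside the lexicon are rejected, the rest are sorted by cost.
    """
    if word in lexicon:
        return []  # only words missing from the lexicon are corrected
    best = {}
    for wrong, right, cost in substitutions:
        start = word.find(wrong)
        while start != -1:
            candidate = word[:start] + right + word[start + len(wrong):]
            if candidate in lexicon and cost < best.get(candidate, float("inf")):
                best[candidate] = cost
            start = word.find(wrong, start + 1)
    # Cheapest substitution first; ties are broken alphabetically.
    return sorted(best.items(), key=lambda pair: (pair[1], pair[0]))


print(suggest("recieve"))   # [('receive', 1.0)]
print(suggest("seperate"))  # [('separate', 2.0)]
```

A set lookup answers only whole-word membership queries. An automaton can additionally abandon a candidate as soon as its prefix leaves the lexicon, which is presumably what makes the quick rejection mentioned in the abstract possible.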