首页    期刊浏览 2024年11月06日 星期三
登录注册

文章基本信息

  • 标题:Creating and Weighting Hunspell Dictionariesas Finite-State Automata
  • 本地全文:下载
  • 作者:Tommi Pirinen ; Krister Lindén
  • 期刊名称:Investigationes Linguisticae (Online)
  • 印刷版ISSN:1426-188X
  • 电子版ISSN:1733-1757
  • 出版年度:2017
  • 卷号:21
  • 页码:1
  • DOI:10.14746/il.2010.21.1
  • 出版社:Adam Mickiewicz University
  • 摘要:Therearenumerousformatsforwritingspell-checkersforopen-source systems and there are many lexical descriptions for natural languages written in these formats. In this paper, we demonstrate a method for converting Hunspell and related spell-checking lexicons into finite-state automata. We also present a simple way to apply unigram corpus training in order to improve the spellcheckingsuggestionmechanismusingweightedfinite-statetechnology.Whatwe propose is a generic and efficient language-independent framework of weighted finite-stateautomataforspell checkingintypicalopen-sourcesoftware,e.g.Mozilla Firefox, OpenOffice and the Gnome desktop.
国家哲学社会科学文献中心版权所有