Basic Article Information

  • Title: An Algorithm for Morphological Segmentation of Esperanto Words
  • Author: Theresa Guinard
  • Journal: The Prague Bulletin of Mathematical Linguistics
  • Print ISSN: 0032-6585
  • Electronic ISSN: 1804-0462
  • Publication year: 2016
  • Volume: 105
  • Issue: 1
  • Pages: 63-76
  • DOI: 10.1515/pralin-2016-0003
  • Language: English
  • Publisher: Walter de Gruyter GmbH
  • Abstract: Morphological analysis (finding the component morphemes of a word and tagging morphemes with part-of-speech information) is a useful preprocessing step in many natural language processing applications, especially for synthetic languages. Compound words from the constructed language Esperanto are formed by straightforward agglutination, but for many words, there is more than one possible sequence of component morphemes. However, one segmentation is usually more semantically probable than the others. This paper presents a modified n-gram Markov model that finds the most probable segmentation of any Esperanto word, where the model’s states represent morpheme part-of-speech and semantic classes. The overall segmentation accuracy was over 98% for a set of presegmented dictionary words.
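
To make the idea in the abstract concrete, here is a minimal sketch of ranking competing segmentations with a Markov model over morpheme classes. It is not the paper's algorithm: it uses a plain bigram model rather than the modified n-gram model the abstract mentions. The lexicon, the class names (`prep`, `root`, `suffix`, `ending`), the transition probabilities, and the function names are all hypothetical and chosen only for illustration. The example word "sentema" can be read as sent-em-a ("tending to feel") or sen-tem-a ("without a topic").

```python
import math

# Hypothetical toy lexicon (morpheme -> class); not taken from the paper.
LEXICON = {
    "sen": "prep",     # "without"
    "sent": "root",    # "feel"
    "tem": "root",     # "topic"
    "em": "suffix",    # tendency suffix
    "a": "ending",     # adjective ending
}

# Invented bigram transition probabilities between classes (illustration only).
TRANSITIONS = {
    ("<s>", "root"): 0.7,
    ("<s>", "prep"): 0.2,
    ("prep", "root"): 0.8,
    ("root", "suffix"): 0.3,
    ("root", "ending"): 0.6,
    ("suffix", "ending"): 0.8,
    ("ending", "</s>"): 0.9,
}


def segmentations(word):
    """Yield every way to split `word` into morphemes found in LEXICON."""
    if not word:
        yield []
        return
    for i in range(1, len(word) + 1):
        head = word[:i]
        if head in LEXICON:
            for rest in segmentations(word[i:]):
                yield [head] + rest


def log_score(morphemes):
    """Sum of log bigram probabilities over the class sequence."""
    tags = ["<s>"] + [LEXICON[m] for m in morphemes] + ["</s>"]
    total = 0.0
    for prev, cur in zip(tags, tags[1:]):
        p = TRANSITIONS.get((prev, cur), 0.0)
        if p == 0.0:
            return float("-inf")  # unseen transition: rule the candidate out
        total += math.log(p)
    return total


def best_segmentation(word):
    """Return the highest-scoring segmentation, or None if none exists."""
    candidates = list(segmentations(word))
    if not candidates:
        return None
    return max(candidates, key=log_score)


if __name__ == "__main__":
    print(best_segmentation("sentema"))
```

With these invented numbers, the root-suffix-ending reading outscores the preposition-root-ending reading. A realistic system would estimate the transition probabilities from segmented data and use finer-grained classes, as the abstract describes.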