
Article Information

  • Title: Binarization of Synchronous Context-Free Grammars
  • Authors: Liang Huang; Hao Zhang; Daniel Gildea
  • Journal: Computational Linguistics
  • Print ISSN: 0891-2017
  • Electronic ISSN: 1530-9312
  • Publication year: 2009
  • Volume: 35
  • Issue: 4
  • Pages: 559-595
  • DOI: 10.1162/coli.2009.35.4.35406
  • Language: English
  • Publisher: MIT Press
  • Abstract: Systems based on synchronous grammars and tree transducers promise to improve the quality of statistical machine translation output, but are often very computationally intensive. The complexity is exponential in the size of individual grammar rules due to arbitrary re-orderings between the two languages. We develop a theory of binarization for synchronous context-free grammars and present a linear-time algorithm for binarizing synchronous rules when possible. In our large-scale experiments, we found that almost all rules are binarizable and the resulting binarized rule set significantly improves the speed and accuracy of a state-of-the-art syntax-based machine translation system. We also discuss the more general, and computationally more difficult, problem of finding good parsing strategies for non-binarizable rules, and present an approximate polynomial-time algorithm for this problem.