首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Shift-reduce Spinal TAG Parsing with Dynamic Programming
  • 本地全文:下载
  • 作者:Katsuhiko Hayashi ; Jun Suzuki ; Masaaki Nagata
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2016
  • 卷号:11
  • 页码:93-100
  • DOI:10.11185/imt.11.93
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:The spinal tree adjoining grammar (TAG) parsing model of [Carreras 08] achieves the current state-of-the-art constituent parsing accuracy on the commonly used English Penn Treebank evaluation setting. Unfortunately, the model has the serious drawback of low parsing efficiency since its Eisner-CKY style parsing algorithm needs O ( n 4) computation time for input length n . This paper investigates a more practical solution and presents a beam search shift-reduce algorithm for spinal TAG parsing. Since the algorithm works in O ( bn ) ( b is beam width), it can be expected to provide a significant improvement in parsing speed. However, to achieve faster parsing, it needs to prune a large number of candidates in an exponentially large search space and often suffers from severe search errors. In fact, our experiments show that the basic beam search shift-reduce parser does not work well for spinal TAGs. To alleviate this problem, we extend the proposed shift-reduce algorithm with two techniques: Dynamic Programming of [Huang 10a] and Supertagging. The proposed extended parsing algorithm is about 8 times faster than the Berkeley parser , which is well-known to be fast constituent parsing software, while offering state-of-the-art performance. Moreover, we conduct experiments on the Keyaki Treebank for Japanese to show that the good performance of our proposed parser is language-independent.
  • 关键词:spinal tree adjoining grammar;transition-based parsing;dynamic programming;supertagging
国家哲学社会科学文献中心版权所有