首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:A Python-based Interface for Wide Coverage Lexicalized Tree-adjoining Grammars
  • 作者:Ziqi Wang ; Haotian Zhang ; Anoop Sarkar
  • 期刊名称:The Prague Bulletin of Mathematical Linguistics
  • 印刷版ISSN:0032-6585
  • 电子版ISSN:1804-0462
  • 出版年度:2015
  • 卷号:103
  • 期号:1
  • 页码:139-159
  • DOI:10.1515/pralin-2015-0008
  • 语种:English
  • 出版社:Walter de Gruyter GmbH
  • 摘要:This paper describes the design and implementation of a Python-based interface for wide coverage Lexicalized Tree-adjoining Grammars. The grammars are part of the XTAG Grammar project at the University of Pennsylvania, which were hand-written and semi-automatically curated to parse real-world corpora. We provide an interface to the wide coverage English and Korean XTAG grammars. Each XTAG grammar is lexicalized, which means at least one word selects a tree fragment (called an elementary tree or etree). Derivations for sentences are built by combining etrees using substitution (replacement of a tree node with an etree at the frontier of another etree) and adjunction (replacement of an internal tree node in an etree by another etree). Each etree is associated with a feature structure representing constraints on substitution and adjunction. Feature structures are combined using unification during the combination of etrees. We plan to integrate our toolkit for XTAG grammars into the Python-based Natural Language Toolkit (NLTK: nltk.org). We have provided an API capable of searching the lexicalized etrees for a given word or multiple words, searching for a etree by name or function, display the lexicalized etrees to the user using a graphical view, display the feature structure associated with each tree node in an etree, hide or highlight features based on a regular expression, and browsing the entire tree database for each XTAG grammar.
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有