
Basic article information

  • Title: Zero-shot Neural Passage Retrieval via Domain-targeted Synthetic Question Generation
  • Authors: Ji Ma; Ivan Korotkov; Yinfei Yang
  • Venue: Conference of the European Chapter of the Association for Computational Linguistics (EACL)
  • Publication year: 2021
  • Volume: 2021
  • Pages: 1075-1088
  • DOI: 10.18653/v1/2021.eacl-main.92
  • Language: English
  • Publisher: ACL Anthology
  • Abstract: A major obstacle to the wide-spread adoption of neural retrieval models is that they require large supervised training sets to surpass traditional term-based techniques, which are constructed from raw corpora. In this paper, we propose an approach to zero-shot learning for passage retrieval that uses synthetic question generation to close this gap. The question generation system is trained on general domain data, but is applied to documents in the targeted domain. This allows us to create arbitrarily large, yet noisy, question-passage relevance pairs that are domain specific. Furthermore, when this is coupled with a simple hybrid term-neural model, first-stage retrieval performance can be improved further. Empirically, we show that this is an effective strategy for building neural passage retrieval models in the absence of large training corpora. Depending on the domain, this technique can even approach the accuracy of supervised models.
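
The two ideas in the abstract, synthetic question-passage pairs and a hybrid term-neural score, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the question generator works or how the term and neural scores are combined, so the linear interpolation, the toy term-overlap score, and the names `make_synthetic_pairs`, `term_score`, `dense_score`, `hybrid_score`, and `weight` are all assumptions made for illustration.

```python
from collections import Counter


def make_synthetic_pairs(passages, generate_question):
    """Pair every target-domain passage with a generated question.

    `generate_question` is a hypothetical callable standing in for a
    question generator trained on general-domain data.
    """
    return [(generate_question(p), p) for p in passages]


def term_score(query, passage):
    """Toy term-overlap count, a stand-in for a term-based ranker."""
    q = Counter(query.lower().split())
    p = Counter(passage.lower().split())
    return float(sum(min(q[t], p[t]) for t in q))


def dense_score(q_vec, p_vec):
    """Dot product of query and passage embeddings (assumed precomputed)."""
    return sum(a * b for a, b in zip(q_vec, p_vec))


def hybrid_score(query, passage, q_vec, p_vec, weight=0.5):
    """Assumed combination: linear interpolation of the two scores."""
    return (weight * term_score(query, passage)
            + (1.0 - weight) * dense_score(q_vec, p_vec))
```

In this sketch, the pairs returned by `make_synthetic_pairs` would serve as noisy training data for whatever encoder produces `q_vec` and `p_vec`, and `weight` would be tuned per domain.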