首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:DOCPRO: A Framework for Building Document Processing Systems
  • 本地全文:下载
  • 作者:Ming-Jen Huang ; Chun-Fang Huang ; Chiching Wei
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2020
  • 卷号:10
  • 期号:9
  • 页码:213-224
  • DOI:10.5121/csit.2020.100917
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:With the recent advance of the deep neural network, we observe new applications of natural language processing (NLP) and computer vision (CV) technologies. Especaully, when applying them to document processing, NLP and CV tasks are usually treated individually in research work and open source libraries. However, designing a real-world document processing system needs to weave NLP and CV tasks and their generated information together. There is a need to have a unified approach for processing documents containing textual and graphical elements with rich formats, diverse layout arrangement, and distinct semantics. This paper introduces a framework to fulfil this need. The framework includes a representation model definition for holding the generated information and specifications defining the coordination between the NLP and CV tasks.
  • 关键词:Document Processing ;Framework ;Formal definition ;Machine Learning.
国家哲学社会科学文献中心版权所有