Basic article information

  • Title: A Structure for Annotation and Ground-truthing of Urdu Handwritten Text Image Corpus
  • Authors: Prakash Choudhary ; Neeta Nain
  • Journal: Procedia - Social and Behavioral Sciences
  • Print ISSN: 1877-0428
  • Publication year: 2015
  • Volume: 198
  • Pages: 84-88
  • DOI: 10.1016/j.sbspro.2015.07.422
  • Language: English
  • Publisher: Elsevier
  • Abstract: Over the last few decades, substantial progress has been made in the field of handwriting recognition. Handwritten documents are becoming scarcer with the current trend toward digital electronics. However, investigation and research on a particular language require a large database of handwritten documents. In this paper we describe our approach to developing a large corpus of Urdu handwritten text images. To make the database usable across the broad field of Natural Language Processing, we annotate each image and associate it with XML-based ground-truth meta information, making it machine-readable as a linguistic resource. This paper focuses on some issues related to corpus design and annotation, such as data collection, writer selection, and annotation methodology.
  • Keywords: Urdu Corpus; Annotation; Groundtruthing; Handwritten Documents; Documents Analysis