Article Information

  • Title: A Scheme Towards Automatic Word Indexation System for Balinese Palm Leaf Manuscripts
  • Authors: Made Windu Antara Kesiman; Gede Aditra Pradnyana
  • Journal: Journal of ICT Research and Applications
  • Print ISSN: 2337-5787
  • Electronic ISSN: 2338-5499
  • Year of publication: 2021
  • Volume: 15
  • Issue: 2
  • Pages: 105-119
  • Language: English
  • Publisher: Institut Teknologi Bandung
  • Abstract: This paper proposes an initial scheme towards the development of an automatic word indexation system for Balinese lontar (palm leaf manuscript) collections. The word indexation system scheme consists of a submodule for extracting patch images of text areas in lontars and a submodule for word image transliteration. This is the first word indexation system for lontar collections to be proposed. To detect parts of a lontar image that contain text, a Gabor filter is used to provide initial information about the presence of text texture in the image. An adaptive sliding patch algorithm for the extraction of patch images in lontars is also proposed. The word image transliteration submodule was built using the long short-term memory (LSTM) model. The results showed that the process of extracting image patches of text areas succeeded in optimally detecting text areas in lontars and extracting the patch images at suitable positions. The proposed scheme successfully extracted between 20% and 40% of the keywords in lontars and thus can at least provide prospective lontar readers with an initial description of the content of a lontar collection, or help them find which lontar collection contains certain keywords.
  • Keywords: Balinese palm leaf manuscript; patch extraction; text detection; transliteration; word indexing