首页    期刊浏览 2024年09月19日 星期四
登录注册

文章基本信息

  • 标题:Automating Text Simplification Using Pictographs for People with Language Deficits
  • 本地全文:下载
  • 作者:Mai Farag Imam ; Amal Elsayed Aboutabl ; Ensaf H. Mohamed
  • 期刊名称:International Journal of Information Technology and Computer Science
  • 印刷版ISSN:2074-9007
  • 电子版ISSN:2074-9015
  • 出版年度:2019
  • 卷号:11
  • 期号:7
  • 页码:26-34
  • DOI:10.5815/ijitcs.2019.07.04
  • 出版社:MECS Publisher
  • 摘要:Automating text simplification is a challenging research area due to the compound structures present in natural languages. Social involvement of people with language deficits can be enhanced by providing them with means to communicate with the outside world, for instance using the internet independently. Using pictographs instead of text is one of such means. This paper presents a system which performs text simplification by translating text into pictographs. The proposed system consists of a set of phases. First, a simple summarization technique is used to decrease the number of sentences before converting them to pictures. Then, text preprocessing is performed including processes such as tokenization and lemmatization. The resulting text goes through a spelling checker followed by a word sense disambiguation algorithm to find words which are most suitable to the context in order to increase the accuracy of the result. Clearly, using WSD improves the results. Furthermore, when support vector machine is used for WSD, the system yields the best results. Finally, the text is translated into a list of images. For testing and evaluation purposes, a test corpus of 37 Basic English sentences has been manually constructed. Experiments are conducted by presenting the list of generated images to ten normal children who are asked to reproduce the input sentences based on the pictographs. The reproduced sentences are evaluated using precision, recall, and F-Score. Results show that the proposed system enhances pictograph understanding and succeeds to convert text to pictograph with precision, recall and F-score of over 90% when SVM is used for word sense disambiguation, also all these techniques are not combined together before which increases the accuracy of the system over all other studies.
  • 关键词:Natural language processing;pictographic communication;social inclusion;Text simplification;text summarization;word sense disambiguation
国家哲学社会科学文献中心版权所有