首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Transformer based Model for Coherence Evaluation of Scientific Abstracts: Second Fine-tuned BERT
  • 本地全文:下载
  • 作者:Anyelo-Carlos Gutierrez-Choque ; Vivian Medina-Mamani ; Eveling Castro-Gutierrez
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2022
  • 卷号:13
  • 期号:5
  • DOI:10.14569/IJACSA.2022.01305105
  • 语种:English
  • 出版社:Science and Information Society (SAI)
  • 摘要:Coherence evaluation is a problem related to the area of natural language processing whose complexity lies mainly in the analysis of the semantics and context of the words in the text. Fortunately, the Bidirectional Encoder Representation from Transformers (BERT) architecture can capture the aforemen-tioned variables and represent them as embeddings to perform Fine-tunings. The present study proposes a Second Fine-Tuned model based on BERT to detect inconsistent sentences (coherence evaluation) in scientific abstracts written in English/Spanish. For this purpose, 2 formal methods for the generation of inconsistent abstracts have been proposed: Random Manipulation (RM) and K-means Random Manipulation (KRM). Six experiments were performed; showing that performing Second Fine-Tuned improves the detection of inconsistent sentences with an accuracy of 71%. This happens even if the new retraining data are of different language or different domain. It was also shown that using several methods for generating inconsistent abstracts and mixing them when performing Second Fine-Tuned does not provide better results than using a single technique.
  • 关键词:Coherence evaluation; inconsistent sentences detec-tion; BERT; second fine-tuned
国家哲学社会科学文献中心版权所有