首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:Semantic Segmentation of Text Using Deep Learning
  • 本地全文:下载
  • 作者:Tiziano Lattisi ; Davide Farina ; Marco Ronchetti
  • 期刊名称:COMPUTING AND INFORMATICS
  • 印刷版ISSN:1335-9150
  • 出版年度:2022
  • 卷号:41
  • 期号:1
  • 页码:78-97
  • DOI:10.31577/cai_2022_1_78
  • 语种:English
  • 出版社:COMPUTING AND INFORMATICS
  • 摘要:Given a text, can we segment it into semantically coherent sections in an automatic way? Can we detect the semantic boundaries, if we know how many they are? Can we determine how many semantically distinct sections are in the text? These are the questions we address in this paper. To respond, we use the Bidirectional Encoder Representation from Transformer (BERT) to analyze the text and evaluate a function that we call local incoherence, which we expect to show maxima at the points where a semantic boundary is detected. Our results, although preliminary, are encouraging and suggest that our approach can be successfully applied. However, they are quite sensitive with respect to the text quality, as it happens in the case in which the text is derived from an audio stream via Automatic Speech Recognition techniques.
  • 关键词:Text segmentation;semantic boundaries;BERT
国家哲学社会科学文献中心版权所有