文章基本信息

标题：Text Semantic Classification of Long Discourses Based on Neural Networks with Improved Focal Loss
本地全文：下载
作者：Dan Jiang ; Jin He
期刊名称：Computational Intelligence and Neuroscience
印刷版ISSN：1687-5265
电子版ISSN：1687-5273
出版年度：2021
卷号：2021
页码：1-9
DOI：10.1155/2021/8845362
出版社：Hindawi Publishing Corporation
摘要：Semantic classification of Chinese long discourses is an important and challenging task. Discourse text is high-dimensional and sparse. Furthermore, when the number of classes of dataset is large, the data distribution will be seriously imbalanced. In solving these problems, we propose a novel end-to-end model called CRAFL, which is based on the convolutional layer with attention mechanism, recurrent neural networks, and improved focal loss function. First, the residual network (ResNet) extracts phrase semantic representations from word embedding vectors and reduces the dimensionality of the input matrix. Then, the attention mechanism differentiates the focus on the output of ResNet, and the long short-term memory layer learns the features of the sequences. Lastly but most significantly, we apply an improved focal loss function to mitigate the problem of data class imbalance. Our model is compared with other state-of-the-art models on the long discourse dataset, and CRAFL model has proven be more efficient for this task.