Basic Article Information

Title: Bidirectional Recurrent Neural Network Approach for Arabic Named Entity Recognition
Authors: Mohammed N. A. Ali; Guanzheng Tan; Aamir Hussain et al.
Journal: Future Internet
Electronic ISSN: 1999-5903
Publication year: 2018
Volume: 10
Issue: 12
Page: 123
DOI: 10.3390/fi10120123
Language: English
Publisher: MDPI Publishing
Abstract: Recurrent neural network (RNN) has achieved remarkable success in sequence labeling tasks with memory requirement. RNN can remember previous information of a sequence and can thus be used to solve natural language processing (NLP) tasks. Named entity recognition (NER) is a common task of NLP and can be considered a classification problem. We propose a bidirectional long short-term memory (LSTM) model for this entity recognition task of the Arabic text. The LSTM network can process sequences and relate to each part of it, which makes it useful for the NER task. Moreover, we use pre-trained word embedding to train the inputs that are fed into the LSTM network. The proposed model is evaluated on a popular dataset called “ANERcorp.” Experimental results show that the model with word embedding achieves a high F-score measure of approximately 88.01%.
Keywords: Arabic named entity recognition; bidirectional recurrent neural network; GRU; LSTM; natural language processing; word embedding