期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
印刷版ISSN:2158-107X
电子版ISSN:2156-5570
出版年度:2020
卷号:11
期号:10
DOI:10.14569/IJACSA.2020.0111086
出版社:Science and Information Society (SAI)
摘要:There is a massive growth of text documents on the web. This led to the increasing need for methods that can organize and classify electronic documents (instances) automati-cally. Multi-label classification task is widely used in real-world problems and it has been applied on di˙erent applications. It assigns multiple labels for each document simultaneously. Few and insuÿcient research studies have investigated the multi-label text classification problem in the Arabic language. Therefore, this survey paper aims to present an extensive review of the existing multi-label classification methods and techniques that can deal with multi-label problem. Besides, we focus on Arabic language by covering the relevant applications of multi-label classification on the Arabic text, and identify the main challenges faced by these studies. Furthermore, this survey presents an experimental comparisons of di˙erent multi-label classification methods applied for the Arabic context and points out some baseline results. We found that further investigations are also needed to improve the multi-label classification task in the Arabic language, especially the hierarchical classification task.
关键词:Machine learning; text classification; multi-label classification; Arabic natural language processing; hierarchical classification; Lexicon approach