首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:SVM Machine Learning Classifier to Automate the Extraction of SRS Elements
  • 本地全文:下载
  • 作者:Ayad Tareq Imam ; Aysh Alhroob ; Wael Jumah Alzyadat
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2021
  • 卷号:12
  • 期号:3
  • 页码:174-185
  • DOI:10.14569/IJACSA.2021.0120322
  • 出版社:Science and Information Society (SAI)
  • 摘要:The process of extraction of software entities such as system, use case, and actor from an English natural language description of a user’s software requirements is a linguistic and semantic process of a natural language processing application. Entity extraction is known to be a complicated and challenging problem by researchers in the fields of linguistics or computation, due to the ambiguities in natural languages. This paper presents a named entity recognition method called SyAcUcNER (System Actor Use-Case Named Entity Recognizer), for extracting the system, actor, and use case entities from unstructured English descriptions of user requirements for the software. SyAcUcNER uses one of the Machine Learning (ML) approaches, that is, the Support Vector Machine (SVM) as an effective classifier. Also, SyAcUcNER uses a semantic role labeling process to tag the words in the text of user software requirements. SyAcUcNER is the first work that defines the structure of a requirements engineering specialized NER, the first work that uses a specialized NER model as an approach for extracting actor and use case entities from English language requirements description, and the first time an SVM has been used to specify the semantic meanings of words in a certain domain of discourse; that is the Software Requirements Specification (SRS). The performance of SyAcUcNER, which utilizes WEKA’s SVM, is evaluated using a binomial technique, and the results gained from running SyAcUcNER on text corpora from assorted sources give weighted averages of 76.2% for precision, 76% for recall, and 72.1% for the F-measure.
  • 关键词:Information extraction; named entity recognition; machine learning; support vector machine; software requirement specification; WEKA; I-CASE
国家哲学社会科学文献中心版权所有