首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:QUESTION ANSWERING SYSTEM SUPPORTING VECTOR MACHINE METHOD FOR HADITH DOMAIN
  • 本地全文:下载
  • 作者:NABEEL NEAMAH ; SAIDAH SAAD
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2017
  • 卷号:95
  • 期号:7
  • 出版社:Journal of Theoretical and Applied
  • 摘要:Retrieving accurate answers based on users query is the main issue of question answering systems. Challenges such as analyse the need of users query and extract accurate answers from large corpus are increase the difficulty of developing effective question answering system. This work aims to enhance the accuracy of question answering system for hadiths using useful methods. Pre-processing methods like tokenization and stop-word removal is used to identify the main concepts of users query. Answering processing methods and techniques like N-gram, WordNet, CS, and LCS are used to update and enrich the extracted concepts of users query based on the formal representation of hadiths answers or documents. Support Vector Machine (SVM) and Name Entity Recognition (NER) methods are conducted to classify Hadiths documents based on relevant subjects and questions types in order to reduce the searching scope of answers documents. Documents in Hadith corpus are classified according to proposed question types, and related subjects as four main classes which are: when for pray, where for pray, when for fasting, and where for fasting. The SVM classification of documents is accomplished supporting NER methods to identify the places (where) and time (when) features that included in the documents. The proposed question answering system is tested using 132 Hadiths documents about Fasting and Pray that are selected from Al-Bukhari source. The findings revealed that the average answers accuracy using CS technique is 67%, the average answers accuracy using LCS technique is 66%, the average answers accuracy using combination of CS and LCS techniques is 70%, and the average answers accuracy using CS, LCS, and SVM is 80%. SVM enhance the system accuracy up to 10% more than using other methods without classification processes. The main contribution of this research is using SVM method to reduce searching scope of Hadiths documents based on various subjects and question types beside effective analysis of query need using NLP methods. SVM provides more accurate answers than extracting answers using only similarity techniques such as CS and LCS.
  • 关键词:Question Answering System; Hadiths; Pre-processing; Answers Processing; SVM; NER.
国家哲学社会科学文献中心版权所有