期刊名称:International Journal of Web & Semantic Technology
印刷版ISSN:0976-2280
电子版ISSN:0975-9026
出版年度:2013
卷号:4
期号:3
DOI:10.5121/ijwest.2013.4304
出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:Question classification is very important for question answering. This paper present our research work onquestion classification through machine learning approach. In order to train the learning model, wedesigned a rich set of features that are predictive of question categories. An important component ofquestion answering systems is question classification. The task of question classification is to predict theentity type of the answer of a natural language question. Question classification is typically done usingmachine learning techniques. Different lexical, syntactical and semantic features can be extracted from aquestion. In this work we combined lexical, syntactic and semantic features which improve the accuracy ofclassification. F urthermore, we adopted three different classifiers: Nearest Neighbors (NN), Na.ve Bayes(NB), and Support Vector Machines (SVM) using two kinds of features: bag-of-words and bag-of n grams.Furthermore, we discovered that when we take SVM classifier and combine the semantic, syntactic, lexicalfeature we found that it will improve the accuracy of classification. We tested our proposed approaches onthe well-known UIUC dataset and succeeded to achieve anew record on the accuracy of classification onthis dataset.