
Basic article information

  • Title: Machine Learning Approach for Answer Detection in Discussion Forums: An Application of Big Data Analytics
  • Authors: Atif Khan ; Ibrahim Ibrahim ; M. Irfan Uddin
  • Journal: Scientific Programming
  • Print ISSN: 1058-9244
  • Publication year: 2020
  • Volume: 2020
  • Pages: 1-10
  • DOI: 10.1155/2020/4621196
  • Publisher: Hindawi Publishing Corporation
  • Abstract:

    Data are flooding into online web forums, and it is highly desirable to turn this vast amount of data into actionable knowledge. Online web forums have become an integral part of the web and are a major source of knowledge. People use these platforms to post questions and get answers from other forum members. An initial post (question) usually receives more than one reply post (answer), which makes it difficult for a user to scan all of them for the most relevant, highest-quality answer. Automatically extracting the most relevant answer to a question within a thread is therefore an important problem. In this research, we treat answer extraction as a classification problem: a reply post is classified as relevant, partially relevant, or irrelevant to the initial post. To measure the relevance/similarity of a reply to the question, both lexical and nonlexical features are used. We propose using LinearSVC, a variant of the support vector machine (SVM), for answer classification. Two feature selection techniques, chi-square and univariate selection, are employed to reduce the size of the feature space. The experimental results show that the LinearSVC classifier outperforms the other state-of-the-art classifiers in terms of classification accuracy on both the Ubuntu and TripAdvisor (NYC) discussion forum datasets.
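    The chi-square feature selection step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses the textbook 2x2 chi-square statistic on term presence per class, and the function names (`chi_square_scores`, `top_k_terms`), the whitespace tokenizer, and the toy reply posts are all assumptions made for illustration.

```python
from collections import Counter


def chi_square_scores(docs, labels):
    """Score each term by its largest 2x2 chi-square statistic over all classes.

    Assumption: a term counts as "present" if it occurs at least once in a
    reply post; tokenization is a plain lowercase whitespace split.
    """
    n = len(docs)
    class_counts = Counter(labels)
    token_sets = [set(doc.lower().split()) for doc in docs]
    vocab = set().union(*token_sets)

    scores = {}
    for term in vocab:
        best = 0.0
        for cls, n_cls in class_counts.items():
            # 2x2 contingency table: term present/absent vs. class cls/other
            present_in = sum(1 for toks, y in zip(token_sets, labels)
                             if term in toks and y == cls)
            present_out = sum(1 for toks, y in zip(token_sets, labels)
                              if term in toks and y != cls)
            absent_in = n_cls - present_in
            absent_out = n - present_in - present_out - absent_in
            denom = ((present_in + present_out) * (absent_in + absent_out)
                     * (present_in + absent_in) * (present_out + absent_out))
            if denom:
                diff = present_in * absent_out - present_out * absent_in
                best = max(best, n * diff ** 2 / denom)
        scores[term] = best
    return scores


def top_k_terms(scores, k):
    """Keep the k highest-scoring terms; ties are broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Hypothetical reply posts with relevance labels (not from the paper's datasets).
replies = [
    "run sudo apt update",
    "try sudo apt install",
    "thanks for help",
    "same problem here",
]
relevance = ["relevant", "relevant", "irrelevant", "irrelevant"]

scores = chi_square_scores(replies, relevance)
selected = top_k_terms(scores, 2)
```

    In a full pipeline, only the selected terms would be kept as lexical features before training a linear SVM classifier such as LinearSVC; that training step is omitted here.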
