首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:Performance Analysis of Resampling Techniques on Class Imbalance Issue in Software Defect Prediction
  • 本地全文:下载
  • 作者:Ahmed Iqbal ; Shabib Aftab ; Faseeha Matloob
  • 期刊名称:International Journal of Information Technology and Computer Science
  • 印刷版ISSN:2074-9007
  • 电子版ISSN:2074-9015
  • 出版年度:2019
  • 卷号:11
  • 期号:11
  • 页码:44-53
  • DOI:10.5815/ijitcs.2019.11.05
  • 出版社:MECS Publisher
  • 摘要:Predicting the defects at early stage of software development life cycle can improve the quality of end product at lower cost. Machine learning techniques have been proved to be an effective way for software defect prediction however an imbalance dataset of software defects is the main issue of lower and biased performance of classifiers. This issue can be resolved by applying the re-sampling methods on software defect dataset before the classification process. This research analyzes the performance of three widely used resampling techniques on class imbalance issue for software defect prediction. The resampling techniques include: “Random Under Sampling”, “Random Over Sampling” and “Synthetic Minority Oversampling Technique (SMOTE)”. For experiments, 12 publically available cleaned NASA MDP datasets are used with 10 widely used supervised machine learning classifiers. The performance is evaluated through various measures including: F-measure, Accuracy, MCC and ROC. According to results, most of the classifiers performed better with “Random Over Sampling” technique in many datasets.
  • 关键词:Software Defect predication;Imbalanced Dataset;Resampling Methods;Random Under Sampling;Random Oversampling;Synthetic Minority Oversampling Technique
国家哲学社会科学文献中心版权所有