
Article Information

  • Title: TGT: A Novel Adversarial Guided Oversampling Technique for Handling Imbalanced Datasets
  • Authors: Ayat Mahmoud; Ayman El-Kilany; Farid Ali
  • Journal: Egyptian Informatics Journal
  • Print ISSN: 1110-8665
  • Year of publication: 2021
  • Volume: 22
  • Issue: 4
  • Pages: 433-438
  • DOI: 10.1016/j.eij.2021.01.002
  • Language: English
  • Publisher: Elsevier
  • Abstract: With the volume of data increasing exponentially, there is growing interest in helping people benefit from their data regardless of its poor quality. One of the major data quality problems is the imbalanced distribution of the different categories in the data. Such a problem affects the performance of any analysis or mining carried out on the data. For instance, data with an imbalanced distribution has a negative effect on the performance achieved by most traditional classification techniques. This paper proposes TGT (Train Generate Test), a novel oversampling technique for handling the imbalanced datasets problem. Using different learning strategies, TGT guarantees that the generated synthetic samples reside in minority regions. TGT showed a high improvement in the performance of different classification techniques when tested on five imbalanced datasets of different types.
  • Keywords: Imbalance; Oversampling; Classification