首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Improving classification of mature microRNA by solving class imbalance problem
  • 本地全文:下载
  • 作者:Ying Wang ; Xiaoye Li ; Bairui Tao
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2016
  • 卷号:6
  • 期号:1
  • DOI:10.1038/srep25941
  • 语种:English
  • 出版社:Springer Nature
  • 摘要:MicroRNAs (miRNAs) are ~20-25 nucleotides non-coding RNAs, which regulated gene expression in the post-transcriptional level. The accurate rate of identifying the start sit of mature miRNA from a given pre-miRNA remains lower. It is noting that the mature miRNA prediction is a class-imbalanced problem which also leads to the unsatisfactory performance of these methods. We improved the prediction accuracy of classifier using balanced datasets and presented MatFind which is used for identifying 5' mature miRNAs candidates from their pre-miRNA based on ensemble SVM classifiers with idea of adaboost. Firstly, the balanced-dataset was extract based on K-nearest neighbor algorithm. Secondly, the multiple SVM classifiers were trained in orderly using the balance datasets base on represented features. At last, all SVM classifiers were combined together to form the ensemble classifier. Our results on independent testing dataset show that the proposed method is more efficient than one without treating class imbalance problem. Moreover, MatFind achieves much higher classification accuracy than other three approaches. The ensemble SVM classifiers and balanced-datasets can solve the class-imbalanced problem, as well as improve performance of classifier for mature miRNA identification. MatFind is an accurate and fast method for 5' mature miRNA identification.
国家哲学社会科学文献中心版权所有