期刊名称:IOP Conference Series: Earth and Environmental Science
印刷版ISSN:1755-1307
电子版ISSN:1755-1315
出版年度:2019
卷号:242
期号:5
页码:1-7
DOI:10.1088/1755-1315/242/5/052035
出版社:IOP Publishing
摘要:The identification of tissue-specific alternative polyadenylation (tsAPA) sites contributes to the research on gene expression regulation and transcriptome diversity in rice. However, identifying the tsAPA sites in plants is difficult, because of the dispersion, variability, complexity of their features and the lack of related research. A hybrid feature selection algorithm called SRBT, based on the SVM-RFE and Boruta, was presented to identify the tsAPA sites in rice. In the experiment, the tsAPA sites data were adopted to reduce dimension with SRBT algorithm and then classified by the support vector machine (SVM). The results show that the proposed method can effectively extract important features and obtain a higher average prediction accuracy of 81%, compared with SVM-RFE, Boruta, GAFS, T-test and ReliefF. The SRBT works well in identifying the tsAPA sites, which offers an effective method for further analysis of the tsAPA in gene expression and transcription during the growth of rice.