4th International Conference on Energy Equipment Science and Engineering | |
A Feature Selection Method for Tissue-Specific Alternative Polyadenylation Sites Data in Rice | |
Zou, Wenbing^1 ; Chen, Moliang^1 ; Li, Shuchao^1 ; Ye, Pengchao^1 ; Ji, Guoli^1 | |
Department of Automation, Xiamen University, Fujian Province, Xiamen | |
361005, China^1 | |
关键词: Feature selection methods; Hybrid feature selections; Important features; Polyadenylation; Prediction accuracy; ReliefF; Tissue specifics; Transcriptomes; | |
Others : https://iopscience.iop.org/article/10.1088/1755-1315/242/5/052035/pdf DOI : 10.1088/1755-1315/242/5/052035 |
|
来源: IOP | |
【 摘 要 】
The identification of tissue-specific alternative polyadenylation (tsAPA) sites contributes to the research on gene expression regulation and transcriptome diversity in rice. However, identifying the tsAPA sites in plants is difficult, because of the dispersion, variability, complexity of their features and the lack of related research. A hybrid feature selection algorithm called SRBT, based on the SVM-RFE and Boruta, was presented to identify the tsAPA sites in rice. In the experiment, the tsAPA sites data were adopted to reduce dimension with SRBT algorithm and then classified by the support vector machine (SVM). The results show that the proposed method can effectively extract important features and obtain a higher average prediction accuracy of 81%, compared with SVM-RFE, Boruta, GAFS, T-test and ReliefF. The SRBT works well in identifying the tsAPA sites, which offers an effective method for further analysis of the tsAPA in gene expression and transcription during the growth of rice.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
A Feature Selection Method for Tissue-Specific Alternative Polyadenylation Sites Data in Rice | 545KB | download |