期刊论文

【摘要】

BackgroundSupervised machine learning approaches have been recently adopted in the inference of transcriptional targets from high throughput trascriptomic and proteomic data showing major improvements from with respect to the state of the art of reverse gene regulatory network methods. Beside traditional unsupervised techniques, a supervised classifier learns, from known examples, a function that is able to recognize new relationships for new data. In the context of gene regulatory inference a supervised classifier is coerced to learn from positive and unlabeled examples, as the counter negative examples are unavailable or hard to collect. Such a condition could limit the performance of the classifier especially when the amount of training examples is low.ResultsIn this paper we improve the supervised identification of transcriptional targets by selecting reliable counter negative examples from the unlabeled set. We introduce an heuristic based on the known topology of transcriptional networks that in fact restores the conventional positive/negative training condition and shows a significant improvement of the classification performance. We empirically evaluate the proposed heuristic with the experimental datasets of Escherichia coli and show an example of application in the prediction of BCL6 direct core targets in normal germinal center human B cells obtaining a precision of 60%.ConclusionsThe availability of only positive examples in learning transcriptional relationships negatively affects the performance of supervised classifiers. We show that the selection of reliable negative examples, a practice adopted in text mining approaches, improves the performance of such classifiers opening new perspectives in the identification of new transcriptional targets.

【授权许可】

CC BY
© Cerulo et al.; licensee BioMed Central Ltd. 2013

【预览】

附件列表
Files	Size	Format	View
RO202311091573605ZK.pdf	1700KB	PDF	download

【参考文献】

[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
[42]
[43]

BMC Bioinformatics
A negative selection heuristic to predict new transcriptional targets
Research
Vincenzo Paduano¹ Michele Ceccarelli² Luigi Cerulo² Pietro Zoppoli³
[1] BioGeM s.c.a r.l., Institute of Genetic Research "Gaetano Salvatore", Ariano Irpino, (AV), Italy;Department of Science, University of Sannio, Benevento, Italy;BioGeM s.c.a r.l., Institute of Genetic Research "Gaetano Salvatore", Ariano Irpino, (AV), Italy;Institute for Cancer Genetics, Columbia University, New York, NY, USA;
关键词: Support Vector Machine; Negative Selection; Gene Regulatory Network; Support Vector Machine Classifier; Unlabeled Data;
DOI : 10.1186/1471-2105-14-S1-S3
来源: Springer
PDF


	文献评价指标
	下载次数：1次	浏览次数：0次

【 摘 要 】

【 授权许可】

【 预 览 】

【 参考文献 】

【摘要】

【授权许可】

【预览】

【参考文献】