期刊论文详细信息
Bulletin of the Korean chemical society
A New Variable Selection Method Based on Mutual Information Maximization by Replacing Collinear Variables for Nonlinear Quantitative Structure-Property Relationship Models
Jahan B. Ghasemi1  Ehsan Zolfonoun1 
关键词: Mutual information;    Variable selection;    Quantitative structure-property relationship;   
DOI  :  
学科分类:化学(综合)
来源: Korean Chemical Society
PDF
【 摘 要 】

Selection of the most informative molecular descriptors from the original data set is a key step for development of quantitative structure activity/property relationship models. Recently, mutual information (MI) has gained increasing attention in feature selection problems. This paper presents an effective mutual information-based feature selection approach, named mutual information maximization by replacing collinear variables (MIMRCV), for nonlinear quantitative structure-property relationship models. The proposed variable selection method was applied to three different QSPR datasets, soil degradation half-life of 47 organophosphorus pesticides, GC-MS retention times of 85 volatile organic compounds, and water-to-micellar cetyltrimethylammonium bromide partition coefficients of 62 organic compounds.The obtained results revealed that using MIMRCV as feature selection method improves the predictive quality of the developed models compared to conventional MI based variable selection algorithms.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912010243780ZK.pdf 1100KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:9次