Data Science Journal | |
Limits with modeling data and modeling data with limits | |
Lionello Pogliani1  | |
[1] Dipartimento di Chimica, Università della Calabria | |
关键词: Modeling; Solubility; Amino acids; Bases; Incomplete Data; Molecular Connectivity Indices; Configuration Interaction of Graph-Type Basis Indices; | |
DOI : 10.2481/dsj.1.203 | |
学科分类:计算机科学(综合) | |
来源: Ubiquity Press Ltd. | |
【 摘 要 】
References(34)Modeling of the solubility of amino acids and purine and pyrimidine bases with a set of sixteen molecular descriptors has been thoroughly analyzed to detect and understand the reasons for anomalies in the description of this property for these two classes of compounds. Unsatisfactory modeling can be ascribed to incomplete collateral data, i.e, to the fact that there is insufficient data known about the behavior of these compounds in solution. This is usually because intermolecular forces cannot be modeled. The anomalous modeling can be detected from the rather large values of the standard deviation of the estimates of the whole set of compounds, and from the unsatisfactory modeling of some of the subsets of these compounds. Thus the detected abnormalities can be used (i) to get an idea about weak intermolecular interactions such as hydration, self-association, the hydrogen-bond phenomena in solution, and (ii) to reshape the molecular descriptors with the introduction of parameters that allow better modeling. This last procedure should be used with care, bearing in mind that the solubility phenomena is rather complex.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201911300021201ZK.pdf | 359KB | download |