Journal of Cheminformatics
COMA: efficient structure-constrained molecular generation using contractive and margin losses
Research
Sanghyun Park1, Sangmin Seo2, Jonghwan Choi2
[1] Department of Computer Science, Yonsei University, Yonsei-ro 50, 03722, Seoul, Republic of Korea; Department of Computer Science, Yonsei University, Yonsei-ro 50, 03722, Seoul, Republic of Korea; UBLBio Corporation, Yeongtong-ro 237, 16679, Suwon, Gyeonggi-do, Republic of Korea
Keywords: Drug design; Molecular optimization; Goal-directed molecular generation; Structure-constrained molecular generation; Deep generative model; Metric learning; Contrastive learning; Reinforcement learning
DOI: 10.1186/s13321-023-00679-y
Received 2022-10-05, accepted 2023-01-04, published 2023
Source: Springer
【 Abstract 】
Background: Structure-constrained molecular generation is a promising approach to drug discovery. Its goal is to produce a novel molecule that is similar to a given source molecule (e.g. a hit molecule) but has enhanced chemical properties (for lead optimization). Many structure-constrained molecular generation models that perform well at improving chemical properties have been proposed; however, they still have difficulty producing many novel molecules that combine high structural similarity to each source molecule with improved molecular properties.
Methods: We propose a structure-constrained molecular generation model that uses contractive and margin loss terms to achieve property improvement and high structural similarity simultaneously. The proposed model has two training phases: a generator first learns molecular representation vectors using metric learning with contractive and margin losses, and then explores optimized molecular structures for target property improvement via reinforcement learning.
Results: We demonstrate the superiority of the proposed method by comparing it with various state-of-the-art baselines and through ablation studies. Furthermore, we demonstrate the use of our method in drug discovery with an example of generating sorafenib-like molecules for patients with drug resistance.
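The abstract names contractive and margin loss terms but does not give their formulas. As a rough illustration only, the sketch below shows one generic metric-learning form of the idea: a contractive term that pulls the embedding of a source molecule toward that of a structurally similar molecule, and a hinge-style margin term that pushes a dissimilar molecule's embedding at least `margin` away. The function names, the Euclidean distance, the squared form, and the toy two-dimensional vectors are all assumptions made for this example; they are not taken from the paper and may differ from the authors' definitions.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def contractive_loss(anchor, positive):
    """Assumed contractive term: squared distance to a similar molecule.

    Minimizing it pulls the two embeddings together.
    """
    return euclidean(anchor, positive) ** 2


def margin_loss(anchor, negative, margin=1.0):
    """Assumed margin term: squared hinge on the distance to a dissimilar molecule.

    It is zero once the negative is at least `margin` away from the anchor.
    """
    return max(0.0, margin - euclidean(anchor, negative)) ** 2


def metric_loss(anchor, positive, negative, margin=1.0):
    """Sum of the two assumed terms for one (anchor, positive, negative) triple."""
    return contractive_loss(anchor, positive) + margin_loss(anchor, negative, margin)


# Hypothetical 2-D embeddings, chosen only to make the arithmetic easy to follow.
source = [0.0, 0.0]
similar = [0.3, 0.4]   # distance 0.5 from source
far = [3.0, 4.0]       # distance 5.0 from source

print(metric_loss(source, similar, far))      # only the contractive term contributes
print(metric_loss(source, similar, similar))  # both terms contribute
```

In this toy setup, a negative that already lies beyond the margin contributes nothing, so training pressure concentrates on pairs that violate the desired geometry.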
【 License 】
CC BY
© The Author(s) 2023