Journal article details
Journal of Cheminformatics
Human-in-the-loop assisted de novo molecular design
Research
Haoping Xiao1  Markus Heinonen1  Iiris Sundin1  Samuel Kaski2  Alexey Voronov3  Ola Engkvist4  Esben Jannik Bjerrum5  Atanas Patronov5  Kostas Papadopoulos5 
[1] Department of Computer Science, Aalto University, Espoo, Finland;Department of Computer Science, Aalto University, Espoo, Finland;Department of Computer Science, University of Manchester, Manchester, UK;Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden;Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden;Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg, Sweden;Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden;Odyssey Therapeutics, Cambridge, MA, USA;
Keywords: Interactive algorithms; De novo molecular design; Human-in-the-loop; AI-assisted design; Goal-oriented molecule generation; Expert knowledge elicitation; Reward elicitation
DOI: 10.1186/s13321-022-00667-8
Received 2022-06-30, accepted 2022-12-03, published 2022
Source: Springer
【 Abstract 】

A de novo molecular design workflow can be used together with technologies such as reinforcement learning to navigate the chemical space. A bottleneck in the workflow that remains to be solved is how to integrate human feedback into the exploration of the chemical space to optimize molecules. A human drug designer still needs to design the goal, expressed as a scoring function for the molecules that captures the designer’s implicit knowledge about the optimization task. Little support for this task exists and, consequently, a chemist usually resorts to iteratively building the objective function of multi-parameter optimization (MPO) in de novo design. We propose a principled approach that uses human-in-the-loop machine learning to help the chemist adapt the MPO scoring function to better match their goal. An advantage is that the method can learn the scoring function directly from the user’s feedback while they browse the output of the molecule generator, replacing the current manual, trial-and-error tuning of the scoring function. The proposed method uses a probabilistic model that captures the user’s idea of, and uncertainty about, the scoring function, and it uses active learning to interact with the user. We present two case studies: in the first, the parameters of an MPO are learned; in the second, a non-parametric component of the scoring function is developed to capture human domain knowledge. The results show the effectiveness of the methods in two simulated example cases with an oracle, achieving significant improvement in fewer than 200 feedback queries, for the goals of a high QED score and identifying potent molecules for the DRD2 receptor, respectively. We further demonstrate the performance gains with a medicinal chemist interacting with the system.
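
The loop described in the abstract (a probabilistic model of the user's scoring function, updated from feedback and queried by active learning) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a weighted-sum MPO over three component scores, a logistic feedback model, a particle approximation of the posterior, uncertainty sampling as the query rule, and a noise-free simulated oracle. All names (`like_prob`, `run_elicitation`, `true_w`) and all numeric settings are hypothetical.

```python
import math
import random


def like_prob(weights, features, slope=8.0, threshold=0.5):
    """P(user likes molecule) under a weighted-sum MPO score (illustrative model)."""
    score = sum(w * f for w, f in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-slope * (score - threshold)))


def normalized(log_weights):
    """Turn unnormalized log-weights into probabilities that sum to 1."""
    top = max(log_weights)
    raw = [math.exp(lw - top) for lw in log_weights]
    total = sum(raw)
    return [r / total for r in raw]


def posterior_mean(particles, log_weights):
    """Posterior mean of the MPO weights under the particle approximation."""
    probs = normalized(log_weights)
    dims = len(particles[0])
    return [sum(p * w[d] for p, w in zip(probs, particles)) for d in range(dims)]


def run_elicitation(n_queries=40, n_particles=200, n_pool=100, seed=0):
    rng = random.Random(seed)
    true_w = [0.7, 0.2, 0.1]  # hidden goal of the simulated user (assumption)

    # Prior: candidate weight vectors drawn uniformly from the simplex.
    particles = []
    for _ in range(n_particles):
        raw = [rng.expovariate(1.0) for _ in true_w]
        total = sum(raw)
        particles.append([r / total for r in raw])
    log_weights = [0.0] * n_particles

    # Pool of "generated molecules", each reduced to three component scores in [0, 1].
    pool = [[rng.random() for _ in true_w] for _ in range(n_pool)]
    prior_mean = posterior_mean(particles, log_weights)

    for _ in range(n_queries):
        probs = normalized(log_weights)

        # Active learning by uncertainty sampling: query the molecule whose
        # posterior-predictive like-probability is closest to 0.5.
        def uncertainty(x):
            pred = sum(p * like_prob(w, x) for p, w in zip(probs, particles))
            return abs(pred - 0.5)

        query = min(pool, key=uncertainty)
        pool.remove(query)

        # Simulated oracle: a noise-free answer derived from the hidden weights.
        liked = like_prob(true_w, query) > 0.5

        # Bayesian update: reweight each particle by the feedback likelihood.
        for i, w in enumerate(particles):
            p = like_prob(w, query)
            log_weights[i] += math.log(p if liked else 1.0 - p)

    return prior_mean, posterior_mean(particles, log_weights), true_w
```

Calling `run_elicitation()` returns the prior mean, the posterior mean after the simulated feedback, and the hidden weights, so the three can be compared directly. A real system would replace the random component scores with computed molecular properties and the oracle with a chemist's responses.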

【 License 】

CC BY   
© The Author(s) 2022
