how to generate the new/novel molecules by using TL_Mol2Mol.model
after running transfer learning?
#20
-
Hi, sorry to bring a naive question. after running the transfer learning by command line how can I generate the new/novel molecules by using How should I set the parameters of the config file (e.g. toml) if I want to get new compounds with more diversity of scaffold variations? could you please provide a demo or suggestions ? many many thanks Sh-Y |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Hi again, to generate molecules from a model you would need to sample from it. There is a special "sampling" runmode for this with example config files in the repository, However, that being said, the problem with TL is that it is hard to control: if you run for too short the model will stay close to the original prior, if you run for too long the generated molecules may be very close to the input data set. Neither is desirable in practice. TL is therefore best be used to "focus" a prior model closer to the desirect structures you expect to receive. But then you would use RL with a clearly defined "scoring profile" to optimizxe the model for this profile. In practice, 3D methods like docking or shape similarity give better results than just 2D or simple physchem properties. Hope this helps, |
Beta Was this translation helpful? Give feedback.
The config file looks fine to me. It may not be best to use the final model from TL as it could be too much biased towards your input molecules. As said it is hard to judge when TL should be stopped (TL writes "intermediate" model files at defined steps so you can pick up from there). The question at the end is what you end goal is.