TY - JOUR
T1 - Training recurrent neural networks as generative neural networks for molecular structures
T2 - how does it impact drug discovery?
AU - D’Souza, Sofia
AU - Kv, Prema
AU - Balaji, Seetharaman
N1 - Funding Information:
This paper was not funded.
Publisher Copyright:
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2022
Y1 - 2022
N2 - Introduction: Deep learning approaches have become popular in recent years in de novo drug design. Generative models for molecule generation and optimization have shown promising results. Molecules trained on different chemical data could regenerate molecules that were similar to the query molecule, thus supporting lead optimization. Recurrent neural network-based generative models have demonstrated application in low-data drug discovery, fragment-based drug design and in lead optimization. Areas covered: In this review, we have provided an overview of recurrent neural network models and their variants for molecule generation with recent examples. The input representation of molecules as SMILES and molecular graphs have been discussed. The evaluation benchmarks and metrics used in generative neural network models are also highlighted. For this, ScienceDirect, Web of Science, and Google Scholar databases were searched with the article’s keywords and their combinations to retrieve the most relevant and up-to-date information. Expert opinion: The simplicity of SMILES notation makes it suitable for training a sequence-based model such as a recurrent neural network. However, models that could be trained on molecular graphs to generate molecular structures which could be synthesized could open new possibility for valid molecule generation and synthetic feasibility.
AB - Introduction: Deep learning approaches have become popular in recent years in de novo drug design. Generative models for molecule generation and optimization have shown promising results. Molecules trained on different chemical data could regenerate molecules that were similar to the query molecule, thus supporting lead optimization. Recurrent neural network-based generative models have demonstrated application in low-data drug discovery, fragment-based drug design and in lead optimization. Areas covered: In this review, we have provided an overview of recurrent neural network models and their variants for molecule generation with recent examples. The input representation of molecules as SMILES and molecular graphs have been discussed. The evaluation benchmarks and metrics used in generative neural network models are also highlighted. For this, ScienceDirect, Web of Science, and Google Scholar databases were searched with the article’s keywords and their combinations to retrieve the most relevant and up-to-date information. Expert opinion: The simplicity of SMILES notation makes it suitable for training a sequence-based model such as a recurrent neural network. However, models that could be trained on molecular graphs to generate molecular structures which could be synthesized could open new possibility for valid molecule generation and synthetic feasibility.
UR - http://www.scopus.com/inward/record.url?scp=85140114765&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140114765&partnerID=8YFLogxK
U2 - 10.1080/17460441.2023.2134340
DO - 10.1080/17460441.2023.2134340
M3 - Review article
C2 - 36216812
AN - SCOPUS:85140114765
SN - 1746-0441
VL - 17
SP - 1071
EP - 1079
JO - Expert Opinion on Drug Discovery
JF - Expert Opinion on Drug Discovery
IS - 10
ER -