Computational modeling and design of small molecules
The ability to embed molecules in a numeric space is essential in order to build models describing molecular properties or other processes affected by molecules. MACAW is a cheminformatic tool that allows embedding small molecules into a multidimensional numeric space. In the embedding, each molecule is assigned a numeric vector that captures information of the molecule in relation to other molecules, and that vector can be used as input to mathematical models. Molecules that are more similar to each other are embedded closer in this numeric space, whereas molecules that are more different are embedded further away. Some advantages of MACAW compared to established alternatives are that it is fast and does not require extensive computational resources or expertise. In particular, MACAW embeddings can be used as input to machine-learning models without the need for variable cleaning or feature selection, saving time and simplifying their use.
MACAW also contains methods to generate new molecules and to recommend new molecules that meet a desired molecular property. The generation of new molecules can be biased based on an input set of molecules, generating molecular diversity around it. On the other hand, MACAW's molecular recommendation tool is a novel method for evolving molecules in silico towards a desired molecular specification. In this method, the biased molecular generator tool is applied iteratively in combination with a molecular selection step. As a result, in each iteration the molecules selected by the software are increasingly closer to the desired specification. Both the molecular generation and the molecular recommendation tools are very fast, efficient, and intuitive to use.
Learn more about MACAW in the publication and the repository.
Email for more information:
copyright@lbl.gov