.
Angewandte
Communications
DOI: 10.1002/anie.201310864
Combinatorial Chemistry
Multi-Objective Molecular De Novo Design by Adaptive Fragment
Prioritization**
Michael Reutlinger, Tiago Rodrigues, Petra Schneider, and Gisbert Schneider*
Abstract: We present the development and application of
a computational molecular de novo design method for
obtaining bioactive compounds with desired on- and off-
target binding. The approach translates the nature-inspired
concept of ant colony optimization to combinatorial building
block selection. By relying on publicly available structure–
activity data, we developed a predictive quantitative polyphar-
macology model for 640 human drug targets. By taking
reductive amination as an example of a privileged reaction, we
obtained novel subtype-selective and multitarget-modulating
dopamine D4 antagonists, as well as ligands selective for the
sigma-1 receptor with accurately predicted affinities. The
nanomolar potencies of the hits obtained, their high ligand
efficiencies, and an overall success rate of 90% demonstrate
that this ligand-based computer-aided molecular design
method may guide target-focused combinatorial chemistry.
a nature-inspired optimization principle to chemistry-driven
molecular design.
For a proof-of-concept we focused on the reductive
amination reaction as a scheme for combinatorial synthesis.
By automated structure optimization, MAntA generated
small compound libraries with lead-like qualities, high hit
rates, and nanomolar activities. It implements a new design
strategy that is applicable to all kinds of chemistry-driven
computational methods,[8] and neither requires prior knowl-
edge about the bioactivity of scaffold classes nor is limited to
privileged scaffolds. In a retrospective study, ant colony
optimization turned out to perform better or on par with
other optimization methods.[7b] Here, we pioneer the concept
of polypharmacology-based molecular de novo design using
combinatorial chemistry. We demonstrate that both target-
selective, and multitarget-modulating members of large
combinatorial compound libraries are rapidly identified with-
out the need for full library enumeration and synthesis.
The molecular design method requires 1) a scheme for
compound synthesis, 2) a method for predicting the affinity of
the virtual products, and 3) a technique for optimizing the
building blocks. For our concept study, we chose the reductive
amination reaction and aldehydes/ketones and amines as
building blocks. We applied MAntA to the products of single-
step reductive amination starting from commercially avail-
able building blocks. The reaction products have a high
likelihood of possessing desirable druglike features, as
visualized in Figure 1, which presents a map of the known
bioactivity space. Virtual reaction products (green dots)
cluster in a densely populated area, and the reductive
amination may be regarded as a preferred reaction for drug
discovery.
For affinity prediction we trained individual Gaussian
process (GP) regression models[9] for 640 human targets
annotated in ChEMBL (v14),[10] based on 279866 compounds
with 569725 measured bioactivities. Molecules were repre-
sented by topological pharmacophore (“CATS2”)[11] and
substructure (circular Morgan fingerprints)[12] descriptors.
The choice of GP regression was motivated by extensive
comparison to other modeling techniques using the same
training data, where the GP approach performed best
(Tables S2 and S3 in the Supporting Information). In addition,
GP models compute a data-density-dependent confidence
estimate, which we combined with the quantitative bioactivity
prediction (pAffinity) to obtain a single robust prediction
score for each compound.
T
raditional combinatorial chemistry aims at the generation
of large diverse compound arrays for bioactivity screening.[1]
It has been realized that multiple “adaptive” synthesis-and-
test cycles using smaller, focused compound libraries might be
better suited, faster, and more economical to find lead-like
bioactive compounds.[2,3] Computational molecular design
methods offer the additional advantage of generating bioac-
tive compounds while considering multiple objectives in
parallel,[4] and combinatorial libraries with desired properties
can be obtained by relying on chemistry-oriented computa-
tional molecular design.[5,6] Though potentially appealing,
these methods have rarely been prospectively applied. Here,
we present the comprehensive application of a computational
concept for designing combinatorial libraries that exhibit an
accurately predicted bioactivity profile. We show that the
molecular ant algorithm (MAntA)[7] effectively transfers
[*] M. Reutlinger, Dr. T. Rodrigues, Dr. P. Schneider,
Prof. Dr. G. Schneider
Eidgençssische Technische Hochschule (ETH)
Departement Chemie und Angewandte Biowissenschaften
Vladimir-Prelog-Weg 4, 8093 Zꢀrich (Switzerland)
E-mail: gisbert.schneider@pharma.ethz.ch
Dr. P. Schneider, Prof. Dr. G. Schneider
inSili.com GmbH
Segantinisteig 3, 8049 Zꢀrich (Switzerland)
[**] We thank Dr. Michael Bieler for computing QED values. This
research was financially supported by a grant from the OPO
Foundation, Zꢀrich.
Equipped with this quantitative affinity prediction model,
MAntA performs an adaptive search for optimal combina-
tions of building blocks for the given reaction scheme
(Figure 2). The search space consists of all possible substrates
Supporting information for this article (details on the computa-
tional methods, synthesis, analytics, and binding assays) is
4244
ꢀ 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
Angew. Chem. Int. Ed. 2014, 53, 4244 –4248