.
Angewandte
Communications
Target Identification
Revealing the Macromolecular Targets of Fragment-Like Natural
Products**
Tiago Rodrigues, Daniel Reker, Jens Kunze, Petra Schneider, and Gisbert Schneider*
[
7]
Abstract: Fragment-like natural products were identified as
ligand-efficient chemical matter for hit-to-lead development
and chemical-probe discovery. Relying on a computational
method using a topological pharmacophore descriptor and
a drug database, several macromolecular targets from distinct
protein families were expeditiously retrieved for structurally
unrelated chemotypes. The selected fragments feature struc-
tural dissimilarity to the reference compounds and suitable
target affinity, and they offer opportunities for chemical
optimization. Experimental confirmation of hitherto unknown
macromolecular targets for the selected molecules corroborate
the usefulness of the computational approach and suggests
broad applicability to chemical biology and molecular med-
icine.
objective drug design. We have recently developed a soft-
ware tool for predicting the macromolecular targets of
[
4a]
structurally complex NPs. Conversely, small fragment-like
chemical entities are generally recognized as better-suited
starting points for hit-to-lead discovery programs. In this
study, we set out to identify the targets of fragment-like NPs
(molecular weight < 300 gmol ) and probe the underlying
molecular basis of macromolecular recognition.
By using molecular representation in terms of topological
pharmacophore features (CATS2 descriptor), we trained
a self-organizing map (SOM)
(Dictionary of Natural Products, DNP), together with 12 661
[
8]
À1 [9]
[10]
[11]
on a set of 210 213 NPs
[12]
small molecules with known targets (COBRA database).
This allowed us to span the combined pharmacophore feature
space of both drugs and NPs. The software modeled
qualitative ligand–receptor relationships by using statistically
interpreted similarities between drugs and NPs that co-
clustered in this comprehensive space. We derived a stat-
istical significance estimate (p value) for each individual
target prediction (see the Supporting Information). In
a previous study, we demonstrated that this method achieves
receiver-operating characteristic area under the curve
N
atural products (NPs) are the focus of strong interest as
[
1]
chemical matter for interrogating biological systems. Acting
as starting points for drug and chemical-probe discovery, they
comprise biologically prevalidated architectures that allow
the exploration of drug-relevant chemical space not covered
[5a]
[
2]
by fully synthetic small molecules. Consequently, computa-
tional tools have been deployed for enriching collections of
screening compounds by gathering structural and physico-
(ROC AUC) values of 0.88 based on an extensive retrospec-
[
3]
[5a]
chemical features from NPs. However, their applicability is
tightly bound to factual knowledge of the macromolecular
counterparts (e.g., target receptors) of the parent NP. The on-
and off-target interactions remain largely unknown for the
majority of these molecules, thus hindering the widespread
and efficient exploration of NP chemical motifs as leads for
tive study.
ROC AUC values can be interpreted as
probabilities for the algorithm to rank a correct ligand–
target pair higher than an incorrect pair.
The SOM algorithm afforded at least one highly confident
target prediction (p < 1%) for 80 024 (38%) of all of the NPs
annotated in the DNP, with an average of 0.9 targets per
chemical entity. From the 64 650 fragment-like NPs in the
DNP, 34 239 (53%) were confidently predicted to interact
with at least one target (average of 1.4 targets per fragment-
like NP). Importantly, the algorithm captured known ligand–
target relationships for fragment-like NPs that were not part
of the training data (see the Supporting Information). With
these target-engagement predictions in hand, we selected
goitrin, isomacroin, and graveolinine (Figure 1) for experi-
mental validation, taking into account: 1) their fragment-like
nature; 2) the confidence of the predictions (p < 1%, i.e., top
ranking); 3) their profound structural dissimilarity to the most
similar reference compound from the drug database (struc-
tural Tanimoto similarity ꢀ 0.2; Table 1), thus ensuring that
straightforward similarity searches would not have recog-
nized these potential ligand–receptor associations; 4) syn-
thetic tractability, and 5) a lack of known macromolecular
targets. Importantly, other publically available target predic-
[4]
chemical biology. Herein, we disclose the targets of the
natural products goitrin, isomacroin, and graveolinine, which
we discovered by using a ligand-based computational method
for predicting the targets of (de-orphaning) NPs in the
absence of apparent chemotype similarity to known macro-
molecular target effectors.
Chemocentric approaches for target prediction have
[
5]
already proven their applicability to drug repurposing,
[
6]
identifying drug liabilities, and driving innovative multi-
[
*] Dr. T. Rodrigues, D. Reker, J. Kunze, Dr. P. Schneider,
Prof. Dr. G. Schneider
Department of Chemistry and Applied Biosciences
Swiss Federal Institute of Technology (ETH)
Vladimir-Prelog-Weg 4, 8093 Zurich (Switzerland)
E-mail: gisbert.schneider@pharma.ethz.ch
[
**] This research was financially supported by ETH Zurich and a grant
from the OPO Foundation, Zurich, Switzerland. P. S. and G. S. are
the founders of inSili.com GmbH, Zurich.
[5e]
[13]
[14]
tion tools, for example, SEA, PASS, and SuperPred,
failed to confidently predict either these or any other targets
for the selected NPs (see the Supporting Information), thus
1
0516
ꢀ 2015 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
Angew. Chem. Int. Ed. 2015, 54, 10516 –10520