.
Angewandte
Communications
Table 1: Summary of results for selected compounds 1, 3, 5, 7, 8, 9, and 12.
discovery, such as on-chip synthesis
and computational target predic-
tion, may advance hit and lead
identification in chemical biology
and molecular medicine. In light of
recent advances in lab-on-a-chip
technologies,[14] one could even
envisage a fully automated hit-find-
ing automaton that integrates com-
putational target prediction and
building block selection for the
microfluidic-assisted synthesis and
testing of candidate compounds.
Target
Predicted Mahalanobis
Experimental
LE[a]
LLE[b]
SILE[c]
pAffinity
distance
pKi or % binding
1
3
5
7
8
9
a1A[d]/PDE10A[e]
a1B
5.7/5.7
6.2
5.8/6.5
6.1
5.7
6.4/5.8
6.0
0.7/0.8
2.4
0.7/2.4
2.0
3.2
2.6/1.7
3.3
<4/<4
5.6
–
0.33
–
3.46
–
3.04
[f]
[g]
a1A/A2B
a1B
5.4/5.2
5.7
0.30/0.29 3.07/2.86 2.87/2.76
0.40
–
–
–
1.74
3.23
>80%[i]
<4/<4
>80%[i]
–
–
–
–
–
–
[h]
A1
A2B/PDE10A
12 A1
[a] Ligand efficiency. [b] Lipophilic ligand efficiency. [c] Size-independent ligand efficiency. [d] Adrenergic
a1A receptor. [e] Phosphodiesterase 10A. [f] Adrenergic a1B receptor. [g] Adenosine A2B receptor.
[h] Adenosine A1 receptor. [i] Radioligand assay; activity values are averaged from two measurements.
novelty of the scaffold compared to known ligands in the
ChEMBL database. In fact, to the best of our knowledge,
Experimental Section
Computations. For training the Gaussian process models[15] we used
the ChEMBL database (version 14) containing 1213242 distinct
compounds with 10129256 bioactivities for 9003 targets.[9] Protein
targets with fewer than 200 annotated human bioactivities were
excluded. All activity end-points were standardized to pAffinity =
ꢀlog10(activity). The final affinity data set consisted of 209293
compounds with 431313 bioactivities for 469 human targets. Post-
Knime v.2.6.0.[16] Molecular structures were standardized using the
“wash” function in MOE 2012.10 (The Chemical Computing Group
Inc., Montreal, Canada); logP(o/w) was calculated with MOE. Two
different molecular descriptors were calculated for each compound:
topological pharmacophores (CATS2, 0–9 bonds, type-sensitive
scaling),[6] and an ECFP-like topological circular fingerprint
org).[17] Predictive models were implemented using Matlab R2012b
(The MathWorks Inc., Natick, USA) and the GPML toolbox v3.1
tenfold stratified cross-validation (cross-validated squared correla-
tion coefficient, Q2; mean absolute error, MAE). The Boltzmann-
enhanced discrimination of ROC (BEDROC; a = 56, top 3%
contribute 80% to the score) was used to quantify the early
enrichment performance.[18] We used the lower confidence-bound
pAffinity estimate throughout this study: prediction ¼ m ꢀ s2 ,
imidazopyridines with this framework have not been reported
as adenosine or adrenergic receptor ligands.[20]
Having predicted potential macromolecular targets for all
synthesized compounds, we tested those compounds for
which we had obtained robust pAffinity predictions. For one
of the prominent targets, phosphoinositide 3-kinase, activity
had previously been reported for the underlying imidazopyr-
idine scaffold,[6] which corroborated the prediction. As
a proof-of-concept, we then explored a range of predicted
GPCR targets aiming at the discovery of a new activity island
in chemical space. In radioligand displacement assays probing
the direct ligand–receptor binding and in cell-based func-
tional activity assays, 71% of the compounds were found to
be active as predicted (Table 1). More specifically, com-
pounds 3 and 7 presented antagonistic Ki values of 2–3 mm,
respectively, against the adrenergic a1B receptor, while com-
pound 5 showed similar low micromolar antagonistic potency
against the adrenergic a1A and adenosine A2B receptors.
Compounds 8 and 12 turned out to be potent direct ligands of
the A1 receptor (84% and 89% binding at 100 mm, respec-
tively), but were inactive in the functional cell-based assay.
Additional tests will be required to determine selectivity
profiles in a full GPCR panel screen.
Several quality indices have been suggested to guide hit
prioritization in drug discovery.[13] Accordingly, our com-
pounds fully qualify as lead structure candidates (Table 1).
For example, compound 7 is a scarcely decorated, yet highly
ligand-efficient chemical entity (LE = 0.40; SILE = 3.23)
which might justify development as an adrenergic a1A
receptor antagonist. On the other hand, although less efficient
than 7, compound 3 presents a better balance between affinity
and computed logP(o/w) (LLE = 3.46 vs. 1.74). Most impor-
tantly, the leads presented herein are dissimilar to their
nearest neighbors from the training data (structural similarity
Tanimoto = 0.16–0.30, Table S1) and would likely not have
been selected using straightforward substructure-based sim-
ilarity searching.
*
*
where m* is the modelꢀs predictive mean and s2 the predictive
*
variance. To distinguish from random predictions we calculated the
Mahalanobis distance of an activity prediction: MD(prediction) =
(predictionꢀmr)/sr, where mr and sr are the mean and standard
deviation of a randomized predictive distribution. The background
consisted of 50000 randomly selected molecules from ChemDB.[19]
Synthesis. Stock solutions of building blocks were prepared in
ethanol. The amine and aldehyde components were premixed, and
perchloric acid was added. Two independent syringe pumps delivered
the amine/benzaldehyde/perchloric acid solution and the isocyanide
solution at suitable flow rates. The reaction chamber containing the
microchip was heated at different temperatures and the crude product
was collected in a vial. The crude mixtures were purified by
preparative HPLC (acetonitrile/H2O + 0.1% formic acid in each
solvent) using a gradient of 30–95% or 5–50% acetonitrile over
16 min. Microfluidics hardware and the Qmix Elements software were
obtained from Cetoni (Korbussen, Germany). Microwave synthesis
was performed in a Biotage Initiator (Uppsala, Sweden) in 1–2 mL
vials, as described.[6]
Altogether, our chemistry-driven approach to the design
of a target-focused combinatorial library, in an expeditious
and efficient manner, led to the identification of a molecular
framework targeting four GPCRs. The results highlight the
imidazopyridine scaffold as a privileged motif and demon-
strate how the integration of emerging technologies in drug
Testing. Activity determinations were performed by Cerep (Le
Bois l’EvÞque, 86600 Celle l’Evescault, France) on a fee-for-service
basis. For details see the Supporting Information.
Received: September 4, 2013
Published online: November 26, 2013
584
ꢀ 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
Angew. Chem. Int. Ed. 2014, 53, 582 –585