Paper
NJC
`
the imidazolidine ring. These are labelled a–g in Fig. S1 of the We thank Chansele Jourdan for her assistance with the UV-metric
ESI.† Five bond lengths from within the guanidine moiety were pKa measurements. Dr Rozas is gratefully acknowledged for the
also extracted, which correspond to the two CN single and one gift of the pyridino derivatives 16–22 and the Boc-protected
CN double bond of the guanidine group, the N–H bond attached precursors of compounds 1–10 and 23–27.
to the imine nitrogen in A, B, D and E, and one N–H bond of a
primary amine group. These are labelled i–v, in Fig. S2 (ESI†). The
five bond lengths i–v within tautomers A–E were then regressed
against the pKa values for the set of guanidines. The seven bond
References
lengths a–g of the three tautomers T1–T3 were also regressed
against their pKa values for the set of 2-(phenylimino)imidazoli-
dines. The squared correlation coefficient (r2), Root-Mean-Squared
Error of Estimation (RMSEE) and leave-one-seventh-out q2 values
were obtained using the program SIMCA-P 10.0.45
1 J. W. Shaw, D. H. Grayson and I. Rozas, Topics in Heterocyclic
Chemistry, Springer Berlin Heidelberg, Berlin, Heidelberg,
2015, pp. 1–51, DOI: 10.1007/7081_2015_174.
2 P. O’Sullivan and I. Rozas, ChemMedChem, 2014, 9,
2065–2073.
By a comparison of internal and external validation metrics
obtained for each bond length model, an optimal linear equation
was chosen. For guanidines, the model was constructed using the
CQN bond lengths of training set compounds as tautomer A,
labelled ii in Fig. S2 (ESI†). For imidazolidines, the model was
constructed using an endocyclic C–N bond length of the imino
tautomer T3, labelled ‘‘a’’ in Fig. S1 (ESI†). The predictions for
test set compounds 1a–23a (2-(phenylimino)imidazolidines) and
1b–27b (aryl guanidines) were obtained by energy minimization,
(via a conformational search using Marvin followed by geometry
optimisation), frequency calculations, and insertion of the
appropriate bond length into the linear equation for the
optimal bond length vs. pKa model. The test compounds were
constructed in the same tautomeric form as those of the training
set used to construct the model.
3 P. S. Nagle, C. McKeever, F. Rodriguez, B. Nguyen, W. D. Wilson
and I. Rozas, J. Med. Chem., 2014, 57, 7663–7672.
4 P. S. Nagle, F. Rodriguez, B. Nguyen, W. D. Wilson and
I. Rozas, J. Med. Chem., 2012, 55, 4397–4406.
5 P. S. Nagle, S. J. Quinn, J. M. Kelly, D. H. O’Donovan,
A. R. Khan, F. Rodriguez, B. Nguyen, W. D. Wilson and
I. Rozas, Org. Biomol. Chem., 2010, 8, 5558–5567.
6 P. S. Nagle, F. Rodriguez, A. Kahvedzic, S. J. Quinn and
I. Rozas, J. Med. Chem., 2009, 52, 7113–7121.
´
7 F. Rodrıguez, I. Rozas, M. Kaiser, R. Brun, B. Nguyen,
´
W. D. Wilson, R. N. Garcıa and C. Dardonville, J. Med.
Chem., 2008, 51, 909–923.
8 C. Dardonville, M. P. Barrett, R. Brun, M. Kaiser, F. Tanious
and W. D. Wilson, J. Med. Chem., 2006, 49, 3748–3752.
9 C. Dardonville and R. Brun, J. Med. Chem., 2004, 47,
2296–2307.
The program AIMAll (version 14.04.17)46 was used to carry
´
´
out QTAIM analysis and the IQA partitioning calculations to 10 C. Dardonville and J. J. Nue Martınez, Curr. Med. Chem.,
obtain Vxc values for the atomic interactions corresponding to 2017, DOI: 10.2174/0929867324666170623091522.
the C–N(im) and N–C(Ph) bonds. Calculations were carried out 11 C. Millan, F. Acosta-Reyes, L. Lagartera, G. Ebiloma, L. Lemgruber,
´
on B3LYP/6-311G(d,p) wavefunctions of the following species:
J. J. Nue Martinez, N. Saperas, C. Dardonville, H. de Koning and
the five nitro compounds (11a–15a), compound 11 of the
J. L. Campos, Nucleic Acid Res., 2017, 45, 8378–8391.
training set and five compounds for which no pKa data exists, 12 C. Dardonville, C. Fernandez-Fernandez, S. L. Gibbons,
2-(3-Cl-phenylimino), 2-(3-F-phenylimino), 2-(3,5-Cl2-4-NO2-
phenylimino), 2-(3-Br-4-NO2-phenylimino) and 2-(3-F,5-Cl-4-NO2-
G. J. Ryan, N. Jagerovic, A. M. Gabilondo, J. J. Meana and
L. F. Callado, Bioorg. Med. Chem., 2006, 14, 6570–6580.
phenylimino)imidazolidine. For the five additional compounds, 13 C. Dardonville, N. Jagerovic, L. F. Callado and J. J. Meana,
optimisation and frequency calculations were also performed Bioorg. Med. Chem. Lett., 2004, 14, 491–493.
at the B3LYP/6-311G(d,p) level before AIMAll analysis and the 14 C. Dardonville, I. Rozas, P. Goya, R. Giron, C. Goicoechea
relevant bond lengths were extracted.
and M. I. Martin, Bioorg. Med. Chem., 2003, 11, 1283–1291.
15 A. Alsasua, M. J. Borrego, M. T. Dannert, C. Dardonville,
M. Rozas and M. I. Martin, Fundam. Clin. Pharmacol., 2001,
15, 84.
Conflicts of interest
16 B. Kelly, M. McMullan, C. Muguruza, J. E. Ortega, J. J. Meana,
L. F. Callado and I. Rozas, J. Med. Chem., 2015, 58, 963–977.
17 C. Muguruza, F. Rodriguez, I. Rozas, J. J. Meana, L. Uriguen
and L. F. Callado, Neuropharmacology, 2013, 65, 13–19.
There are no conflicts of interest to declare.
Acknowledgements
´
18 F. Rodrıguez, I. Rozas, J. E. Ortega, A. M. Erdozain, J. J. Meana
This work was supported by the Spanish Ministerio de Economia y
and L. F. Callado, J. Med. Chem., 2008, 51, 3304–3312.
Competitividad (Grant SAF2015-66690-R). B. A. Caine thanks 19 F. Rodriguez, I. Rozas, J. E. Ortega, J. J. Meana and
BBSRC and Syngenta Ltd for PhD funding. P. L. A. P. acknowledges L. F. Callado, J. Med. Chem., 2007, 50, 4516–4527.
the EPSRC for funding through the award of an Established Career 20 F. Rodriguez, I. Rozas, J. E. Ortega, A. M. Erdozain, J. J. Meana
Fellowship (grant EP/K005472). We acknowledge the assistance and L. F. Callado, J. Med. Chem., 2009, 52, 601–609.
of G. Romero and F. Perez Gordillo (Instrumental Analysis 21 C. Dardonville and I. Rozas, Med. Res. Rev., 2004, 24,
Department at IQM) with the potentiometric pKa measurements. 639–661.
´
This journal is ©The Royal Society of Chemistry and the Centre National de la Recherche Scientifique 2017
New J. Chem., 2017, 41, 11016--11028 | 11027