Ar tificia l Gen etic System s: Exp loitin g th e
“Ar om a ticity” F or m a lism To Im p r ove th e
Ta u tom er ic Ra tio for Isogu a n osin e
Der iva tives
Theodore A. Martinot and Steven A. Benner*
Department of Chemistry, University of Florida,
P.O. Box 117200, Gainesville, Florida 32611-7200
benner@chem.ufl.edu
F IGURE 1. Isoguanosine (1), a purine (pu), can generate two
possible tautomers, an enol tautomer (giving a puDAD form,
which will pair with T or U) or a keto tautomer (giving a
puDDA form, which will pair with isoC). The extent of this
tautomerism has been reported to be 10:1 in favor of the keto
form.
Received February 4, 2004
Abstr a ct: The tautomerism of 2′-deoxy-7-deaza-isogua-
nosine (2) was studied and compared to that of 2′-deoxy-
1
isoguanosine (1). The fixed N-methyl (8) and O-methyl (4)
derivatives were synthesized to represent the pure extremes
of each tautomer. The replacement of the imidazole ring in
1 with a pyrrole ring in 2 makes the keto form in the latter
more favored by 2 orders of magnitude (KTAUT for 2 ≈ 103,
as opposed to KTAUT for 1 ≈ 10).
presents the donor-acceptor-donor hydrogen bonding
pattern that is complementary to the thymidine and
uridine heterocycles.6 Although the presence of the minor
tautomer does not adversely affect the use of isogua-
nosine in clinical diagnostics, it does inconvenience some
polymerases that prefer to place thymidine (T) and/or
uridine (U), rather than isocytidine (isoC), opposite
isoguanosine in a template.7
It has been a decade since it was shown that the
geometry of the Watson-Crick nucleobase pair can
accommodate as many as 16 nucleobases forming eight
mutually exclusive pairs, to yield an artificially expanded
genetic information system (AEGIS).1 By providing mo-
lecular recognition on demand in aqueous solution,
similar to nucleic acids but with a coding system that is
orthogonal to the system in DNA and RNA, AEGIS today
enables clinical assays such as Bayer’s branched DNA
diagnostics tool that monitors the load of viruses in
patients infected with HIV and hepatitis C viruses.2 Both
assays are FDA-approved and are widely used to provide
personalized patient care in the clinic. Further, AEGIS
components enable an assay for the early detection of the
SARS virus.3 With the emergence of the first six-letter
PCR,4 AEGIS now supports the development of a syn-
thetic biology where higher order processes of living
systems, including reproduction and Darwinian evolu-
tion, are duplicated by artificial, designed chemical
systems.
One issue remaining before a synthetic biology based
on AEGIS is fully implemented arises from the tauto-
meric form displayed by isoguanosine (and 2′-deoxy-
isoguanosine, disoG, 1). In its major keto form (Figure
1), disoG implements a hydrogen bond donor-donor-
acceptor pattern (proceeding from the major to the minor
groove) on a purine heterocycle (Figure 1). A minor enolic
tautomer of disoG is known to be present in aqueous
solution to perhaps 10%,5 however. The enolic tautomer
Given that the keto forms of pyrimidinones normally
dominate over enolic forms, we viewed the enolic tau-
tomer of disoG as an exception in need of explanation.
We reasoned that the enolic form was present in ap-
preciable concentration in disoG because it restores
formal aromaticity to the five-membered imidazole ring,
which must be cross-conjugated in the keto tautomer
(Figure 1). It is well-known that pyrrole is less “aromatic”
than imidazole, with the former being 59% as aromatic
as benzene, the latter being 64% (compared to pyridine
at 86%, thiophene at 66%, and furan at 43%).8 This
suggested that replacement of the imidazole ring of disoG
by a pyrrole ring to give 7-deaza-isoguanine might
generate a purine analogue that would also implement
the puDDA hydrogen bonding pattern (like disoG), but
with less of the enolic tautomer present at equilibrium.
7-Deaza-isoguanine (C7isoG) and its 2′-deoxynucleo-
side analogue (dC7isoG, 2) are both known.9-11 The
crystal structure of dC7isoG12 and its base-pairing ability
have both been studied.13-15 The tautomerism of both
C7isoG and dC7isoG remain undetermined, however.
(6) Robinson, H.; Gao, Y.-G.; Bauer, C.; Roberts, C.; Switzer, C.;
Wang, A. H.-J . Biochemistry 1998, 37, 10897-10905.
(7) Switzer, C. Y.; Moroney, S. E.; Benner, S. A. Biochemistry 1993,
32, 10489-10496.
(8) Katritzky, A. R.; Pozharskii, A. F. Handbook of Heterocyclic
Chemistry, 2nd ed.; Pergamon Press: Oxford, 2000.
(9) Seela, F.; Menkhoff, S.; Behrendt, S. J . Chem. Soc., Perkin Trans.
2 1986, 525-530.
(10) Kazimierczuk, Z.; Mertens, R.; Kawczynski, W.; Seela, F. Helv.
Chim. Acta 1991, 74, 1742-1748.
(11) Seela, F.; Muth, H.-P.; Kaiser, K.; Bourgeois, W.; Muehlegger,
K.; Von der Eltz, H.; Batz, H.-G. U.S. Patent 6,211,158 B1, 2001.
(12) Seela, F.; Wei, C.; Reuter, H.; Kastner, G. Acta Crystallogr.
1999, C55, 1335-1337.
(1) Geyer, C. R.; Battersby, T. R.; Benner, S. A. Structure 2003, 11,
1485-1498.
(2) Collins, M. L.; Irvine, B.; Tyner, D.; Fine, E.; Zayati, C.; Chang,
C. A.; Horn, T.; Ahle, D.; Detmer, J .; Shen, L. P.; Kolberg, J .; Bushnell,
S.; Urdea, M. S.; Ho, D. D. Nucleic Acids Res. 1997, 25, 2979-2984.
sars.html.
(4) Sismour, A. M.; Lutz, S.; Park, J .-H.; Lutz, M. J .; Boyer, P. L.;
Hughes, S. H.; Benner, S. A. Nucleic Acids Res. 2004, 32, 728-735.
(5) Sepiol, J .; Kazimierczuk, Z.; Shugar, D. Z. Naturforsch. C 1976,
31, 361-370.
(13) Seela, F.; Wei, C. Chem. Commun. 1997, 1869-1870.
(14) Seela, F.; Wei, C. Helv. Chim. Acta 1999, 83, 726-745.
(15) Seela, F.; Wei, C.; Becher, G.; Zulauf, M.; Leonard, P. Bioorg.
Med. Chem. Lett. 2000, 10, 289-292.
10.1021/jo0497959 CCC: $27.50 © 2004 American Chemical Society
Published on Web 05/01/2004
3972
J . Org. Chem. 2004, 69, 3972-3975