Unnatural Base Pairs
FULL PAPER
site-specific modification of DNA and RNA with two differ-
ent functional groups, which should be useful for variety of
in vitro applications, including SELEX with an expanded ge-
netic alphabet,[32] as well as biophysical studies that rely on
the modification of DNA with multiple biophysical probes.
The mechanism by which DNA polymerases replicate pre-
dominantly hydrophobic unnatural base pairs is of great in-
terest for designing better base pairs, as well as for under-
standing the range of activities possible with these important
enzymes. It has been suggested that shape complementarity
is important;[2–4] however, it is critical to define in what con-
text it is manifest (i.e., the mode of pairing). Shape comple-
mentarity is usually evoked within a natural, Watson–Crick-
like mode of pairing, where two in-plane nucleobases inter-
act in an edge-on manner. Each natural base pair thus
adopts a similar shape that is thought to be uniquely well ac-
commodated by DNA polymerases.[2–4] In contrast, the
model proposed here (Figure 2) evokes a different mode of
base pairing, where instead of interacting edge-to-edge,
where little to no stabilization is available, the nucleobases
partially interstrand intercalate during base pair synthesis,
which is likely driven by their high stacking potential. How-
ever, extension of the nascent unnatural base pair requires
deintercalation to position the primer terminus 3’-OH ap-
propriately for continued elongation. While deintercalation
is favored by a stabilizing hydrogen-bond between the poly-
merase and the ortho substituents of the nucleobase ana-
logues,[33–38] the model emphasizes the balance of intercala-
tion propensity that must be possessed by the pairing nu-
volved in base pair recognition are inherent to the base pair
and not dependent on the polymerase supports the interpre-
tation of the structure in terms of replication.
The structural models highlight the importance of how
the different substituents affect the partitioning of the un-
natural nucleobases between intercalated and deintercalated
states, which appear to be required for synthesis and exten-
sion, respectively. In the intercalated state, the major groove
substituents form a central part of the nucleobase packing
interface, but upon deintercalation, these substituents are
more solvent exposed in a more traditional-like major
groove. The models suggest that the more efficient replica-
tion of dDMO–d5SICS results from an optimized balance of
forces governing the stability of the intercalated and deinter-
calated states. Synthesis is likely favored by optimized pack-
ing interactions between the major groove methyl groups of
dDMO and d5SICS. In addition, the structure adopted by
dDMO orients the oxygen lone pairs toward the major
groove, where upon deintercalation, they may engage in sta-
bilizing interactions with proximal water molecules and/or
metals, thus favoring unnatural base pair extension. While
anisole is generally not a strong metal ligand or hydrogen-
bond acceptor due to electron delocalACTHNUTRGENNUGizaAHCTUNGTERNNtUGN ion, both interac-
tions are favored when the conjugation is disrupted by rota-
tion,[39–42] as is observed in the modeled structure of dDMO–
d5SICS. The increased substituent size of dTMO appears to
induce subtle structural changes without any significant
affect on replication. In contrast, the cyclic structure of
dFMO appears to force the oxygen lone pairs directly into
the hydrophobic interface between the nucleobases, which is
likely destabilizing.[43–45] Moreover, if the furanyl oxygen is
solvated as the free triphosphate,[46,47] then this stabilizing
solvation will be lost upon insertion without being replaced
with any other favorable interactions. Moreover, deinterca-
lation is expected to force the hydrophobic methines further
into the hydrophilic major groove, which is likely further de-
stabilizing. Thus, with the aid of the structural models, the
intercalative mechanism nicely explains the relatively large
effects of the modifications on unnatural base pair synthesis
and extension.
ACHTUNGTRENNUNGcleobases: they must intercalate sufficiently for synthesis,
but not so much that extension is inhibited. This model
nicely explains a large body of previously reported kinet-
ic[5–10] and structural data.[12]
The solution structure of the parental dMMO2–d5SICS
pair, as well as the derivative model structures of the
dDMO–d5SICS, dTMO–d5SICS, and dFMO–d5SICS pairs
in duplex DNA supports the intercalative model of replica-
tion (Figure 2). The structures clearly reveal that the nucleo-
tides are accommodated within a B-form duplex, adopt anti-
orientations about their glycosidic bonds, and importantly,
pair in an intercalative manner. The data further reveal that
the stacking interface between the nucleobases is comprised
of the methyl group and proximal portion of the associated
aromatic ring of d5SICS and the para substituent of
dMMO2 or a dMMO2 analogue. It should be emphasized
that the structures suggest that the unnatural nucleobase an-
alogues only partially intercalate, they do not fully insert
into the opposite strand due to their size and the constraints
imposed by the duplex (nonetheless, we refer to the interac-
tion as intercalation for simplicity). Importantly, it is clear
that the various substituents examined are predicted to be
positioned within the stacking interface between the unnatu-
ral nucleobases, which accounts for their effects on replica-
tion. It should also be emphasized that the structural data is
based on the analogues embedded within a duplex, and not
at a primer terminus bound to a DNA polymerase. Howev-
er, the fact that at least some of the specific interactions in-
Conclusion
We have identified dDMO–d5SICS as an unnatural base
pair that is better replicated than the parental dMMO2–
d5SICS pair. In addition structural studies support an inter-
calative model of replication, as previously proposed based
on kinetic data[9] and help to explain the observed effects of
the various modifications. The intercalative mode of pairing
is likely not limited to the analogues examined in the pres-
ent work. Indeed, it is similar to that observed in the DNA
zipper motif, where alternating natural nucleobases are in-
terdigitated, as opposed to interacting in an edge-on
manner.[48–56] Moreover, a similar mode of pairing has been
observed previously by our group,[12] as well as by the Leu-
mann group[57] with unnatural nucleotides bearing large aro-
Chem. Eur. J. 2010, 16, 12650 – 12659
ꢀ 2010 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
12657