Cross-Linked Amino Acids and Deoxynucleosides
A R T I C L E S
peptide, and 5 mM deoxynucleoside. For coupling between amino
acids and trinucleotides in 50 µL reaction volumes, final concentra-
tions were 50 mM formaldehyde, 5 mM amino acid, and 1 mM
trinucleotide.
NMR (125 MHz, DMSO-d6) δ 172.6 COOH-Cys, 156.7 C6, 155.3
COO-t-Boc, 151.9 C2, 150.0 C4, 136.1 C8, 120.0 C5, 87.8 C4′,
82.9 C1′, 78.2 C-t-Boc, 70.5 C3′, 61.5 C5′, 53.9 CR, 43.2 CH2-
linker, 39.3 C2′, 31.8 Cꢀ, 28.0 CH3-t-Boc.
Cys-CH2-dA. 1H NMR (500 MHz, DMSO-d6) δ 8.42 (bs, 1H,
N6H), 8.37 (s, 1H, H8), 8.26 (s, 1H, H2), 6.35 (Ψt, 1H, H1′, J )
6.8 Hz), 6.01 (d, 1H, NRH-Cys, J ) 5.26 Hz), 4.66-4.77 (m, 1H,
CH2a-linker), 4.50-4.61 (m, 1H, CH2b-linker), 4.40-4.44 (m, 1H,
H3′), 3.85-3.89 (m, 1H, H4′), 3.61 (dd, 1H, H5′, J ) 11.9, 4.3
Hz), 3.66-3.74 (m, 1H, CRH), 3.53 (dd, 1H, H5′′, J ) 11.9, 4.3
Hz), 2.96-3.07 (m, 2H, CꢀH2), 2.66-2.74 (m, 1H, H2′), 2.27 (ddd,
1H, H2′′, J ) 13.1, 6.8, 3.2 Hz), 1.33 (s, 9H, CH3-t-Boc). 13C NMR
(125 MHz, DMSO-d6) δ 172.2 COOH-Cys, 154.6 COO-t-Boc,
153.6 C6, 150.5 C2, 148.5 C4, 139.9 C8, 120.2 C5, 87.3 C4′, 82.7
C1′, 78.0 C-t-Boc, 70.4 C3′, 61.5 C5′, 55.2 CR, 43.0 CH2-linker,
39.3 C2′, 34.7 Cꢀ, 27.8 CH3-t-Boc.
Large scale reactions between amino acids and deoxynucleosides
for structural characterization by NMR were run with 20 mg of
deoxynucleoside and 40 mg of NR-Boc-amino acid in 5 mL of 10
mM potassium phosphate buffer (pH 7.2) and 100 mM formalde-
hyde for 12 h to 1 week, monitored by HPLC until the chromato-
graphic trace remained constant. The reaction mixtures were
separated by semipreparative HPLC, and collected products were
characterized by mass spectrometry and NMR. Exact masses are
given in Table 3. MS/MS data are presented in full as Supporting
Information. H and 13C shifts are shown below. For the Lys-dG
reactions, HPLC fractions were collected on dry ice, lyophilized,
and then stored at -80 °C until analysis.
Kinetics of Cross-Link Formation. Solutions were made up
to final concentrations of 4 mM amino acid, 4 mM deoxynucleoside,
and 50 mM formaldehyde in 50 mM potassium phosphate buffer
(pH 7.2). The reaction mixtures were analyzed at 2, 4, 8, 16, 24,
48, and 72 h by HPLC with UV detection at 254 nm. For the Lys-
dG coupling reaction, additional samples were analyzed at 10, 20,
30, and 60 min.
Stability of Cross-Linked Lys-dG Products. To measure the
kinetics of cross-link degradation, a reaction mixture containing
Lys and dG as described above was treated with 50 mM
formaldehyde for 48 h and then separated by HPLC. Fractions
eluting at 17.2 and 26.5 min were collected on dry ice, and 50 µL
aliquots of each fraction were added to 950 µL of 50 mM phosphate
buffer maintained at 37 °C for 1, 5, 10, 20, or 30 min and then
analyzed by HPLC with ESI Q-TOF mass spectrometry.
2-Amino-6-(10-oxo-triazino[1,2-a]purin-7-yl)hexanoic Acid
(TPHA-1). 1H NMR (500 MHz, DMSO-d6) δ 7.91 (s,1H, H2), 7.87
(s, 1H, N5H, J ) 2.1 Hz), 6.74 (d, 1H, Boc-NRH-, J ) 4.7 Hz),
6.09 (dd, 1H, H1′, J ) 7.8, 5.9 Hz), 4.89 (s, 2H, C8H2), 4.33 (m,
1H, H3′), 4.24 (bs, 2H, C6H2), 3.83 (m,1H, H4′), 3.75 (dt, 1, CRH,
J ) 8.1, 8.1, 4.7 Hz), 3.54 (dd, 2H, H5′, J ) 11.7, 4.5 Hz), 3.47
(dd, 2H, H5′′, J ) 11.7, 4.5 Hz), ∼2.47 (CεH2 (overlaps DMSO-
d6)), ∼2.50 (H2′ (overlaps DMSO-d6)), 2.18 (ddd, 1H, H2′′, J )
13.1, 5.9, 3.0 Hz), 1.34-1.48 (m, 2H, CδH2), 1.24-1.37 (m, 2H,
CγH2), 1.44-1.58 (m, 2H, CꢀH2), 1.35 (s, 9H, CH3-Boc). 13C NMR
(125 MHz, DMSO-d6) δ 173.9 COOH, 155.8 C10, 155.2 COO-
Boc, 150.2 C4a, 149.0 C3a, 134.9 C2, 115.5 C10a, 87.3 C4′, 82.1
C1′, 77.4 C-Boc, 70.5 C3′, 61.4 C5′, 60.5 C8, 59.5 C6, 53.8 CR,
49.2 Cε, 39.3 C2′, 30.8 Cꢀ, 27.9 CH3-Boc, 26.6 Cδ, 22.8 Cγ.
2-Amino-6-(5-hydroxymethyl-10-oxo-triazino[1,2-a]purin-
7-yl)hexanoic Acid (TPHA-2). 1H NMR (500 MHz, DMSO-d6) δ
7.96 (s,1H, H2), 6.93-6.99 (m,1H, NRH-Lys), 6.19 (dd, 1H, H1′,
J ) 7.8, 5.9 Hz), 4.99 (d, 2H, N5CH2OH, J ) 1.3 Hz), 4.95 (s,
2H, C8H2), 4.45 (s, 2H, C6H2), 4.35 (m, 1H, H3′), 3.78-3.83 (m,
1H, H4′), 3.77-3.84 (m, 1H, CRH), 3.55 (dd, 1H, H5′, J ) 11.6,
4.8 Hz), 3.49 (dd, 1H, H5, J ) 11.6, 4.7 Hz), 2.58 (ddd, 1H, H2′,
J ) 13.2, 7.7, 5.9 Hz), 2.49 (m, 2H, CεH2), 2.21 (ddd,1H, H2′′, J
) 13.2, 7.8, 3.1 Hz), 1.50-1.69 (m, 2H, CꢀH2), 1.36-1.50 (m,
2H, CδH2), 1.36 (s, 9H, CH3-t-Boc), 1.28-1.34 (m, 2H, CγH2).
13C NMR (125 MHz, DMSO-d6) δ 173.9 COOH-Lys, 156.0 C10,
155.4 COO-t-Boc, 149.4 C4a, 148.2 C3a, 136.0 C2, 115.9 C1a,
87.4 C4′, 82.3 C1′, 77.6 C-t-Boc, 70.7 C3′, 68.5 N5CH2OH, 63.9
C6, 61.5 C5′, 61.2 C8, 53.2 CR, 49.0 Cε, 39.2 C2′, 30.4 Cꢀ, 27.9
CH3-t-Boc, 26.6 Cδ, 22.9 Cγ.
Cys-CH2-dC. 1H NMR (500 MHz, DMSO-d6) δ 8.59 (bs, 1H,
N4H), 7.83 (d, 1H, H6, J ) 7.5 Hz), 6.11 (Ψt, 1H, H1′ J ) 6.9
Hz), 5.99 (d, 1H, NRH-Cys, J ) 4.2 Hz), 5.74 (d, 1H, H5, J ) 7.4
Hz), 4.48 (dd, 1H, CH2a-linker, J ) 13.4, 6.6 Hz), 4.26 (dd, 1H,
CH2b- linker, J ) 13.4 6.5 Hz), 4.20-4.24 (m, 1H, H3′), 3.71-3.74
(m, 1H, H4′), 3.64-3.69 (m, 1H, CRH), 3.53-3.60 (m, 2H, H5′,
H5′′), 2.98 (ddd, 2H, CꢀH2, J ) 42.10, 13.5 4.3 Hz), 2.11 (ddd,
1H, H2′, J ) 12.9, 6.9, 4.4; Hz), 1.93 (td, 1H, H2′′, J ) 12.9, 6.9,
6.4 Hz), 1.35 (s, 9H, CH3-t-Boc). 13C NMR (125 MHz, DMSO-d6)
δ 171.1 COOH-Cys, 162.5 C4, 154.7 C2, 154.3 COO-t-Boc, 140.0
C6, 94.7 C5, 87.1 C4′, 84.5 C1′, 77.2 C-t-Boc, 69.5 C3′, 60.8 C5′,
55.2 CR, 42.3 CH2-linker, 40.1 C2′, 35.1 Cꢀ, 27.8 CH3-t-Boc.
His-CH2-dA. 1H NMR (500 MHz, DMSO-d6) δ 8.83 (bs, 1H,
NRH-His), 8.42 (s, 1H, H8), 8.34 (s, 1H, H2), 7.61 (s, 1H, Hε1-
His), 6.96 (s, 1H, Hδ2-His), 6.4 (bs, 1, N6H), 6.36 (Ψt, 1H, H1′, J
) 6.7 Hz), 5.59 (bs, 2H, CH2-linker), 5.32 (bs, 1H, OH3′), 5.12
(bs, 1H, OH5′), 4.39-4.43 (m, 1H, H3′), 3.84-3.89 (m, 1H, H4′),
3.83-3.87 (m, 1H, CRH), 3.48-3.65 (m, 2H, H5′,H5′′), 2.69-2.81
(m, 2H, CꢀH2-His), 2.70-2.75 (m, 1H, H2′), 2.27 (ddd, 1H, H2′′,
J ) 12.8, 6.7, 2.8 Hz), 1.27 (s, 9H, CH3-t-Boc). 13C NMR (125
MHz, DMSO-d6) δ 173.4 COOH-His, 154.6 COO-t-Boc, 148.7 C4,
140.23 C8, 139.3 C2, 138.2 Cγ-His, 136.1 Cε2-His, 119.5 C5, 115.5
Cδ1-His, 87.8 C4′, 83.7 C1′, 77.1 C-t-Boc, 70.6 C3′, 61.5 C5′, 54.0
CR-His, 49.9 CH2-linker, 39.1 C2′, 30.2 Cꢀ-His, 27.9 CH3-t-Boc.
Trp-CH2-dG. 1H NMR (500 MHz, DMSO-d6) δ 7.90 (s, 1H,
H8), 7.06 (d, 1H, Hε3-Trp, J ) 7.14 Hz), 6.91-6.98 (m, 1H, Hη2-
Trp), 6.50-6.58 (m, 2H, Hꢁ2, Hꢁ3-Trp), 6.14 (ψt, 1H, H1′, J ) 6.8
Hz), 6.24-6.06 (m, 1H, N2H), 5.30 (bs, 2H, CH2-linker), 4.36-4.32
(m, 1H, H3′), 4.29-4.21 (m, 1H, CHR), 3.79-3.83 (m, 1H, H4′),
3.68-3.66 (m, 1H, CꢀH2a), 3.58-3.44 (m, 3H, CꢀH2b, + H5′, H5′′),
2.51-2.61 (overlapping DMSO-d6), 2.17-2.25 (m, 1H, H2′′), 1.40
(s, 4H, CH3-t-Boc1), 1.34 (s, 5H, CH3-t-Boc2). 13C NMR (125
MHz, DMSO-d6) δ 156.3 C6, 152.8 CO2H, 150.0 C4, 150.0 Cδ2,
135.5 C8, 129.6 Cε2, 128.1 Cη2, 122.8 Cε3, 116.9 Cꢁ3, 116.8 C5,
108.4 Cꢁ2, 87.5 C4′, 82.8 C1′, 79.8 CH2-linker, 79.0 C-t-Boc, 70.8
C3′, 61.5 C5′, 56.4 CR, 45.3 Cꢀ, 39.3 C2′, 27.9 CH3-t-Boc.
Results and Discussion
Eight amino acids previously reported to form stable adducts
with formaldehyde15 were investigated (as NR-Boc derivatives)
in coupling reactions with all four nucleosides to determine
which would be of interest for characterization of reactions with
oligonucleotides and oligopeptides. No cross-links could be
detected with Arg, Gln, Tyr, or Asn, and consistent with
previous studies,10-13 the endocyclic nitrogen of dT did not form
a coupling product with any of the amino acids. Figure 1 shows
the differences in extent and rate of formation; reactions between
Lys and dG, and Cys and dG are essentially complete after 24 h,
whereas product increased over 72 h for the less reactive
combinations Cys and dA, Cys and dC, His and dA, and Trp
and dG.
Cys-CH2-dG. 1H NMR (500 MHz, DMSO-d6) δ 12.73 (s, 1H,
COOH-Cys), 10.82 (s, 1H, N1H-dG), 7.99 (s, 1, H8), 7.11 (bs,
1H, N2H-dG), 6.99-7.08 (m, 1H, NRH-Cys), 6.16 (ψt, 1H, H1′, J
) 6.8 Hz), 4.48-4.57 (m, 2H, CH2-linker, J ) 10 Hz), 4.35 (td, 1,
H3′, J ) 3.2, 3.2, 6.1 Hz), 4.06-4.12 (m, 1H, CRH-Cys), 3.78-3.82
(m, 1H, H4), 3.55 (dd, 1H, H5′, J ) 11.6, 4.8), 3.48 (dd, 1H, H5′′,
J ) 11.62, 4.8 Hz), 3.03 (dd, 1H, CꢀH2a, J ) 13.5, 4.47 Hz), 2.84
(dd, 1H, CꢀH2b, J ) 13.5, 9.19 Hz), 2.62-2.65 (m, 1H, H2′), 2.21
(ddd, 1H, H2′′, J ) 13.1, 6.8, 3.2 Hz), 1.36 (s, 9H, CH3-t-Boc. 13
C
9
J. AM. CHEM. SOC. VOL. 132, NO. 10, 2010 3391