DOI: 10.1002/cbic.200900690
A Genetically Encoded e-N-Methyl Lysine in Mammalian Cells
Dan Groff,[a] Peng R. Chen,[b] Francis B. Peters,[a] and Peter G. Schultz*[a]
The post-translational methylation of lysine modulates the
activity, stability, localization and biomolecular interactions of
many eukaryotic proteins. For example, monomethylation of
Lys372 in the mammalian tumor suppressor p53 has been
shown to affect protein stability and localization.[1] Protein
methylation plays a particularly important role in gene expres-
sion due to its involvement in the histone code, in which spe-
cific modifications to histone proteins modulate the transcrip-
tional status of specific genes. Methylation of distinct histone
lysine residues has been correlated with both transcriptional
activation and repression depending on the lysine modified.[2]
To better understand the functional consequences of lysine
methylation, methods are needed to generate proteins with
defined methylation status both in vitro and in living cells.
One straightforward in vitro method for lysine methylation
makes use of methyltransferase enzymes,[3] which transfer the
S-methyl group of the cofactor S-adenosyl methionine to the
sidechains of lysine and arginine. These enzymes have very
specific sequence requirements, and their activity can lead to
heterogeneous mixtures of methylation products.[4] To circum-
vent these limitations, one can use solid-phase peptide synthe-
sis to directly incorporate methylated lysine residues into full-
length proteins by using native chemical ligation.[5] In addition,
in vitro translation has been adapted for the production of his-
tone tails with multiple modified lysine residues.[6] However,
these methods can suffer from low yields, restrictions on the
site of modification, and are not easily adapted to cellular stud-
ies. Recently, an in vitro chemical modification strategy was de-
veloped that takes advantage of the unique reactivity of cys-
teine. The reaction between cysteine and N-methyl aminoeth-
ylhalides generates a thioether adduct structurally similar to
methyl lysine. Mono-, di-, and tri-methylated lysine analogues
have been generated in this way (Scheme 1).[7] By using a simi-
lar strategy, unnatural amino acid mutagenesis was used to se-
lectively incorporate phenylselenocysteine into proteins. It can
be subsequently oxidized to dehydroalanine and reacted with
aminoethylthiols to again produce methyl lysine analogues.
However, an initial oxidation step is required, so this strategy is
not compatible with proteins containing redox active cysteine
or methionine residues.[8]
Biosynthetic approaches have also been used to incorporate
methyl lysine into proteins. For example, the e-methyl lysine
precursor Ne-tert-butyl-oxycarbonyl Ne-methyl-l-lysine (Boc
methyl lysine) has been genetically inserted into proteins in
E. coli. This strategy produces good yields of homogeneously
monomethylated proteins largely independent of sequence
context.[4] However, Boc deprotection in aqueous TFA is neces-
sary to generate monomethyl lysine and can cause denatura-
tion and loss of biological activity with many proteins.[9]
All of the above methods for introducing methyl lysine re-
quire in vitro manipulation to produce the final, methylated
protein. This restriction generally prevents studies of methylat-
ed proteins in their native cellular context. To overcome this
limitation, we have genetically encoded the photocaged N-
methyl lysine, Ne-o-nitrobenzyl-oxycarbonyl-Ne-methyl-l-lysine
1, in both bacteria and mammalian cells. Photocaged second
messengers and proteins are widely used tools for the spatial
and temporal control of a variety of cellular processes because
light allows noninvasive generation of the active photoprod-
ucts in the cell. Furthermore, it has been previously demon-
strated that o-nitrobenzyl-O-tyrosine and dimethoxy-o-nitro-
benzyl-O-serine can be efficiently deprotected with light in
E. coli,[10] Xenopus oocytes,[11] and yeast.[12]
The synthesis of photocaged methyl lysine 1 involves two
sequential reductive aminations of Na-tert-butyl-oxycarbonyl-l-
lysine with benzaldehyde and formaldehyde using STABH to
afford Na-tert-butyl-oxycarbonyl-Ne-benzyl-Ne-methyl-l-lysine.[13]
Reductive debenzylation quantitatively yielded Na-tert-butyl-
oxycarbonyl-Ne-methyl-l-lysine 4 which was then coupled to o-
nitrobenzyl chloroformate to produce 5. Boc deprotection pro-
ceeded quantitatively with HCl in dioxane to generate photo-
caged methyl lysine 1.
To genetically encode 1, we used a pyrrolysyl-tRNA synthe-
Pyl
CUA
tase from M. barkeri (MbPylRS) and a pyrrolysyl tRNA (tRNA
)
from M. mazei,[14] which previously were adapted for the site-
specific incorporation of unnatural amino acids into proteins in
response to the amber nonsense codon TAG.[15–17] The ortho-
gonality of this pair to endogenous tRNAs and aminoacyl-tRNA
synthetases (aaRSs) has been demonstrated in both E. coli and
mammalian cells, so that an aaRS evolved to incorporate the
unnatural amino acid Ne-o-nitrobenzyloxycarbonyl-l-lysine in
E. coli can also be used in mammalian cells. An aaRS library
was created in E. coli in which the codons for residues L270,
Y271, L274 and C313 of the pyrrolysyl-tRNA synthetase from
M. barkeri were all randomized as NNK. Directed evolution with
two positive rounds and one negative round of selection as
previously described[18] resulted in the identification of the
aaRS G12 (experimental details in Supporting Information),
which is capable of incorporating 1 into proteins. aaRS G12
has the following mutations: Y271I, L274M, and C313A.
[a] Dr. D. Groff,+ F. B. Peters,+ Prof. Dr. P. G. Schultz
Department of Chemistry, The Scripps Research Institute
10550 N. Torrey Pines, La Jolla, CA 92037 (USA)
Fax: (+1)858-784-9440
[b] Dr. P. R. Chen+
Department of Chemical Biology, Peking University
Beijing 100871 (China)
[+] These authors contributed equally to this work.
Supporting information for this article is available on the WWW under
http://dx.doi.org/10.1002/cbic.200900690.
1066
ꢀ 2010 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
ChemBioChem 2010, 11, 1066 – 1068