DOI: 10.1002/anie.201101331
Protein Chemistry
Modular Assembly of Macrocyclic Organo–Peptide Hybrids Using
Synthetic and Genetically Encoded Precursors**
Jessica M. Smith, Francesca Vitali, Steven A. Archer, and Rudi Fasan*
Macrocyclic peptides and peptide-containing molecules are
attractive molecular scaffolds for the development of bioac-
tive compounds to modulate biomolecular interactions. These
structures combine a high degree of functional complexity
with restricted conformational flexibility, which make them
well-suited to achieve selective and tight binding to extended
biomolecular interfaces, such as those mediating protein–
protein and protein–nucleic acid complex formation.[1] Com-
pared to linear peptides, conformationally constrained pep-
tide-based ligands often exhibit higher proteolytic stability,[2]
enhanced cell permeability,[3] and higher affinity towards the
target biomolecule,[4] which render them valuable as probes
and potential pharmacological agents. Indeed, many cyclic
and lariat peptides isolated from natural sources[5] exhibit
potent biological activities and have provided a source of
viable drugs.[1d]
the size and composition of which can be readily diversified
by varying the nature of the synthetic and biosynthetic
precursors (Scheme 1).
Our approach takes advantage of the reactivity of intein
proteins[12] and the opportunity to introduce bioorthogonal
functionalities into proteins by amber stop codon suppres-
sion.[13] We envisioned that incorporating an alkyne-bearing
non-natural amino acid within the N-terminal portion of an
intein-fused polypeptide would yield a recombinant protein
carrying two functional groups with orthogonal reactivity,
namely the alkyne moiety and the thioester bond transiently
formed at the junction with the intein by reversible N!S acyl
transfer. A tandem chemoselective reaction could thus be
exploited to mediate coupling of this biosynthetic precursor
(BP) to an azide/hydrazide-containing synthetic precursor
(SP) and promote the formation of an organo–peptide
macrocycle (Scheme 1).
To test our design, we first constructed a plasmid
(pBP_MG6) encoding for a 6mer target sequence (TS6:
TGSYGT) preceded by Met, Gly, and the amber stop codon
TAG and fused at the C terminus to the N-terminal cysteine
of intein GyrA from Mycobacterium xenopi.[14] This construct
was expressed in E. coli in the presence of O-propargyltyr-
osine (OpgY) and a previously described mutant tRNACUA
(MjtRNACUA)/tyrosyl-tRNA synthetase (MjTyrRS) pair[15]
encoded by a second vector (pEVOL[16]). The latter allow
the site-selective incorporation of OpgY at the N-terminal
end of the target sequence in the biosynthetic precursor by
stop codon suppression. The resulting protein, called MG6,
was purified by nickel-affinity chromatography and its
identity confirmed by MALDI-TOF (Figure 1a). To test the
macrocyclization reaction, the bifunctional synthetic precur-
sor 1 was synthesized and coupled to MG6 by CuI-catalyzed
azide–alkyne 1,3-dipolar cycloaddition[17] (CuAAC; 20 min)
followed by removal of the copper catalyst and excess 1 by
fast buffer exchange (2 min). Formation of the MG6-1 adduct
occurred quantitatively and was followed by complete splic-
ing of the GyrA intein after 16 h as indicated by MALDI-
TOF analysis (Figure 1a). This process was accompanied by
the accumulation of a product with molecular mass (m/z
1016.3) corresponding to the desired organo–peptide macro-
cycle 7, as revealed by LC-MS (Figure 1b). Along with the
major macrocyclic product, the formation of a small amount
(ca. 20%) of the acyclic peptide H2N-G(OpgY-1)TGSYGT-
COOH (8; m/z 1034.3) was also observed, indicating that
hydrolysis of the MG6-1 adduct competes to a minor extent
with the macrocyclization process. The cyclic backbone of the
predominant product (7) was further evidenced by MS/MS
analysis (Figure 1c), which showed few fragments as a result
of multiple ring-opening pathways leading to acylium ions of
Both biosynthetic[6] and synthetic[7] methods have been
implemented to afford peptides in cyclic or conformationally
constrained configurations. Genetic encoding offers the
unrivalled advantage that vast molecular libraries (108–1010)
can be rapidly created by combinatorial mutagenesis and
readily explored using genetic selection or ultrahigh-through-
put screening methods.[6,8] However, the pool of building
blocks available for construction of biological peptide libra-
ries remains restricted, limiting the degree of ligand diversity
achievable through these approaches. In contrast, synthetic
methods can draw upon a much broader spectrum of
precursor structures, including non-natural amino acids,[9]
peptoids,[10] and amino acid unrelated scaffolds,[11] which can
be exploited to confer improved or novel conformational and
target-binding properties to peptide-based ligands.
Integrating the advantages of biological and synthetic
approaches would open unprecedented opportunities for
ligand diversification and molecular discovery. Towards this
goal, we have developed a method that allows the embedding
of non-proteogenic synthetic moieties into genetically en-
coded peptidic frameworks. This strategy enables the modular
assembly of macrocyclic organo–peptide hybrids (MOrPHs),
[*] J. M. Smith,[+] Dr. F. Vitali,[+] S. A. Archer, Prof. Dr. R. Fasan
Department of Chemistry, RC Box 270216, University of Rochester
Rochester, NY 14627 (USA)
E-mail: fasan@chem.rochester.edu
[+] These authors contributed equally to this work.
[**] This work was supported by startup funds from the University of
Rochester and a NSF Graduate Fellowship to J.M.S. The authors
thank Prof. Peter Schultz for kindly providing the pEVOL vector
encoding for the mutant MjtRNA and MjTyrRS.
Supporting information for this article is available on the WWW
Angew. Chem. Int. Ed. 2011, 50, 5075 –5080
ꢀ 2011 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
5075