2
Tetrahedron
1. Introduction
refractory to incorporation into nascent RNA due to the
placement of the PGA-specific protecting group (Fig. 1, B).
Cytidine and uridine are more reasonable choices for metabolic
labeling as they are evenly distributed throughout the genome,
and as we have seen before bulky substituents can prevent
processing by endogenous kinase enzymes, such as
uridine/cytidine kinase 1, thus preventing production of the
triphosphate.8 Furthermore, functionalizing the 5-position with
an alkyne is not predicted to alter the sugar orientation or the
anti-position of the nucleobase to disrupt RNA structure.
Cells exist in complex environments in Nature. Understanding
the molecular signatures of individual cell types has emerged as a
key challenge in biology and medicine. However, precise
chemical tools to do so are still overall lacking.1 RNA molecules,
once regarded as transient messengers between the genome and
proteome, have now emerged as key players in regulating cell-
fate decisions, development, cancer and neurological disorders.2-4
Cell-specific profiling of RNA molecules, in the form of gene
expression analysis, is extremely challenging. Such analyses rely
on cell-sorting techniques, which have been demonstrated to be
both notoriously dirty and often detrimental to the stability of
RNA molecules.5, 6 Such obstacles manifest themselves in high
false-positive rates in data analysis that can often mislead
biologists aiming to understand changes in gene expression or the
function of RNA.7 Cell-specific metabolic labeling has emerged
as a powerful technique in understanding the proteome and
metabolome, yet, with few exceptions, analogous methods for
RNA are still lacking. We recently reported the utility of
protected nucleoside and nucleobase analogs for cell-specific
metabolic labeling of RNA (Fig. 1, A).8, 9 In such a case an
exogenous enzyme can be expressed, within a cell-type of
interest, to reveal RNA metabolic intermediates containing bio-
orthogonal chemical handles. This approach allows the imaging
and isolation of nascent and cell-type specific RNA. Such an
approach is extremely powerful as it opens the door for cell-
specific RNA profiling with the potential to reveal novel
functions and gene expression signatures.
We previously demonstrated the enzyme uracil
phosphoribosyl transferase (UPRT) can be used to incorporate 5-
ethynyluracil into nascent RNA.8 Although useful, UPRT is
expressed in many lower organisms and as such the host
pyrimidine salvage pathway compromises its utility inside these
animals.10 As a viable alternative, we matched a N6-
phenylacetyladenosine analog with a 2’-azido handle (2’-AzA),
with an enzyme, penicillin G amidase (PGA).8 PGA is a
bacterial-specific enzyme that cleaves an amide bond.11 In our
work, cellular RNA is endowed with azido functionality only
when cells express PGA. While useful, we reasoned extending
the toolset of the PGA enzymatic removal paradigm could further
reveal additional nucleosides that can be used for metabolic
labeling of RNA.
Figure 1. Protected analogs for cell-specific metabolic labeling
of RNA. (A) Schematic of cell-specific metabolic labeling of
RNA. (B) Structures of analogs synthesized herein and their
proposed reactions with PGA.
PAC-5EC was synthesized in five steps starting from
commercially available 5-iodocytidine (5-IC; ESI). 5-IC was
subjected to a Sonogashira coupling to install the trimethylsilyl
alkyne handle at the 5-position, followed by protection of the
sugar alcohols with tert-butyldimethylsilyl groups. To synthesize
the PAC portion, 2-phenylacetic acid was treated with DCC to
yield 2-phenylacetic anhydride. The exocyclic amine of the silyl-
protected cytidine was then reacted with 2-phenylacetic
anhydride to install the phenylacetyl group, followed by
tetrabutylammonium fluoride deprotection of the silyl groups to
yield the final product, PAC-5EC, in an overall yield of 19%.
PACQM-5EU was synthesized in 13 steps starting from
commercially available uridine (ESI). Iodination of uridine
yielded 5-iodouridine, which was then subjected to a Sonogashira
coupling to install the trimethylsilyl alkyne handle at the 5
position. To synthesize the PAC portion, para-
aminobenzylalcohol was coupled to phenylacetic acid using 2-
ethoxy-1-ethoxycarbonyl-1,2-dihyrdoquinoline to generate N-(4-
(hydroxymethyl)phenyl)-2-phenylacetamide. The benzyl alcohol
was then brominated with PBr3, followed by Finkelstein reaction
to install the iodine. The iodo-intermediate was then coupled with
the TMS-protected uridine using potassium carbonate, followed
by tetrabutylammonium fluoride deprotection of the TMS group
to yield the final product, PACQM-5EU, in an overall yield of
11%.
We reasoned that the azido moiety of 2’-AzA could be prone
to reduction inside the cell, and more complicated environments
in vivo, which would negate its ability to be utilized for affinity
and purification experiments. Azide reduction has been observed
with many other azido-containing analogs, and modified
nucleosides, and can have a gross effect on their activity in cells
and animals.12 We also reasoned that adenosine analogs would be
highly biased towards RNAs that have long polyA tails, a
hallmark of messenger RNAs and long non-coding RNAs. Our
own work with 2’-Az-A has demonstrated that it is largely
incorporated into polyA tails.13 This is a well-documented
problem when understanding RNA expression levels through
RNA sequencing – the current state of the art method of
characterizing RNA expression.14 Together, these matters
underscore the need to expand the scope of novel analogs for
cell-specific metabolic labeling of RNA.
2. Results and Discussion
We sought to utilize a different nucleobase structure for
metabolic labeling of RNA. Penicillin G amidase is our current
enzyme of choice and as such we decided to design and
synthesize an alkynyl cytidine and uridine, which would be