Journal of the American Chemical Society
Article
protein binding partners from each PAL probe (Figure 5b).
Examination of the number of PSMs per probe revealed a
bimodal distribution that clustered as a function of positively
charged probes versus neutral or negatively charged probes
(Figure 5b, inset). Positively charged probes produced on
average 418 PSMs corresponding to 50 unique binding sites
per probe, whereas neutral probes produced an average of 65
PSMs (14 unique binding sites) and negatively charge probes
produced an average of 14 PSMs (5 unique binding sites). No
statistical difference between the neutral or negatively charged
PAL probes was observed, although neutral PAL probes with a
larger hydrophobic surface area tended to yield a greater
number of PSMs (e.g., JN3, JN939, and JN38). Similar to
observations from the proteome, we found that a majority of
the binding sites were from membrane proteins (58% of
PSMs) (Figure 5c). The wider structural diversity of the 32
PAL probes suggests the increased labeling of membrane
proteins is a result of the labeling propensity of alkyl diazirine
chemistry in contrast to the probes’ individual structures.
Although the positively charged members of the library were
six of the top seven probes by PSM counting, the positively
charged probe JN942 was an outlier at rank 16 of 32 probes.
We investigated the cause of this outlier in vitro and observed
unaffected labeling efficiencies on individual amino acids and
proteins (Figure S13), suggesting that in addition to the net
charge, cellular accessibility of the small molecule to
biomolecular targets should also be considered in probe
design.
As further evidence of labeling preferences between alkyl
diazirines and acidic amino acid residues in the whole
proteome, we compared the occurrence of amino acids
found within labeled peptides to their natural occurrence in
the human proteome (Figure 5d). Glutamic acid was the most
enriched amino acid in labeled peptides and increased
specifically in peptides labeled by positively charged probes;
most of this enrichment was eliminated in binding sites
measured from negatively charged probes. The limited
enrichment of aspartate residues reflects previous trends
observed from labeling propensities with bona fide diazo
esterification reagents.20 This may be caused by differences in
steric accessibility or the generally lower pKa of Asp relative to
Glu, which would favor reactivity with Glu and disfavor
reactivity with Asp. In sum, for alkyl diazirine PAL experiments
in cells, glutamic acid has the highest propensity for labeling in
comparison to other amino acids, and the labeling frequency is
influenced by the local pKa environment and the physical
properties of the photoaffinity probe.
electrostatic potential of each protein surface was calculated
using the adaptive Poisson−Boltzmann solver (APBS) and
overlaid on the structure of the protein. For clarity, only the
negative electrostatic surface map is displayed in red. For
binding sites that were observed as labeled by only one or two
PAL probes, the mapped peptide has a minor degree of
negative electrostatic density nearby (Figure 5f). In some
cases, the mapped peptide is within 9 Å21 to a known binding
site for the natural protein ligand. For example, ADP-
ribosyltransferase protein PARP15 has a nicotinamide binding
cleft that is adjacent to the observed binding site.45 PD6-
interacting protein (PDCD6IP) binds to a natural peptide
ligand in close proximity to the observed small molecule
binding site (peptide ligand highlighted in orange, Figure 5f).46
The 14−3−3 proteins are receptors for peptides in a cleft
within 9 Å of the mapped binding site.47 Transferrin receptor 1
is a membrane-bound protein; the binding site is in the
extracellular region.48
In contrast, binding sites on proteins that were mapped to
multiple PAL probes exhibited significantly greater negative
electrostatic densities around the labeled peptide (highlighted
in dark blue, Figure 5g). Vimentin is a filamentous protein that
is frequently identified in alkyl diazirine labeling experiments,
likely because of the significant negative electrostatic density
on vimentin (net charge = −27). Unsurprisingly, the 17 unique
peptides conjugated across 19 different PAL probes identified
from vimentin colocalize with the negative electrostatic density
map. Likewise, 8−12 PAL probes conjugated to VDAC1, ER
chaperone BiP, and cathepsin B, which each display similar
negative electrostatic densities in close proximity to the
mapped peptide binding site. Taken together, these data
suggest that the subset of proteins that are labeled more
frequently by alkyl diazirine chemistry typically have negative
electrostatic regions that are characterized by a density of
acidic residues or those with a relatively high localized pKa,
resulting in an increase in the degree of protonation of the
amino acid for labeling through the diazo intermediate during
PAL experiments.
DISCUSSION
■
Despite the growing use and application of diazirine chemistry
in PAL experiments, a systematic understanding of the labeling
preferences of the diazirine in cells has yet to emerge. Here, we
report a systematic analysis of diazirines and their preferential
reactivity pattern with protein biomolecules in vitro and in
cells. We find that alkyl diazirines preferentially react with
acidic amino acid residues in a pH-dependent manner in vitro,
which can be rationalized by the formation of a long-lived
diazo intermediate that is intercepted by organic acids prior to
formation of an alkyl carbene. By contrast, the aryl-
trifluorodiazirine forms insertion products with all 20 amino
acids, is less dependent on pH, and its labeling chemistry is
readily quenched by water; these are characteristics of a
reaction pathway through a carbene intermediate. Although
reactions with individual amino acids in neat conditions are
only an approximation of the PAL probe reactivity with a
target protein in cells, the impact of the alkyl diazirine
reactivity pattern translates to a predictable enhancement of
cellular labeling with alkyl diazirine probes carrying a net
positive charge from a library of 32 alkyl diazirine probes,
which may arise from enhanced association of the probe with
the matched acidic regions on protein surfaces. In line with this
expectation, chemoproteomics and binding site data show that
Across all the mapped probe-labeled binding sites, we found
that a binding site on VDAC1 was frequently observed
(representative assignment, Figure 5e). This peptide on
VDAC1 is in close proximity to two negatively charged
residues on the protein (Y67 and E73). VDAC1 E73 is notable
for its unusually high pKa (predicted membrane pKa = 7.4) and
ability to sensitively mediate dimerization of VDAC1 in a pH-
dependent manner.44 Protonation of VDAC1 E73 would
increase the propensity for E73 to react with a diazo
intermediate. VDAC1 and VDAC2, but not VDAC3, possess
this glutamic acid residue, which explains why only VDAC1
and VDAC2 are frequently enriched targets in alkyl diazirine
studies.13
Intrigued by this analysis with VDAC1, we next visualized a
subset of the binding sites onto available protein structures
(binding sites highlighted in dark blue, Figure 5f,g). The
6697
J. Am. Chem. Soc. 2021, 143, 6691−6700