Journal of the American Chemical Society
Communication
approaching a local maximum in activity or may need
stabilizing mutations before further activating mutations can
be found. Experimental procedure may also need to be
reworked to avoid possible limitations in dynamic range as
enzymes improve. As we sought to evolve on this platform as a
proof of principle and managed to do so, we decided to end
the campaign.
To acquire the fold improvement of each parent over their
predecessor, we ran the top variants from each round at
analytical scale and measured their activities against their
parents by GCMS. The final variant displayed a 2-fold
improvement (1300 total turnovers (TTNs)) (Figure 3).
The use of a thiol-tagged reactant to allow immobilization of
the product was a convenient choice in this work but would
not be compatible with many reactions. Hence, we repeated
the C−H insertion reactions using a molecule that lacked a
thiol group and that could be immobilized to a monolayer
using a “traceless” immobilization method (Figure S7). In this
scheme, a monolayer presenting a diazirine group reveals a
carbene on irradiation, which reacts nonselectively to
immobilize nearly all molecules.37,38 We performed reactions
for five additional substrates and used the traceless
immobilization scheme to analyze products and, in each
case, observed peaks in the mass spectra that corresponded to
reactant and product (Figure S8). This example demonstrates
that reactants need not be functionalized for immobilization
and, in turn, suggests that this method will have a very broad
relevance in directed evolution.
Throughout the course of this study, 22 944 spectra were
generated and processed. With each plate requiring only 30
min per run, SAMDI collected data more than 100-fold faster
than classic GCMS and approximately 10-fold faster than
recent developments in the state-of-the-art thereof.20,39 If
higher throughput is desired, the method may be accelerated
by approximately 2-fold by working with groups of 16 96-well
plates in 50 min per run, with each sample requiring only 0.5
μL from each well.33
SAMDI-MS has been used extensively to profile enzymatic
activity both in biochemical reactions and from complex
lysates, while permitting the analysis of up to thousands of
samples per hour and more than 30 000 experiments per
day.31,32,36,40−42 Hence, it is clear that the throughput in this
study was not limited by the number of variants that could be
screened.
The approach described here has the benefits that it is high-
throughput, compatible with all library diversification techni-
ques performed in multiwell plates and may be applied to any
reaction that produces a shift in mass.33 While epPCR allowed
SAMDI to find modest fold improvements in the present work,
we expect that utilizing other diversification techniques will
result in greater improvements.
Figure 3. We used GCMS to characterize the hits and obtain the total
turnover (TTN) for each variant. The evolutionary lineage of P411
for C−H alkylation is displayed. Bars represent mean yields
(performed from two independent cell cultures, each used for
duplicate reactions). Reaction conditions were as follows: cytochrome
P411 in E. coli whole cells (optical density at 600 nm, OD600, of 1), 5
mM substrate, 5 mM ethyl diazoacetate, 5 vol % EtOH in M9-N
buffer at room temperature under anaerobic conditions for 18 h. The
asterisk symbol represents the introduction of a stop codon. See the
SAMDI-MS can accommodate a wide variety of chemical
transformationswithout sacrificing throughputas other
immobilization strategies have been demonstrated and are
readily available.35,37,38,40,43 In this way, the assay is not limited
to certain classes of reactions but can be adapted to many
organic transformations. We note that this method cannot be
applied to reactions where the product and substrate share the
same massincluding stereoisomeric and tautomeric struc-
turesand would in those cases require a second reaction step
(that is selective for one of the molecules), tandem mass
spectrometry, or a separation step.
As directed evolution continues to add new chemistries to
Nature’s repertoire, generating small molecules with increasing
complexity,44 the need for high-throughput and generalizable
screening tools is paramount. It is the use of immobilization
chemistry that distinguishes SAMDI’s throughput and
substantiates the method to be well suited for evaluating
variants in applications of directed evolution. This platform
enables directed evolution efforts to evolve enzymes for
improved activity and interrogate wider areas of protein space.
We anticipate that further use of this method will lead to
exploring larger areas of chemical space in high throughput and
help uncover unexpected solutions for creating better enzymes.
Interestingly, none of the beneficial mutations were in the
active site of the enzyme or at sites previously mutated in
rational approaches (Figure S4). By not restricting the
sequence space explored, we were able to identify potential
allosteric effects and provide new sites that may be investigated
in targeted evolution.
To demonstrate the reproducibility of the SAMDI
technique, we selected and scaled up one variant from the
final round to be screened repeatedly with SAMDI. Here we
found a standard deviation of 2.3% with a resolving power of
0.1 m/z (Figure S5). The primary source of variability about
the mean is likely due to application of matrix, which leads to
modest differences in signal strength from spot to spot.34 We
also note that while the SAMDI technique is able to accurately
quantitate the extent of each reaction,36 we only required
relative product yields to proceed with evolution, and thus,
accurate yields were determined only for the variants validated
by GCMS. Experimental reproducibility at-large was shown by
inducing multiple colonies of the same clone for the initial
variant. Here, we found a coefficient of variation (CV) of 14%
C
J. Am. Chem. Soc. XXXX, XXX, XXX−XXX