and identifying active site residues likely to clash with large
substrates based on computational models of these com-
pounds docked in the enzyme active site. Because it is difficult
to predict which of the chosen residues might actually impact
catalysis,[11] a combinatorial library design that includes all per-
mutations of alanine substitutions was utilized. Library mem-
bers were then screened for activity with a panel of bulky sub-
strates that possess methylated heteroatoms by detecting
formaldehyde released during P450-catalyzed removal of Me
or MOM groups.[8] These substrates thus serve as efficient
probes for enzyme activity and for compatibility of valuable
natural product scaffolds with BM3 variants.
substrate scope of the library. Each of these is relatively large,
possesses several methylated heteroatoms, and belongs to a
privileged class of compounds (i.e., monosaccharides, alkaloids,
and steroids), for which novel hydroxylation catalysts could be
highly valuable (vide infra). High-throughput screening was
carried out in 96-well microtiter plate format[8] by addition of
NADPH to solutions of the appropriate substrate and cell
lysate. Following incubation at room temperature, the reac-
tions were quenched with a basic solution of Purpald, which
reacts with the formaldehyde produced from P450-catalyzed
heteroatom demethylation to generate a purple dye. The
parent enzyme, 9-10AF87VTS, displayed only weak activity
with these substrates, and provided a signal of 20% over back-
ground in the best cases (data not shown). However, many
variants with high activity against several substrates were im-
mediately evident based on visual inspection, and the formal-
dehyde concentration (and thus the extent of demethylation)
was quantified by using a plate reader (A550nm).
BM3 variant 9-10AF87VTS, a thermostable form of an
enzyme (9-10AF87V) previously found to have activity against
a-1,2,3,4-tetramethoxymethyl xylose (1),[8] was selected as a
parent for library creation. Structures for the most stable con-
formers of 1 and previously unreactive monosaccharide sub-
strates, including 1,2,3,4,6-pentamethoxymethyl glucose (2),
galactose (3), and mannose (4), were generated by using the
program Omega.[12] The resulting conformers were then placed
into the active site of a model of 9-10AF87VTS such that their
terminal methyl CÀH bonds
Variants with 4.1–7.9-fold improvement in demethoxymethy-
lation activity toward each of the MOM-protected hexose
derivatives were identified (Table 1). Furthermore, GC or HPLC
were oriented according to the
Table 1. Sequence–activity relationship for alanine substitution.
transition state geometry for cy-
tochrome P450-catalyzed H-ab-
straction proposed by Rydberg
et al.[13] The transition state en-
semble was expanded by vary-
ing rotational degrees of free-
dom and reduced by eliminating
substrate poses that clashed
with the enzyme (within 2.5 ꢁ of
a backbone or b carbon). Inspec-
tion of the final ensemble of
prospective transition state con-
formations led to the identifica-
tion of eight residues for re-
placement with alanine: K69,
L75, M177, L181, T260, I263,
T268, and L437. Combinatorial
substitution was accomplished
Substrate
Variant
Alanine substitution (+) at residue
Fold
69
75
177
181
260
263
268
437
improvement[a]
2A1
4H9
8C7
–
–
–
+
–
+
–
–
–
+
+
+
–
+
–
–
–
–
–
–
–
+
+
–
4.4
4.1
7.9
thioglycosides (5–7)
4H5
4H9
7A1
8C7
8F11
–
–
–
–
–
+
–
+
+
–
+
–
+
–
+
+
+
+
–
–
–
–
–
–
–
–
–
–
–
–
–
+
–
–
+
2.7
3.9
2.8
2.7
3.3
+
+
–
alkaloids (8, 9)
steroid (10)
–
–
8F11
–
–
–
–
–
–
–
+
n.a.[b]
[a] Ratio of A550 measurement for reaction of each variant to that of the parent (9-10AF87VTS). Ratio for sub-
strate with maximum improvement shown. [b] Improvement identified by visual inspection; substrate insolubil-
ity complicated plate reader measurement.
by cloning fragments of the parent gene containing these resi-
dues by using degenerate primers or primer mixtures encod-
ing either the original residue or alanine at each site (see the
Supporting Information). These fragments were assembled to
generate a 28 (256) member library containing the desired ala-
nine substitutions. E. coli were transformed with the library
mixture and 767 single colonies (ca. 3ꢂ theoretical library size)
were picked and used to inoculate media in 96-well deep-well
plates. Following protein expression and cell lysis, CO binding
analysis revealed that 65% of the enzymes were properly
folded.[14] Sequences from 1% of the library indicated unbiased
incorporation of alanine at all of the desired sites, and an aver-
age alanine substitution per sequence of 3.9.
analysis of the crude mixtures indicated that these reactions
proceeded in moderate to high conversion while still maintain-
ing high regioselectivity (vide infra). On the other hand, few
variants with improved activity against the smaller pentose
substrate, 1, were obtained from this library; this is consistent
with the expanded active site providing an advantage only for
the reaction of larger substrates (see the Supporting Informa-
tion). This library also contained variants with marked improve-
ments in activity with alkaloid and steroid substrates 8, 9, and
10, despite the fact that such structures were not used in the
library design. For most substrates, the best variants possessed
at least two alanine substitutions, and in general, positions 75,
177, 181, 260, 263, and 437 proved advantageous, while ala-
nine substitution at 69, 263, and 268 did not.
Several compounds, including MOM-protected thioglyco-
sides 5, 6, and 7, thebaine (8), dextromethorphan (9),[15] and
trimethyl estriol (10; Scheme 1), were selected to probe the
Given that these variants possessed sufficient activity with
substrates 2–10 to enable their detection with the aforemen-
ChemBioChem 2010, 11, 2502 – 2505
ꢀ 2010 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
2503