8
V. Ferrario et al. / Journal of Molecular Catalysis B: Enzymatic 101 (2014) 7–15
are combined into a ‘principal component’ (or latent variable) so
that objects are projected on a space of reduced dimensionality (i.e.
a reduced number of independent variables corresponding to the
new components) [12]. In particular, when a certain response (Y
variable) of the system must be modeled and optimized, the new
variables or components are extracted to give the best fit of both
Y and X variable matrices. This is accomplished by applying the
PLS analysis, where the Y latent variables are correlated to X latent
variables [13].
fermented as submerged culture in shake flasks and the lipase
variants secreted into the fermentation medium. After the fermen-
tation, the lipase variants were purified from the sterile filtered
fermentation medium in a three-step procedure with (i) hydropho-
bic interaction chromatography on decylamine-agarose, (ii) buffer
exchange by gel filtration, and (iii) ion exchange chromatography
with cation exchange on SP-sepharose at pH 4.5. The lipase variant
solutions were stored frozen.
In principle, data calculated using molecular simulation meth-
ods could be analyzed by PLS to reveal the inherent structure of the
[14]. Although QSAR methods are well established in drug design
strategies [15], they have been applied to biocatalysis only recently
for the study of enzyme thermostability [16], enzyme specificity
[17,18] and enantioselectivity [19].
2.2. Experimental screening of mutants
The amidase activity was determined with a fluorimetric assay
similar to the one described previously [21].
The aqueous reaction mixture contained 0.03 mg/ml CaLB vari-
ant, 5 mM substrate (benzyl chloroacetamide), 25 mM phosphate
(potassium salt) pH 7.0, 10% (w/v) tetrahydrofuran and was set
up in 96-well microtiter plates. The concentration of the enzyme
stock solution was determined by measuring the absorbance at
280 nm and calculated based on the extinction coefficient of 1.21
for CaLB. The microtiter plate was covered with parafilm and incu-
bated for 18 h at 37 ◦C and 300 rpm in the MTP Thermomixer
Comfort (form Eppendorf AG). Afterwards 50 ml of 20 mM 4-
chloro-7-nitrobenzofurazan (NBDCl) in 1-hexanol was pipetted to
200 ml reaction mixture and incubated for 1 h at 37 ◦C and 500 rpm.
Afterwards the fluorescence intensity was measured with the fluo-
rimeter Fluostar Optima (from BMG Labtech GmbH) with excitation
filter at 485 nm and emission filter at 540 nm. The specific activity of
CaLB wild type is 1.27 0.16 × 10−2 mol/mg/h. The activity value
for each enzyme variant is the average of three replications, in three
wells. In parallel, each enzyme variant was also set up in three dif-
ferent wells without substrate, to measure the small background
fluorescents from each variant. Further three wells were set up
without substrate and without enzyme, thus only aqueous medium
with buffer and solvent, to measure the small background fluores-
cents from the medium. The average value from the measurements
with enzyme and with substrate is named ‘es’; the value with
enzyme and without substrate is ‘e’; the value without enzyme and
with substrate is ‘s’; the value without enzyme and without sub-
strate is ‘o’. To correct for fluorescents from enzyme, medium and
autohydrolysis the following subtractions were calculated: (es-e)-
(s-o) = (es-s)-(e-o) = es-e-s + o = corrected enzyme variant activity
value. S-o = autohydrolysis of substrate. Because these measure-
ments are in arbitrary fluorescents units, these values were
normalized to percentage of substrate that reacted. Zero percent
means no reaction of substrate, 100% means complete reaction of
substrate to products. For this normalization two wells for each
enzyme variant were set up in parallel. These wells contained the
same composition as the other wells, except that they contained
instead of substrate the products at a concentration that corre-
sponds to 10% reaction. Thus, the concentrations in these two wells
were 0.03 mg/ml enzyme, 0.5 mM hydrolysis products (i.e. 0.5 mM
benzylamine and 0.5 mM chloroacetic acid), 25 mM phosphate pH
7.0, 10% (w/v) tetrahydrofuran. The average value from the mea-
surements with enzyme and with product is named ‘ep’. To correct
for background fluorescents from the enzyme variants, the average
value from the measurements with enzyme and without product
(and without substrate), which was named ‘e’, was subtracted. In
short, the normalized and corrected enzyme variant activity was
calculated as 10((es-e)-(s-o))/(ep-e).
The present study represents, to the best of our knowledge, the
first attempt to apply 3D-QSAR analysis for correlating structures of
mutants and their activity within an enzyme engineering strategy.
In order to fully exploit fundamental knowledge while avoiding
conceptual biases, computational and statistical techniques were
evaluated in the context of enhancing amidase activity in Candida
antarctica lipase B (CaLB). As is widely known, most proteases, ami-
dases and lipases contain the same type of catalytic machinery as
serine hydrolases. Nonetheless, esterases/lipases have very low or
undetectable amidase activity and this fact has been extensively
discussed [20]. The QSAR study here reported allowed obtaining
quantitative information on variables affecting amidase activity
and their interactions, thus providing guidelines for mutagene-
sis strategies. Overall, by “learning” from a model that correlates
structural information with kinetic data, the methodology allows to
extract objectively information that actually affects the activation
energy of the reaction of interest. Of course, QSAR predictive mod-
els are reliant on the data on which they are based and on the overall
quality of the information, including the item to be modeled. There-
fore, the validity and extendibility of the method strongly depends
on the data set used for training the model. The 3D-QSAR math-
ematical analysis here reported provides a semi-quantitative tool
for in silico screening of virtual mutants and the perspectives for the
exploitation of these computational approaches to rational enzyme
engineering are also discussed herein.
2. Experimental
2.1. Production of CaLB mutants
The experimental CaLB variants were produced according the
procedure previously reported by Suplatov et al. [21].
Variants of CaLB were generated by polymerase chain reaction-
based (PCR) site-directed mutagenesis. The PCR was set up with
the proof-reading KOD DNA polymerase (from Novagen, Toyobo)
and a 7467 basepair Escherichia coli–Aspergillus plasmid that was
previously methylated with CpG methyltransferase (from NEB).
The PCR products were used to transform competent E. coli DH5a
cells (from TaKaRa). Plasmid DNA was recovered and sequenced
to verify the presence of the desired substitution. Confirmed plas-
mid variants were used to transform an Aspergillus oryzae strain
that is negative in pyrG (orotidine-5-O-phosphate decarboxylase)
and that is also negative in the proteases pepC (a serine protease
homologous to yscB), alp (an alkaline protease) NpI (a neutral
metalloprotease I) to avoid degradation of the lipase variants during
and after fermentation. The transformed Aspergillus strains were
2.3. Modeling CaLB WT and mutants
All the calculations were performed using GROMACS software
[22] version 4 compiled on a virtual ROKCS linux cluster with 4
nodes.