Substrate Docking Algorithms of Cytochrome P450cam
J. Am. Chem. Soc., Vol. 119, No. 24, 1997 5491
and assigning a score to the fit of each compound, and (c)
selection of compounds to be tested from the resulting list based
on additional criteria (see below). In effect, a 20 000 compound
subset of the Available Chemical Directory (ACD)19b was
screened with DOCK. Contact scoring, which assigns the
highest scores to compounds that maximize nonpenetrating
contacts between the substrate and the active site, was used to
evaluate the goodness of fit.18 High contact scores are
considered desirable for the identification of potential inhibitors,
but the suitability of high contact scores for the evaluation of
potential substrates had not been examined prior to this study.
Of the 500 compounds with the highest contact scores, a small
group of structurally diverse compounds was selected (Table
1) that did not have functionalities ionized at physiological pH,
were relatively simple structures, and represented a diversity
of structural types.11,19
To test the ability of this computer-assisted approach to
discriminate between substrates and nonsubstrates, a group of
compounds predicted not to fit within the P450cam active site
was also identified (Table 2). In order to make this a rigorous
test, the P450cam active site model based on the crystal
coordinates was modified by replacing the side chain of Leu-
244 with that of an alanine (i.e., by removing three carbons
from the crystal coordinates) while holding the rest of the atoms
fixed in their original positions. Dock was then run on the
“mutated” active site exactly as done for the native enzyme,
and a difference list was made of the compounds that fit within
the model L244A but not wild-type active site. This approach,
which requires that DOCK differentiate between ligands that
bind to sites that differ by only a small volume, generated a list
of predicted nonsubstrates closely related to the predicted
substrates (Table 2).11
search. Since better three-dimensional structures than those
provided by CONCORD can be generated by using the
molecular mechanics functions in the SYBYL modeling
package,19d the molecules selected from the initial DOCK
analysis of the ACD were minimized by using the default values
in SYBYL and were reDOCKed into the active site. Optimiza-
tion of the structures with the SYBYL molecular mechanics
programs resulted in two compounds (10, 11) migrating from
the nonsubstrate to the substrate list and one (12) migrating in
the opposite direction.
Dependence of Prediction on DOCK Parameters. Initial
efforts to predict P450cam substrates were carried out with the
default settings for the adjustable parameters in DOCK.
However, as these default settings are not necessarily optimal
for the identification of substrates, we have examined the effects
of varying some of these parameters. Studies of the minimum
distance allowed between an atom of the substrate and an atom
of the enzyme before the contact is classified as prohibitive have
been particularly fruitful. Two such distances are defined in
DOCK, one for polar and one for nonpolar interactions: the
default values for these distances are 2.3 and 2.8 Å, respectively.
These distances have been found to be critical determinants of
the substrate predictions. The results in Tables 1 and 2 are based
on the default values, but increasing the distances to 2.4 and
2.9 Å for polar and nonpolar contacts, respectively, shifts
compounds 8-10 from the predicted substrate to the nonsub-
strate column. The two contact distances have also been set to
the same value, and the DOCKing of the compounds, using
both the CONCORD and SYBYL-minimized structures, has
been reexamined as a function of this single minimum contact
distance. As shown in Figure 1, if the minimal contact distance
is set at 2.9 Å, all but one of the 16 compounds are correctly
predicted to be either substrates or nonsubstrates of P450cam
.
CONCORD Versus SYBYL Generated Structures. Three-
dimensional structures of the compounds in the Available
Chemicals Directory were generated with the program
CONCORD,19c which uses a set of empirical bond length and
bond angle rules to convert two-dimensional to three-dimen-
sional structures. After DOCK analysis of a 20 000 compound
subset of the ACD, the 500 compounds with the highest contact
scores were saved and visually screened. Ten compounds (1-
9, 12) were chosen as potential substrates from the wild-type
P450cam screen, and six compounds (10, 11, 13-16) were
chosen as probable nonsubstrates from the L244A P450cam
The exceptions are compound 3, using the CONCORD struc-
tures, and compound 6, using the SYBYL structures.
The best active site model to be used for the substrate docking
studies is also of concern. The ferric, camphor-bound enzyme
after subtraction of the camphor coordinates was used for the
above studies, but docking to the ferrous, ferrous dioxy, or
putative ferryl states of the protein could also be envisaged.
However, of these enzyme forms, coordinates are only available
for the ferrous state. Fortunately, although NMR and X-ray
crystallography indicate that ferric P450BM-3 undergoes a major
conformational change when the enzyme is reduced to the
ferrous state,21 the data on P450cam indicate that its active site
is little altered by changes in the redox and ligation states of
the protein.6a,12a The ferrous-CO complex, for which a crystal
structure is available,20 can therefore be used with some
confidence as a model for the ferrous dioxy complex, although
the CO and O2 ligands differ in that the first is essentially linear
and the second bent. As shown in Figure 1, correct predictions
can be made for 15 of the 16 compounds if the minimum contact
distance for binding to the Fe2+-CO complex is set to 2.7 Å
rather than the 2.9 Å found for docking to the ferric protein.
However, from the results, there is no obvious advantage to
selecting the ferrous-CO complex over the better defined ferric
site as the docking target.
(16) (a) DesJarlais, R. L.; Seibel, G. L.; Kuntz, I. D.; Furth, P. S.; Alvarez,
J. C.; Ortiz de Montellano, P. R.; DeCamp, D. L.; Babe´, L. M.; Craik, C.
S. Proc. Natl. Acad. Sci. U.S.A. 1990, 87, 6644-6648. (b) Shoichet, B. K.;
Stroud, R. M.; Santi, D. V.; Kuntz, I. D.; Perry, K. M. Science 1993, 259,
1445-1450. (c) Ring, C. S.; Sun, E.; McKerrow, J. H.; Lee, G. K.;
Rosenthal, P. J.; Kuntz, I. D.; Cohen, F. E. Proc. Natl. Acad. Sci. U.S.A.
1993, 90, 3583-3587. (d) Bodian, D. L.; Yamasaki, R. B.; Buswell, R. L.;
Stearns, J. F.; White, J. M.; Kuntz, I. D. Biochemistry 1993, 32, 2967-
2978.
(17) (a) Richards, F. M. Annu. ReV. Biophys. Bioeng. 1977, 6, 151-
176. (b) Connolly, M. L. Science 1983, 221, 709-713.
(18) Kuntz, I. D.; Meng, E. C.; Shoichet, B. K. Acc. Chem. Res. 1994,
27, 117-123.
(19) (a) Molecular graphics visualization and molecular surface genera-
tion were carried out with the MidasPlus program from the Computer
Graphics Laboratory, University of California, San Francisco (supported
by NIH RR-01081): Ferrin, T. E.; Huang, C. C.; Jarvis, L. E.; Langridge,
R. J. Mol. Graphics 1988, 6, 13-27. (b) MDL Information Systems, Inc.,
San Leandro, CA, USA. (c) Rusinko, A.; Sheridan, R. P.; Nilakatan, R.;
Haraki, K. S.; Bauman, N.; Venkataraghavan, R. J. Chem. Inf. Comput.
Sci. 1989, 29, 251. (d) SYBYL 6.0 Tripos Inc., 1699 South Hanley Road,
St. Louis, MO 63144-2913 USA; (e) Molecular graphics images were
produced using the MidasPlus program from the Computer Graphics
Laboratory, University of California, San Francisco (supported by NIH RR-
01081): Ferrin, T. E., Huang, C. C., Jarvis, L. E., and Langridge, R., J.
Mol. Graphics 6, 13 (1988); (f) SPHGEN: Kuntz, I. D., Blaney, J. M.,
Oatley, S. L., Langridge, R., and Ferrin, T. E. (1982) J. Mol. Biol. 161,
269-288.
Binding of Compounds to P450cam
.
The binding of
compounds to P450cam was evaluated spectroscopically by
quantitating the low to high spin transition evidenced by a
decrease in the absorption maximum of the resting enzyme at
417 nm and the corresponding increase in the enzyme-substrate
(20) Raag, R.; Poulos, T. L. Biochemistry 1989, 28, 7586-7592.
(21) (a) Modi, S.; Sutcliffe, M. J.; Primrose, W. U.; Lian, L.; Roberts,
G. C. K. Nature Struct. Biol. 1996, 3, 414-417. (b) Li, H., and Poulos, T.
L. Biochimie 1996, 78, 695-699.