A R T I C L E S
Pentelute et al.
6
Scheme 1. Predicted 81-Residue Glycine-Rich Amino Acid Sequence of sfAFP
1
0
18,19
j
molecules up to 200 or more amino acid residues in size.
in space group P1 were obtained.
These pioneering studies
Recent refinements have made total chemical synthesis an
increasingly practical way of obtaining useful amounts of protein
showed that it is possible to obtain centrosymmetric crystals
from a racemic protein mixture; however, in neither case was
crystallization of the natural L-protein limiting, so the studies
did not speak to whether or not racemic crystallization could
be used to facilitate obtaining crystals from a difficult-to-
crystallize protein target.
1
1–14
for study.
In an accompanying paper, we report the total
chemical synthesis of the mirror image forms of the snow flea
1
5
antifreeze protein: L-sfAFP and D-sfAFP; the antifreeze
activities of both of these folded, synthetic proteins were verified
15
16–22
by ice recrystallization inhibition assays. With a ready supply
of sfAFP to hand, we set out to determine the experimental
structure of sfAFP using racemic protein crystallography, a
technique that is made possible only by using chemical protein
synthesis.
In this paper we report the use of racemic crystallization
to obtain high-quality crystals of the protein sfAFP for X-ray
diffraction studies. Crystal formation occurred much more
readily from a racemic mixture of D-sfAFP and L-sfAFP than
with the L-protein alone. More facile crystal formation also
occurred from a quasi-racemic mixture of D-sfAFP and L-Se-
sfAFP, a chemical protein analogue that contains an additional
-SeCH2- moiety at one residue and thus differs slightly from
the true enantiomer. Multiple wavelength anomalous dispersion
(MAD) phasing of diffraction data from a quasi-racemate crystal
was then used to solve the X-ray structure of the sfAFP protein
molecule. The resulting model was used to solve by molecular
replacement the structure of L-sfAFP to a resolution of 0.98 Å.
The X-ray structure of the true racemate made up of the protein
While X-ray structures have been determined for many
thousands of distinct protein molecules, it is not infrequently
observed that a particular protein may be difficult to crystallize
or may form crystals unsuitable for the collection of single-
crystal X-ray diffraction data. This has become one of the
principal obstacles to the high throughput determination of
protein X-ray structures. As will be described below, we
encountered severe difficulties in the crystallization of sfAFP,
which we were able to overcome as follows.
j
Proteins found in nature (i.e., proteins made up of L-amino
acids and glycine) are inherently chiral entities and can therefore
crystallize only in one of the 65 “biological” space groups that
lack an inversion center or mirror plane, out of the total of 230
unique space groups that describe all possible crystal sym-
enantiomers L-sfAFP and D-sfAFP in space group P1 was also
analyzed.
Results and Discussion
Crystallization of L-sfAFP. Using sfAFP prepared by total
1
6
15
metries. The frequency of protein crystallization in different
space groups is markedly nonuniform, with most protein crystals
belonging to one of just a few space groups. P212121 is by far
the most commonly observed space group for globular protein
molecules. Yeates has explained this preference for certain space
groups based on an entropic model: “the favored space groups
are simply less restrictive than others... they allow the molecules
more rigid-body degrees of freedom and can therefore be
chemical synthesis, extensive crystallization trials were carried
out over a period of six months with L-sfAFP alone, varying
temperature, protein concentration, and the concentration of
(NH4)2SO4 or other precipitants. Crystals were eventually
1
6
realized in a greater number of ways.” This same model
predicts that, for globular proteins, a racemic protein mixture
will crystallize more readily. This is so because the space group
j
most favored for globular proteins on theoretical grounds, P1,
contains inversion symmetry and is thus not accessible to natural
16,17
j
L-proteins.
However, the P1 space group would in principle
be accessible to the achiral entity made up of a D-protein and
the corresponding L-protein as a molecular pair. The more facile
crystallization of racemic protein mixtures has never been
experimentally verified. Only two proteins have been crystallized
as racemates; in both cases centrosymmetric racemic crystals
(
(
10) Dawson, P. E.; Kent, S. B. Annu. ReV. Biochem. 2000, 69, 923–960.
11) Bang, D.; Kent, S. B. Angew. Chem., Int. Ed. Engl. 2004, 43, 2534–
Figure 1. A direct comparison of the results for attempted crystallization
of the racemic protein mixture [D-sfAFP + L-sfAFP] or the single
enantiomer L-sfAFP under three sets of conditions. Protein crystals appeared
rapidly from a solution containing a racemic mixture of D-sfAFP and
L-sfAFP, but not for L-sfAFP alone, when screened under a standard set of
crystallization conditions. Representative examples are shown in this Figure.
X-ray diffraction was used to verify that the crystals were in fact protein.
The crystallization conditions shown are a subset of the main findings
reported here and are as follows: (a) 0.1 M bicine pH ) 9.0, 2% v/v 1,4-
dioxane, 10% w/v polyethylene glycol 20,000; (b) 0.1 M bis-Tris pH )
6.5, 1.15 M ammonium sulfate; (c) 0.1 M Tris pH ) 8.5, 25% w/v
polyethylene glycol 3350. All crystallization trials were carried out at room
temperature (∼23 °C).
2
538.
(
(
12) Johnson, E. C.; Kent, S. B. J. Am. Chem. Soc. 2006, 128, 6640–6646.
13) Durek, T.; Torbeev, V. Y.; Kent, S. B. Proc. Natl. Acad. Sci. U.S.A.
2
007, 104, 4846.
(
(
(
(
14) Torbeev, V. Y.; Kent, S. B. Angew. Chem., Int. Ed. 2007, 46 (10),
1
667–1670.
15) Pentelute, B. L.; Gates, Z. P.; Dashnau, J. L.; Vanderkooi, J. M.; Kent,
S. B. H. J. Am. Chem. Soc. 2008, 130, 9702–9707.
16) Wukovitz, S. W.; Yeates, T. O. Nat. Struct. Biol. 1995, 2 (12), 1062–
1
067.
17) Pellegrini, M.; Wukovitz, S. W.; Yeates, T. O. Proteins 1997, 28,
15.
5
9
696 J. AM. CHEM. SOC. 9 VOL. 130, NO. 30, 2008