Evaluation Only. Created with Aspose.PDF. Copyright 2002-2021 Aspose Pty Ltd.
CD2-LIKE YEAST CELL ADHESION PROTEIN
107
with kinesins 15 , and yeast Mata2p with homeodomain conserved in multiple alignments of a-agglutinin and the
(
)
(
)
proteins such as engrailed 16 . In fact, CD2 was identi ed as C. albicans Als epithelial adhesion proteins with the CD2/CD4
(
)
a member of the Ig superfamily only after determination of its alignment of Fig. 2 4; data not shown . The “Bioinbgu” thread-
(
)
structure by crystallography and nuclear magnetic resonance ing program also identi ed CD2 Protein Database le 1HNF as
(
)
17, 18 .
the most likely homolog of a-agglutinin, with a Z 7.1 com-
=
(
Potential reference structures were identi ed by BLAST and pared with Z 3.8 for the next best match. This match was
=
JPRED and a search of the Protein Data Bank for b-rich struc- found by a referee for this manuscript; the program is avail-
(
)
)
«
(
( )
family structures aligned within conserved sequence motifs see erated by Bioinbgu similar to one previously discussed 5 and
–
)
below .
is identical to that in Fig. 2 in strands A C of domain I, although
For a-agglutinin, matches to the Ig family were found by a it diverges thereafter. These matches con rm that a-agglutinin
visual sequence motif comparison, which had originally iden- has CD2-like sequence characteristics.
ti ed sequences around the Cys residues as being similar to
The match scores for the alignment shown in Fig. 2 are much
(
)
those in Ig-like proteins 1, 5, 8 . Of the six Cys residues in better than those for matches to a previous IgV consensus, which
–
the binding region of a-agglutinin, ve are in sequences of 6 8 showed only 50% and 33% identity for domains I and II, respec-
–
( )
residues characteristic of the Cys residue regions in Ig proteins. tively 5 . Therefore, the alignment of residues 20 200 of a-
Because several such similarities were available, we could then agglutinin to the published structure-based sequence alignment
(
)
search for other critical structural motifs, such as the conserved of CD2 and CD4 17 was the basis for modeling a-agglutinin
–
Trp residues and the Asp Arg ion pairs that stabilize many Ig domains I and II.
(
)
domain cores 7, 20, 21 .
Construction of the Model
The model was constructed by using the Homology features
Alignment
(
)
of the program QUANTA Insight/Quanta, San Diego, CA . The
We generated alternative alignments of the a-agglutinin do-
mains I and II regions to Ig consensus sequences, both by hand
and by using several pairwise and multiple alignment algo-
rithms. The most effectivemethod was hand-generated; italigned
conserved consensus residues in regions of maximum b-sheet
potential, allowing gaps only in loop and turn regions 8 .
With this approach, a-agglutinin residues 20 120 domain I
were matched to an Ig Variable Domain IgV consensus in
22 of 24 positions with a single one-residue gap data not
main chain atoms were assigned the coordinates of the corre-
(
)
sponding atoms of CD2 Protein Data Bank le 1HNF in those
regions with no gaps or only small gaps. These residues included
all of the b strands, as well as domain I loops AB, BC, CC0,
C0C00, DE, and EF, and domain II loops BC, CC0, EF, and FG.
Then side chains were built using the QUANTA AMINO.RTF
library. Each b-strand region was then “regularized,” to repair
single residue gaps or insertions and to correct con guration of
the backbone.
( )
–
(
)
(
)
(
)
shown; 20 .
An independent analysis of Ig variable-like domains of low
sequence similarity de ned a V-frame pro le that identi es a
Separatesearches were performed tobuild the coordinates for
residues of the loops C00D and FG from domain I and AB from
(
)
small number of key sites 21, 22 ; residues that occupy these
positions have crucial structuralroles andarehighly conserved in
Table 1
a
–
all IgV structures. Again, we were able to align residues 20 120
(
)
Sequence identities and similarities %
of a-agglutinin to such a pro le, based on the consensus and b-
strand probability maxima in 15 of 18 positions for a-agglutinin
Core
CD2 human CD4 human consensusb
(
)
(
)
(
)
domain I Fig. 2 and Table 1 . Randomized sequences of a-
–
agglutinin residues 20 122 could not be t to the IgV consensus
First domains
(
)
(
)
20 , nor to the V-frame pro le 22 , nor to a secondary structure
(
—
)
(
)
)
(
)
a-Agglutinin
16 40
16 34
83 88
(
)
prediction pro le 5, 8 .
(
)
(
(
94 100
)
)
CD2 human
15 44
Matches of the a-agglutinin sequence were better with CD2
or CD4 as the reference protein than with other members of the
Ig superfamily. The CD2/CD4 subfamily consists of proteins
with tandem Ig-like domains. CD2, has a V-like domain with
nine0 b strands in two sheets consisting of strands ABED and
Second domains
(
)
)
(
)
)
)
(
a-Agglutinin
16 44
16 46
92 100
(
)
(
CD2 human
—
26 45
100
c
(
(
(
)
Third domain,
14 33
14 33
74 79
a-agglutinin
00
(
)
C C CFG, respectively 22 . The second domain is of the trun-
cated 7 strand S-type, with b-sheets composed of strands ABE
and C0CFG, respectively CD4 has four Ig domains in a V-S-
V-S tandem array. The pro le-based match for domain I was
extended into a match for domain II of 11 of 12 key posi-
tions for an alignment of domain II with truncated S-type do-
a
(
)
The percent identity and similarity in parentheses at aligned posi-
(
)
tions isshown for the alignmentsof Fig. 2 bottom panel . The similarity
setswere: AFILMPVWY, DEKR, ND, NQ, EQ, KRH, ST, FHWY, and
AG.
b
(
)
Based on conserved consensus residues bolded and named in
(
)
Fig. 2 bottom panel .
cSee reference 8.
(
)
mains Fig. 2 and Table 1 . Consensus residues were similarly