postsource decay13 and collision-induced dissociation14 (CID)
produce only y-type ions. The absence of other fragment ion types
facilitates de novo sequence interpretation. Although the presence
of proline residues sometimes limits sequence coverage, this
technique generally leads to highly informative fragmentation
spectra. A drawback of this approach is decreased sensitivity in
positive ion mode experiments due to the presence of a permanent
negative charge. Gaskell and co-workers demonstrated that
labeling peptide N-termini with phenyl isothiocyanate enables gas-
phase Edman degradation.15 They observed enhanced cleavage
of the N-terminal peptide bond leading to the predominant
formation of complementary b1 and yn-1 fragment ion pairs in CID
experiments. Although this enables identification of N-terminal
residues, no other sequence ions are typically formed due to the
high efficiency of this fragmentation pathway. Despite this
limitation, they demonstrated that, with high mass accuracy FT-
ICR-MS, the N-terminal residue identity and precursor ion mass
were sufficient constraints to enable database matching.16 Of
course, this would not be sufficient to identify PTM sites, genetic
mutations or account for database errors.
A number of groups have taken advantage of isotopic labeling
that, in addition to serving as mass-coded tags for relative
quantitation, have also provided mass signatures for N- and
C-terminal fragment ions and facilitated de novo sequencing. For
example, proteolytic 18O/16O labeling has been utilized to code
peptide C-termini during the course of tryptic digestion.17-19 As a
result, y-type fragment ions were formed as mass-separated pairs
that could easily be distinguished from b-ions. Alternatively,
several researchers have implemented deuterium labeling strat-
egies.20-22 These methods typically involve postdigestion chemical
derivatizations. For example, Goodlett and co-workers esterified
acidic groups with either D0- or D3-methanol to facilitate the
differentiation of C- and N-terminal ion-types.21 Similarly, James
and co-workers modified peptide N-termini with D0-/D4-nicotinyl-
N-hydroxysuccininmide to identify N-terminal fragment ions in
complementary MS/MS spectra.20 While this approach to de novo
sequencing is globally applicable, the derivatization chemistry
leads to unwanted side reactions with lysine and tyrosine residues
that complicate data. As an alternative to most deuterium-based
labeling techniques, Chen and co-workers devised an in vivo D4-/
D0-lysine incorporation strategy that takes advantage of the
specificity of tryptic proteolysis to generate C-terminally mass-
coded peptides.22 This approach provides a means of avoiding wet
chemistry steps following tryptic digestion. However, it is not of
general utility since the introduction of an isotopic label in vivo is
not always feasible. Also, tryptic peptides terminated with arginine
do not contain the isotopic label. Although it does not involve
isotopic labeling, the mass-coded abundance tagging technique
developed by Cagney and Emili also employs a mass signature
at the C-termini of tryptic peptides.23 In this approach, lysine
residues are converted to homoarginines using O-methylisourea
and are thus mass separated from their unlabeled counterparts.
Therefore, the C-termini of lysine-containing tryptic peptides are
shifted by 42 Da and the C-terminal fragment ions can be
distinguished from N-terminal ones. Much like the approach
involving in vivo incorporation of D4-/D0-lysine, this technique is
not global, as arginine-containing peptides are not labeled.
Similarly, Brancia and co-workers recently presented a strategy
that employed 15N- and 13C-coded guanidine tags to impart a 3-Da
mass shift on lysine residues and facilitate sequence interpreta-
tion.24
We now present a derivatization strategy that utilizes both
guanidination25-27 and amidination28,29 to assist peptide sequencing.
This approach facilitates identification of N- and C-terminal
fragment ions by labeling N-termini with amidine moieties that
differ by a methylene group (i.e., 14 u). Lysine residues are
converted to homoarginines to prevent amidination of the side-
chain ꢀ-amino groups. The simple and efficient reactions are
inexpensive and can be completed rapidly with minimal side
reactions.28,30 This is a global approach to protein identifications
since peptide N-termini are mass-coded. The only peptides
excluded from this labeling are those whose N-terminus is blocked
(e.g., by acetylation of the protein N-terminus). Furthermore, the
amidine groups promote specific fragmentation pathways29 that
facilitate de novo sequencing by providing sequence information
that is often absent and internal calibrants that can dramatically
improve mass accuracy. We demonstrate this sequencing ap-
proach using the tryptic peptides of several standard proteins.
EXPERIMENTAL SECTION
Materials. Hemoglobin (human), R-casein (bovine), serum
albumin (bovine, BSA), and TPCK-treated trypsin (bovine) were
obtained from Sigma (St. Louis, MO). Tris(hydroxymethyl)-
aminomethane (Trizma base), S-methylisothiourea hemisulfate,
and ammonium hydroxide were also supplied by Sigma. Aceto-
nitrile and trifluoroacetic acid (TFA) were purchased from EM
Science (Gibbstown, NJ). Thiopropionamide was supplied by TCI
America (Portland, OR). Anhydrous diethyl ether, thioacetamide,
and ammonium bicarbonate were purchased from Fisher (Fair
Lawn, NJ). Iodomethane, iodoacetamide, dithiothreitol (DTT),
poly(propylene glycol), and formic acid were obtained from
(14) Keough, T.; Lacey, M. P.; Fieno, A. M.; Grant, R. A.; Sun, Y. P.; Bauer, M.
D.; Begley, K. B. Electrophoresis 2000, 21, 2252-2265.
(15) Summerfield, S. G.; Steen, H.; O’malley, M.; Gaskell, S. J. Int. J. Mass
Spectrom. 1999, 188, 95-103.
(16) Van Der Rest, G.; He, F.; Emmett, M. R.; Marshall, A. G.; Gaskell, S. J. J.
Am. Soc. Mass Spectrom. 2001, 12, 288-295.
(17) Shevchenko, A.; Chernushevich, I.; Ens, W.; Standing, K. G.; Thomson, B.;
Wilm, M.; Mann, M. Rapid Commun. Mass Spectrom. 1997, 11, 1015-
1024.
(18) Qin, J.; Herring, C. J.; Zhang, X. L. Rapid Commun. Mass Spectrom. 1998,
12, 209-216.
(23) Cagney, G.; Emili, A. Nat. Biotechnol. 2002, 20, 163-170.
(24) Brancia, F. L.; Montgomery, H.; Tanaka, K.; Kumashiro, S. Anal. Chem.
2004, 76, 2748-2755.
(25) Beardsley, R. L.; Karty, J. A.; Reilly, J. P. Rapid Commun. Mass Spectrom.
(19) Uttenweiler-Joseph, S.; Neubauer, G.; Christoforidis, A.; Zerial, M.; Wilm,
2000, 14, 2147-2153.
M. Proteomics 2001, 1, 668-682.
(20) Munchbach, M.; Quadroni, M.; Miotto, G.; James, P. Anal. Chem. 2000,
72, 4047-4057.
(26) Brancia, F. L.; Oliver, S. G.; Gaskell, S. J. Rapid Commun. Mass Spectrom.
2000, 14, 2070-2073.
(27) Hale, J. E.; Butler, J. P.; Knierman, M. D.; Becker, G. W. Anal. Biochem.
(21) Goodlett, D. R.; Keller, A.; Watts, J. D.; Newitt, R.; Yi, E. C.; Purvine, S.;
Eng, J. K.; Von Haller, P.; Aebersold, R.; Kolker, E. Rapid Commun. Mass
Spectrom. 2001, 15, 1214-1221.
(22) Gu, S.; Pan, S. Q.; Bradbury, E. M.; Chen, X. Anal. Chem. 2002, 74, 5774-
5785.
2000, 287, 110-117.
(28) Beardsley, R. L.; Reilly, J. P. J. Proteome Res. 2003, 2, 15-21.
(29) Beardsley, R. L.; Reilly, J. P. J. Am. Soc. Mass Spectrom. 2004, 15, 158-
167.
(30) Beardsley, R. L.; Reilly, J. P. Anal. Chem. 2002, 74, 1884-1890.
Analytical Chemistry, Vol. 77, No. 19, October 1, 2005 6301