|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 1:139-147, 2002.
© 2002 by The American Society for Biochemistry and Molecular Biology, Inc.
,


Department of Microbiology, University of Virginia, Charlottesville, Virginia 22908
** Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22908
¶ Department of Pharmacology, Duke University, Durham, North Carolina 27710
We describe two novel sequence similarity search algorithms, FASTS and FASTF, that use multiple short peptide sequences to identify homologous sequences in protein or DNA databases. FASTS searches with peptide sequences of unknown order, as obtained by mass spectrometry-based sequencing, evaluating all possible arrangements of the peptides. FASTF searches with mixed peptide sequences, as generated by Edman sequencing of unseparated mixtures of peptides. FASTF deconvolutes the mixture, using a greedy heuristic that allows rapid identification of high scoring alignments while reducing the total number of explored alternatives. Both algorithms use the heuristic FASTA comparison strategy to accelerate the search but use alignment probability, rather than similarity score, as the criterion for alignment optimality. Statistical estimates are calculated using an empirical correction to a theoretical probability. These calculated estimates were accurate within a factor of 10 for FASTS and 1000 for FASTF on our test dataset. FASTS requires only 1520 total residues in three or four peptides to robustly identify homologues sharing 50% or greater protein sequence identity. FASTF requires about 25% more sequence data than FASTS for equivalent sensitivity, but additional sequence data are usually available from mixed Edman experiments. Thus, both algorithms can identify homologues that diverged 100 to 500 million years ago, allowing proteomic identification from organisms whose genomes have not been sequenced.

Supported in part by Grant LM04969 from the National Library of Medicine, with additional support from the Compaq Computer Corporation. To whom correspondence should be addressed. Tel.: 434-924-2818; Fax: 434-924-5069; E-mail: wrp{at}virginia.edu.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
I. J. Tetlow, K. G. Beisel, S. Cameron, A. Makhmoudova, F. Liu, N. S. Bresolin, R. Wait, M. K. Morell, and M. J. Emes Analysis of Protein Complexes in Wheat Amyloplasts Reveals Functional Interactions among Starch Biosynthetic Enzymes Plant Physiology, April 1, 2008; 146(4): 1878 - 1891. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. K. Iwai, M. Yoshida, A. Sadahiro, W. R. da Silva, M. L. Marin, A. C. Goldberg, M. A. Juliano, L. Juliano, M. A. Shikanai-Yasuda, J. Kalil, et al. T-Cell Recognition of Paracoccidioides brasiliensis gp43-Derived Peptides in Patients with Paracoccidioidomycosis and Healthy Individuals Clin. Vaccine Immunol., April 1, 2007; 14(4): 474 - 476. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Sandra, P. Dolashka-Angelova, B. Devreese, and J. Van Beeumen New insights in Rapana venosa hemocyanin N-glycosylation resulting from on-line mass spectrometric analyses Glycobiology, February 1, 2007; 17(2): 141 - 156. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Luo, E. Nieves, J. Kzhyshkowska, and R. H. Angeletti Endogenous Transforming Growth Factor-{beta} Receptor-mediated Smad Signaling Complexes Analyzed by Mass Spectrometry Mol. Cell. Proteomics, July 1, 2006; 5(7): 1245 - 1260. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. I. Orsborn, L. F. Shubitz, T. Peng, E. M. Kellner, M. J. Orbach, P. A. Haynes, and J. N. Galgiani Protein Expression Profiling of Coccidioides posadasii by Two-Dimensional Differential In-Gel Electrophoresis and Evaluation of a Newly Recognized Peroxisomal Matrix Protein as a Recombinant Vaccine Candidate Infect. Immun., March 1, 2006; 74(3): 1865 - 1872. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Kremer, T. Haystead, and I. G. Macara Mammalian Septins Regulate Microtubule Stability through Interaction with the Microtubule-binding Protein MAP4 Mol. Biol. Cell, October 1, 2005; 16(10): 4648 - 4659. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. D. Halligan, V. Ruotti, S. N. Twigger, and A. S. Greene DeNovoID: a web-based tool for identifying peptides from sequence and mass tags deduced from de novo peptide sequencing by mass spectroscopy Nucleic Acids Res., July 1, 2005; 33(suppl_2): W376 - W381. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Vergote, P.-E. Sautiere, F. Vandenbulcke, D. Vieau, G. Mitta, E. R. Macagno, and M. Salzet Up-regulation of Neurohemerythrin Expression in the Central Nervous System of the Medicinal Leech, Hirudo medicinalis, following Septic Injury J. Biol. Chem., October 15, 2004; 279(42): 43828 - 43837. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M. Baxter, J. S. Rosenblum, S. Knutson, M. R. Nelson, J. S. Montimurro, J. A. Di Gennaro, J. A. Speir, J. J. Burbaum, and J. S. Fetrow Synergistic Computational and Experimental Proteomics Approaches for More Accurate Detection of Active Serine Hydrolases in Yeast Mol. Cell. Proteomics, March 1, 2004; 3(3): 209 - 225. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Habermann, J. Oegema, S. Sunyaev, and A. Shevchenko The Power and the Limitations of Cross-Species Protein Identification by Mass Spectrometry-driven Sequence Similarity Searches Mol. Cell. Proteomics, March 1, 2004; 3(3): 238 - 249. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. J. Tetlow, R. Wait, Z. Lu, R. Akkasaeng, C. G. Bowsher, S. Esposito, B. Kosar-Hashemi, M. K. Morell, and M. J. Emes Protein Phosphorylation in Amyloplasts Regulates Starch Branching Enzyme Activity and Protein-Protein Interactions PLANT CELL, March 1, 2004; 16(3): 694 - 708. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. E. Corcoran, J. D. Joseph, J. A. MacDonald, C. D. Kane, T. A. J. Haystead, and A. R. Means Proteomic Analysis of Calcium/Calmodulin-dependent Protein Kinase I and IV in Vitro Substrates Reveals Distinct Catalytic Preferences J. Biol. Chem., March 14, 2003; 278(12): 10516 - 10522. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R. Graves and T. A.J. Haystead A Functional Proteomics Approach to Signal Transduction Recent Prog. Horm. Res., January 1, 2003; 58(1): 1 - 24. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R. Graves, J. J. Kwiek, P. Fadden, R. Ray, K. Hardeman, A. M. Coley, M. Foley, and T. A. J. Haystead Discovery of Novel Targets of Quinoline Drugs in the Human Purine Binding Proteome Mol. Pharmacol., December 1, 2002; 62(6): 1364 - 1372. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |