Abstract
A powerful technique for peptide and protein identification is tandem mass spectrometry followed by database search using a program such as SEQUEST or Mascot. These programs, however, become slow and lose sensitivity when allowing nonspecific cleavages or peptide modifications. De novo sequencing and hybrid methods such as sequence tagging offer speed and robustness for wider searches, yet these approaches require better spectra with more complete and consecutive fragmentation and, hence, are less sensitive to low-abundance peptides. Here we describe a new hybrid method that retains the sensitivity of pure database search. The method uses a small amount of de novo analysis to identify likely b- and y-ion peaks--"lookup peaks"--that can then be used to extract candidate peptides from the database, with the number of candidates tunable to fit a computing budget. We describe a program called ByOnic that implements this method, and we benchmark ByOnic on several data sets, including one of mouse blood plasma spiked with low concentrations of recombinant human proteins. We demonstrate that ByOnic is more sensitive than sequence tagging and, indeed, more sensitive than the three most popular pure database search tools--SEQUEST, Mascot, and X!Tandem--on both the peptide and protein levels. On the mouse plasma samples, ByOnic consistently found spiked proteins missed by the other tools.
Keywords
Affiliated Institutions
Related Publications
De novo assembly of human genomes with massively parallel short read sequencing
Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length...
Genome sequencing in microfabricated high-density picolitre reactors
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly...
Lipidomic and proteomic analysis of exosomes from mouse cortical collecting duct cells
ABSTRACT Exosomes are endosome‐derived nanovesicles that are involved in cellular communication and signaling. Exosomes are produced by epithelial cells and are found in biologi...
The Coronavirus Spike Protein Is a Class I Virus Fusion Protein: Structural and Functional Characterization of the Fusion Core Complex
ABSTRACT Coronavirus entry is mediated by the viral spike (S) glycoprotein. The 180-kDa oligomeric S protein of the murine coronavirus mouse hepatitis virus strain A59 is posttr...
UCHIME improves sensitivity and speed of chimera detection
Abstract Motivation: Chimeric DNA sequences often form during polymerase chain reaction amplification, especially when sequencing single regions (e.g. 16S rRNA or fungal Interna...
Publication Info
- Year
- 2007
- Type
- article
- Volume
- 79
- Issue
- 4
- Pages
- 1393-1400
- Citations
- 213
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1021/ac0617013