Abstract
Background
Amino-terminal signal peptides (SPs) are short regions that guide the targeting of secretory proteins to the correct subcellular compartments in the cell. They are cleaved off once the passenger protein reaches its destination. The explosive growth in sequencing technologies has led to the deposition of vast numbers of protein sequences, necessitating rapid functional annotation techniques, with subcellular localization being a key feature. Of the myriad software tools developed to automate the task of assigning the SP cleavage site of these new sequences, we review here the performance and reliability of commonly used SP prediction tools.

Results
The available signal peptide data have been manually curated and organized into three datasets representing eukaryotes, Gram-positive bacteria and Gram-negative bacteria. These datasets are used to evaluate thirteen publicly available prediction tools. SignalP (both the HMM and ANN versions) maintains consistency and achieves the best overall accuracy in all three benchmarking experiments, ranging from 0.872 to 0.914, although other prediction tools are narrowing the performance gap.

Conclusion
The majority of the tools evaluated in this study encounter no difficulty in discriminating between secretory and non-secretory proteins. The challenge clearly remains in pinpointing the correct SP cleavage site. The composite scoring schemes employed by SignalP may help to explain its accuracy: the prediction task is divided into a number of separate steps, thus allowing each score to tackle a particular aspect of the prediction.
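The distinction drawn above between discriminating secretory from non-secretory proteins and pinpointing the cleavage site can be illustrated with a minimal sketch. It is not the benchmarking procedure used in the study: the function name `evaluate`, the data layout (a cleavage position, or `None` for a non-secretory protein) and the choice of plain fraction-correct as the score are all assumptions made for illustration, and the example values are invented.

```python
from typing import List, Optional, Tuple


def evaluate(truth: List[Optional[int]],
             predicted: List[Optional[int]]) -> Tuple[float, float]:
    """Return (discrimination accuracy, cleavage-site accuracy).

    Hypothetical layout: each entry is the cleavage position of a
    secretory protein, or None if the protein has no signal peptide.
    """
    if len(truth) != len(predicted) or not truth:
        raise ValueError("inputs must be non-empty and of equal length")

    pairs = list(zip(truth, predicted))

    # Discrimination: did the tool agree on SP present vs. absent?
    disc_correct = sum((t is None) == (p is None) for t, p in pairs)

    # Cleavage site: among true secretory proteins, was the exact
    # position predicted? A missed SP (None) counts as wrong.
    secretory = [(t, p) for t, p in pairs if t is not None]
    site_correct = sum(t == p for t, p in secretory)

    disc_acc = disc_correct / len(pairs)
    site_acc = site_correct / len(secretory) if secretory else float("nan")
    return disc_acc, site_acc


# Invented example: every protein is classified correctly, but one
# cleavage site is off by a single residue.
truth = [22, 18, None, 25, None]
predicted = [22, 19, None, 25, None]
disc, site = evaluate(truth, predicted)
```

In this toy case the discrimination score is perfect while the cleavage-site score is lower, which mirrors the qualitative point of the conclusion: a tool can separate the two classes reliably and still misplace the cleavage site.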
Publication Info
- Year: 2009
- Type: article
- Volume: 10
- Issue: S15
- Pages: S2-S2
- Citations: 70
- Access: Closed
Identifiers
- DOI: 10.1186/1471-2105-10-s15-s2
- PMID: 19958512
- PMCID: PMC2788353