Abstract
Abstract Motivation: Subcellular localization is a key functional characteristic of proteins. A fully automatic and reliable prediction system for protein subcellular localization is needed, especially for the analysis of large-scale genome sequences. Results: In this paper, Support Vector Machine has been introduced to predict the subcellular localization of proteins from their amino acid compositions. The total prediction accuracies reach 91.4% for three subcellular locations in prokaryotic organisms and 79.4% for four locations in eukaryotic organisms. Predictions by our approach are robust to errors in the protein N-terminal sequences. This new approach provides superior prediction performance compared with existing algorithms based on amino acid composition and can be a complementary method to other existing methods based on sorting signals. Availability: A web server implementing the prediction method is available at http://www.bioinfo.tsinghua.edu.cn/SubLoc/. Contact: sunzhr@mail.tsinghua.edu.cn; huasj00@mails.tsinghua.edu.cn Supplementary information: Supplementary material is available at http://www.bioinfo.tsinghua.edu.cn/SubLoc/. * To whom correspondence should be addressed.
Keywords
Affiliated Institutions
Related Publications
Using neural networks for prediction of the subcellular location of proteins
Neural networks have been trained to predict the subcellular location of proteins in prokaryotic or eukaryotic cells from their amino acid composition. For three possible subcel...
PSORT-B: improving protein subcellular localization prediction for Gram-negative bacteria
Automated prediction of bacterial protein subcellular localization is an important tool for genome annotation and drug discovery. PSORT has been one of the most widely used comp...
Multiple parameter cross‐species protein identification using MultiIdent ‐ a world‐wide web accessible tool
Abstract Recent increases in the number of genome sequencing projects means that the amount of protein sequence in databases is increasing at an astonishing pace. In proteome st...
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier
Abstract Motivation A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional ...
Signal-3L: A 3-layer approach for predicting signal peptides
Functioning as an "address tag" that directs nascent proteins to their proper cellular and extracellular locations, signal peptides have become a crucial tool in finding new dru...
Publication Info
- Year
- 2001
- Type
- article
- Volume
- 17
- Issue
- 8
- Pages
- 721-728
- Citations
- 856
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1093/bioinformatics/17.8.721