Abstract

Abstract Motivation: Subcellular localization is a key functional characteristic of proteins. A fully automatic and reliable prediction system for protein subcellular localization is needed, especially for the analysis of large-scale genome sequences. Results: In this paper, Support Vector Machine has been introduced to predict the subcellular localization of proteins from their amino acid compositions. The total prediction accuracies reach 91.4% for three subcellular locations in prokaryotic organisms and 79.4% for four locations in eukaryotic organisms. Predictions by our approach are robust to errors in the protein N-terminal sequences. This new approach provides superior prediction performance compared with existing algorithms based on amino acid composition and can be a complementary method to other existing methods based on sorting signals. Availability: A web server implementing the prediction method is available at http://www.bioinfo.tsinghua.edu.cn/SubLoc/. Contact: sunzhr@mail.tsinghua.edu.cn; huasj00@mails.tsinghua.edu.cn Supplementary information: Supplementary material is available at http://www.bioinfo.tsinghua.edu.cn/SubLoc/. * To whom correspondence should be addressed.

Keywords

Subcellular localizationSupport vector machineProtein subcellular localization predictionSortingComputer scienceProtein Sorting SignalsProtein targetingArtificial intelligencePseudo amino acid compositionComputational biologyData miningBiologyAlgorithmPeptide sequenceGeneBiochemistrySignal peptideMembrane protein

Affiliated Institutions

Related Publications

Publication Info

Year
2001
Type
article
Volume
17
Issue
8
Pages
721-728
Citations
856
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

856
OpenAlex

Cite This

Sujun Hua, Zhirong Sun (2001). Support vector machine approach for protein subcellular localization prediction. Bioinformatics , 17 (8) , 721-728. https://doi.org/10.1093/bioinformatics/17.8.721

Identifiers

DOI
10.1093/bioinformatics/17.8.721