Prediction of protein cellular attributes using pseudo‐amino acid composition

Kuo‐Chen Chou

doi:10.1002/prot.1035

Abstract

Abstract The cellular attributes of a protein, such as which compartment of a cell it belongs to and how it is associated with the lipid bilayer of an organelle, are closely correlated with its biological functions. The success of human genome project and the rapid increase in the number of protein sequences entering into data bank have stimulated a challenging frontier: How to develop a fast and accurate method to predict the cellular attributes of a protein based on its amino acid sequence? The existing algorithms for predicting these attributes were all based on the amino acid composition in which no sequence order effect was taken into account. To improve the prediction quality, it is necessary to incorporate such an effect. However, the number of possible patterns for protein sequences is extremely large, which has posed a formidable difficulty for realizing this goal. To deal with such a difficulty, the pseudo‐amino acid composition is introduced. It is a combination of a set of discrete sequence correlation factors and the 20 components of the conventional amino acid composition. A remarkable improvement in prediction quality has been observed by using the pseudo‐amino acid composition. The success rates of prediction thus obtained are so far the highest for the same classification schemes and same data sets. It has not escaped from our notice that the concept of pseudo‐amino acid composition as well as its mathematical framework and biochemical implication may also have a notable impact on improving the prediction quality of other protein features. Proteins 2001;43:246–255. © 2001 Wiley‐Liss, Inc.

Keywords

Pseudo amino acid compositionAmino acidProtein sequencingSequence (biology)Computational biologyComposition (language)Computer sciencePeptide sequenceBiochemistryBiological systemBiologyAlgorithmGene

Related Publications

Signal-3L: A 3-layer approach for predicting signal peptides

Hong‐Bin Shen , Kuo‐Chen Chou

Functioning as an "address tag" that directs nascent proteins to their proper cellular and extracellular locations, signal peptides have become a crucial tool in finding new dru...

2007 Biochemical and Biophysical Research ... 237 citations

Defining a similarity threshold for a functional protein sequence pattern: The signal peptide cleavage site

Henrik Nielsen , Jacob Engelbrecht , Gunnar von Heijne +1 more

When preparing data sets of amino acid or nucleotide sequences it is necessary to exclude redundant or homologous sequences in order to avoid overestimating the predictive perfo...

1996 Proteins Structure Function and Bioin... 89 citations

Multiple parameter cross‐species protein identification using MultiIdent ‐ a world‐wide web accessible tool

Marc R. Wilkins , Elisabeth Gasteiger , Colin Wheeler +6 more

Abstract Recent increases in the number of genome sequencing projects means that the amount of protein sequence in databases is increasing at an astonishing pace. In proteome st...

1998 Electrophoresis 53 citations

Support vector machine approach for protein subcellular localization prediction

Sujun Hua , Zhirong Sun

Abstract Motivation: Subcellular localization is a key functional characteristic of proteins. A fully automatic and reliable prediction system for protein subcellular localizati...

2001 Bioinformatics 856 citations

The HDOCK server for integrated protein–protein docking

Yumeng Yan , Huanyu Tao , Jiahua He +1 more

The HDOCK server (http://hdock.phys.hust.edu.cn/) is a highly integrated suite of homology search, template-based modeling, structure prediction, macromolecular docking, biologi...

2020 Nature Protocols 1546 citations

Publication Info

Year: 2001
Type: article
Volume: 43
Issue: 3
Pages: 246-255
Citations: 1926
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Prediction of protein cellular attributes using pseudo‐amino acid composition

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1926

OpenAlex

Cite This

APA Style

                            
                                    Kuo‐Chen Chou
                                
                            (2001). 
                            Prediction of protein cellular attributes using pseudo‐amino acid composition. 
                            Proteins Structure Function and Bioinformatics
                            , 43
                            (3)
                            , 246-255.
                            https://doi.org/10.1002/prot.1035

Identifiers

DOI: 10.1002/prot.1035