Abstract

Recent progress in genomic sequencing, computational biology, and ontology development has presented an opportunity to investigate biological systems from a unique perspective, that is, examining genomes and transcriptomes through the multiple and hierarchical structure of Gene Ontology (GO). We report here our development of GO Engine, a computational platform for GO annotation, and analysis of the resultant GO annotations of human proteins. Protein annotation was centered on sequence homology with GO-annotated proteins and protein domain analysis. Text information analysis and a multiparameter cellular localization predictive tool were also used to increase the annotation accuracy, and to predict novel annotations. The majority of proteins corresponding to full-length mRNA in GenBank, and the majority of proteins in the NR database (nonredundant database of proteins) were annotated with one or more GO nodes in each of the three GO categories. The annotations of GenBank and SWISS-PROT proteins are available to the public at the GO Consortium web site.

Keywords

GenBankAnnotationBiologyGene ontologyComputational biologyOntologyGenomeGenome projectProteomeGeneBioinformaticsGeneticsGene expression

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
letter
Volume
12
Issue
5
Pages
785-794
Citations
115
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

115
OpenAlex

Cite This

Hanqing Xie, Alon Wasserman, Zurit Levine et al. (2002). Large-Scale Protein Annotation through Gene Ontology. Genome Research , 12 (5) , 785-794. https://doi.org/10.1101/gr.86902

Identifiers

DOI
10.1101/gr.86902