Abstract
The goal of the NCBI Reference Sequence (RefSeq) project is to provide the single best non-redundant and comprehensive collection of naturally occurring biological molecules, representing the central dogma. Nucleotide and protein sequences are explicitly linked on a residue-by-residue basis in this collection. Ideally all molecule types will be available for each well-studied organism, but the initial database collection pragmatically includes only those molecules and organisms that are most readily identified. Thus different amounts of information are available for different organisms at any given time. Furthermore, for some organisms additional intermediate records are provided when the genome sequence is not yet finished. The collection is supplied by NCBI through three distinct pipelines in addition to collaborations with community groups. The collection is curated on an ongoing basis. Additional information about the NCBI RefSeq project is available at http://www.ncbi.nih.gov/RefSeq/.
Related Publications
RepeatModeler2 for automated genomic discovery of transposable element families
The accelerating pace of genome sequencing throughout the tree of life is driving the need for improved unsupervised annotation of genome components such as transposable element...
miRBase: from microRNA sequences to function
The miRBase database is a searchable database of published miRNA sequences and annotation. Each entry in miRBase represents a predicted hairpi...
KEGG for taxonomy-based analysis of pathways and genomes
KEGG (https://www.kegg.jp) is a manually curated database resource integrating various biological objects categorized into systems, genomic, chemical and health informa...
QUAST: quality assessment tool for genome assemblies
Limitations of genome sequencing techniques have led to dozens of assembly algorithms, none of which is perfect. A number of methods for comparing assemblers h...
The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data
The Catalytic Site Atlas (CSA) provides catalytic residue annotation for enzymes in the Protein Data Bank. It is available online at http://www.ebi.ac.uk/thornton-srv/databases/...
Publication Info
- Year: 2003
- Type: article
- Volume: 31
- Issue: 1
- Pages: 34-37
- Citations: 169
- Access: Closed
Identifiers
- DOI: 10.1093/nar/gkg111
- PMID: 12519942
- PMCID: PMC165558