Finishing the euchromatic sequence of the human genome

2004 Nature 4,843 citations

Abstract

The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.

Keywords

Human genome, Genome, ENCODE, Euchromatin, Genetics, Genome project, Whole genome sequencing, Reference genome, Biology, Sequence (biology), Complete sequence, Gene, Computational biology, Genome evolution, Chromosome

MeSH Terms

Amino Acid Sequence; Base Sequence; Centromere; Chromosomes, Artificial, Bacterial; Chromosomes, Human; DNA, Complementary; Euchromatin; Gene Duplication; Genes; Genome, Human; Heterochromatin; Human Genome Project; Humans; Molecular Sequence Data; Multigene Family; Physical Chromosome Mapping; Plasmids; Pseudogenes; Research Design; Sensitivity and Specificity; Sequence Analysis, DNA; Telomere

Related Publications

The Phusion Assembler

The Phusion assembler has assembled the mouse genome from the whole-genome shotgun (WGS) dataset collected by the Mouse Genome Sequencing Consortium, at ∼7.5× sequence coverage,...

2002 Genome Research 220 citations

Publication Info

Year: 2004
Type: article
Volume: 431
Issue: 7011
Pages: 931-945
Citations: 4,843
Access: Closed

Citation Metrics

OpenAlex: 4,843
Influential: 4
CrossRef: 3,767

Cite This

Uma Maheswari, Kamel Jabbari, Jean‐Louis Petit et al. (2004). Finishing the euchromatic sequence of the human genome. Nature, 431(7011), 931-945. https://doi.org/10.1038/nature03001

Identifiers

DOI
10.1038/nature03001
PMID
15496913
