Abstract

Abstract The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.

Keywords

BiobankImputation (statistics)GenotypeBiologyPopulationGeneticsHuman genetic variationBiorepositoryAlleleGenomeHuman genomeGeneMedicineMissing dataEnvironmental healthComputer science

MeSH Terms

AdultAgedAllelesBiomarkersBody HeightBrainCohort StudiesDatabasesFactualDatabasesGeneticElectronic Health RecordsFamilyFemaleGenome-Wide Association StudyGenomicsHaplotypesHumansLife StyleMajor Histocompatibility ComplexMaleMiddle AgedPhenotypeQuality ControlRacial GroupsUnited Kingdom

Affiliated Institutions

Related Publications

Publication Info

Year
2018
Type
article
Volume
562
Issue
7726
Pages
203-209
Citations
8828
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

8828
OpenAlex
651
Influential

Cite This

Clare Bycroft, Colin Freeman, Desislava Petkova et al. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature , 562 (7726) , 203-209. https://doi.org/10.1038/s41586-018-0579-z

Identifiers

DOI
10.1038/s41586-018-0579-z
PMID
30305743
PMCID
PMC6786975

Data Quality

Data completeness: 86%