Comprehensive functional genomic resource and integrative model for the human brain

Daifeng Wang , Shuang Liu , Jonathan Warrell , Daifeng Wang , Shuang Liu , Jonathan Warrell , Hyejung Won , Xu Shi , Fábio C. P. Navarro , Declan Clarke , Mengting Gu , Prashant S. Emani , Yucheng Yang , Min Xu , Michael J. Gandal , Shaoke Lou , Jing Zhang , Jonathan J. Park , Chengfei Yan , Suhn K. Rhie , Kasidet Manakongtreecheep , Holly Zhou , Aparna Nathan , Mette A. Peters , Eugenio Mattei , Dominic Fitzgerald , Tonya M. Brunetti , Jill E. Moore , Yan Jiang , Kiran Girdhar , Gabriel E. Hoffman , Selim Kalaycı , Zeynep H. Gümüş , Gregory E. Crawford , Panos Roussos , Schahram Akbarian , Andrew E. Jaffe , Kevin P. White , Zhiping Weng , Nenad Šestan , Daniel H. Geschwind , James A. Knowles , Mark Gerstein , Allison E. Ashley‐Koch , Gregory E. Crawford , Melanie E. Garrett , Lingyun Song , Alexias Safi , Graham D. Johnson , Gregory A. Wray , Timothy E. Reddy , Fernando S. Goes , Peter P. Zandi , Julien Bryois , Andrew E. Jaffe , Amanda J. Price , Nikolay A. Ivanov , Leonardo Collado‐Torres , Thomas M. Hyde , Emily E. Burke , Joel E. Kleiman , Ran Tao , Joo Heon Shin , Schahram Akbarian , Kiran Girdhar , Yan Jiang , Marija Kundaković , Leanne Brown , Bibi Kassim , Royce Park , Jennifer Wiseman , Elizabeth Zharovsky , Rivka Jacobov , Olivia Devillers , Elie Flatow , Gabriel E. Hoffman , Barbara K. Lipska , David A. Lewis , Vahram Haroutunian , Chang-Gyu Hahn , Alexander W. Charney , Stella Dracheva , Alexey Kozlenkov , Judson Belmont , Diane M. Del Valle , Nancy Francoeur , Evi Hadjimichael , Dalila Pinto , Harm van Bakel , Panos Roussos , John F. Fullard , Jaroslav Bendl , Mads E. Hauberg , Lara M. Mangravite , Mette A. Peters , Yooree Chae , Junmin Peng , Mingming Niu , Xusheng Wang , Maree J. Webster , Thomas G. Beach , Chao Chen , Yi Jiang
2018 Science 1,021 citations

Abstract

INTRODUCTION Strong genetic associations have been found for a number of psychiatric disorders. However, understanding the underlying molecular mechanisms remains challenging. RATIONALE To address this challenge, the PsychENCODE Consortium has developed a comprehensive online resource and integrative models for the functional genomics of the human brain. RESULTS The base of the pyramidal resource is the datasets generated by PsychENCODE, including bulk transcriptome, chromatin, genotype, and Hi-C datasets and single-cell transcriptomic data from ~32,000 cells for major brain regions. We have merged these with data from Genotype-Tissue Expression (GTEx), ENCODE, Roadmap Epigenomics, and single-cell analyses. Via uniform processing, we created a harmonized resource, allowing us to survey functional genomics data on the brain over a sample size of 1866 individuals. From this uniformly processed dataset, we created derived data products. These include lists of brain-expressed genes, coexpression modules, and single-cell expression profiles for many brain cell types; ~79,000 brain-active enhancers with associated Hi-C loops and topologically associating domains; and ~2.5 million expression quantitative-trait loci (QTLs) comprising ~238,000 linkage-disequilibrium–independent single-nucleotide polymorphisms and of other types of QTLs associated with splice isoforms, cell fractions, and chromatin activity. By using these, we found that >88% of the cross-population variation in brain gene expression can be accounted for by cell fraction changes. Furthermore, a number of disorders and aging are associated with changes in cell-type proportions. The derived data also enable comparison between the brain and other tissues. In particular, by using spectral analyses, we found that the brain has distinct expression and epigenetic patterns, including a greater extent of noncoding transcription than other tissues. The top level of the resource consists of integrative networks for regulation and machine-learning models for disease prediction. The networks include a full gene regulatory network (GRN) for the brain, linking transcription factors, enhancers, and target genes from merging of the QTLs, generalized element-activity correlations, and Hi-C data. By using this network, we link disease genes to genome-wide association study (GWAS) variants for psychiatric disorders. For schizophrenia, we linked 321 genes to the 142 reported GWAS loci. We then embedded the regulatory network into a deep-learning model to predict psychiatric phenotypes from genotype and expression. Our model gives a ~6-fold improvement in prediction over additive polygenic risk scores. Moreover, it achieves a ~3-fold improvement over additive models, even when the gene expression data are imputed, highlighting the value of having just a small amount of transcriptome data for disease prediction. Lastly, it highlights key genes and pathways associated with disorder prediction, including immunological, synaptic, and metabolic pathways, recapitulating de novo results from more targeted analyses. CONCLUSION Our resource and integrative analyses have uncovered genomic elements and networks in the brain, which in turn have provided insight into the molecular mechanisms underlying psychiatric disorders. Our deep-learning model improves disease risk prediction over traditional approaches and can be extended with additional data types (e.g., microRNA and neuroimaging). A comprehensive functional genomic resource for the adult human brain. The resource forms a three-layer pyramid. The bottom layer includes sequencing datasets for traits, such as schizophrenia. The middle layer represents derived datasets, including functional genomic elements and QTLs. The top layer contains integrated models, which link genotypes to phenotypes. DSPN, Deep Structured Phenotype Network; PC1 and PC2, principal components 1 and 2; ref, reference; alt, alternate; H3K27ac, histone H3 acetylation at lysine 27.

Keywords

Expression quantitative trait lociComputational biologyQuantitative trait locusBiologyChromatinGenome-wide association studySchizophrenia (object-oriented programming)GeneEnhancerPopulationTraitRNA splicingGene regulatory networkGenetic associationDiseaseGeneticsEvolutionary biologyGene expressionSingle-nucleotide polymorphismComputer sciencePsychologyGenotypeMedicineRNAPsychiatry

Affiliated Institutions

Related Publications

The human disease network

A network of disorders and disease genes linked by known disorder–gene associations offers a platform to explore in a single graph-theoretic framework all known phenotype and di...

2007 Proceedings of the National Academy o... 3182 citations

Publication Info

Year
2018
Type
article
Volume
362
Issue
6420
Citations
1021
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1021
OpenAlex

Cite This

Daifeng Wang, Shuang Liu, Jonathan Warrell et al. (2018). Comprehensive functional genomic resource and integrative model for the human brain. Science , 362 (6420) . https://doi.org/10.1126/science.aat8464

Identifiers

DOI
10.1126/science.aat8464