Abstract

A general principle of biology is the self-assembly of proteins into functional complexes. Characterizing their composition is, therefore, required for our understanding of cellular functions. Unfortunately, we lack knowledge of the comprehensive set of identities of protein complexes in human cells. To address this gap, we developed a machine learning framework to identify protein complexes in over 15,000 mass spectrometry experiments which resulted in the identification of nearly 7,000 physical assemblies. We show our resource, hu.MAP 2.0, is more accurate and comprehensive than previous state of the art high-throughput protein complex resources and gives rise to many new hypotheses, including for 274 completely uncharacterized proteins. Further, we identify 253 promiscuous proteins that participate in multiple complexes pointing to possible moonlighting roles. We have made hu.MAP 2.0 easily searchable in a web interface (http://humap2.proteincomplexes.org/), which will be a valuable resource for researchers across a broad range of interests including systems biology, structural biology, and molecular explanations of disease.

Keywords

BiologyCompendiumComputational biologyIdentification (biology)Systems biologyInterface (matter)Set (abstract data type)Structural biologyProteomicsResource (disambiguation)Computer scienceGeneticsBiochemistryGene

Affiliated Institutions

Related Publications

Publication Info

Year
2021
Type
article
Volume
17
Issue
5
Pages
e10016-e10016
Citations
141
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

141
OpenAlex

Cite This

Kevin Drew, John B. Wallingford, Edward M. Marcotte (2021). hu.MAP 2.0: integration of over 15,000 proteomic experiments builds a global compendium of human multiprotein assemblies. Molecular Systems Biology , 17 (5) , e10016-e10016. https://doi.org/10.15252/msb.202010016

Identifiers

DOI
10.15252/msb.202010016