Abstract
A general principle of biology is the self-assembly of proteins into functional complexes. Characterizing their composition is, therefore, required for our understanding of cellular functions. Unfortunately, we lack knowledge of the comprehensive set of identities of protein complexes in human cells. To address this gap, we developed a machine learning framework to identify protein complexes in over 15,000 mass spectrometry experiments which resulted in the identification of nearly 7,000 physical assemblies. We show our resource, hu.MAP 2.0, is more accurate and comprehensive than previous state of the art high-throughput protein complex resources and gives rise to many new hypotheses, including for 274 completely uncharacterized proteins. Further, we identify 253 promiscuous proteins that participate in multiple complexes pointing to possible moonlighting roles. We have made hu.MAP 2.0 easily searchable in a web interface (http://humap2.proteincomplexes.org/), which will be a valuable resource for researchers across a broad range of interests including systems biology, structural biology, and molecular explanations of disease.
Keywords
Affiliated Institutions
Related Publications
Structural space of protein–protein interfaces is degenerate, close to complete, and highly connected
At the heart of protein–protein interactions are protein–protein interfaces where the direct physical interactions occur. By developing and applying an efficient structural alig...
Human Protein Reference Database--2009 update
Human Protein Reference Database (HPRD--http://www.hprd.org/), initially described in 2003, is a database of curated proteomic information pertaining to human proteins. We have ...
CASTp 3.0: computed atlas of surface topography of proteins
Geometric and topological properties of protein structures, including surface pockets, interior cavities and cross channels, are of fundamental importance for proteins to carry ...
The Protein Information Resource: an integrated public resource of functional annotation of proteins
The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific disc...
Identification of transformation sensitive proteins recorded in human two‐dimensional gel protein databases by mass spectrometric peptide mapping alone and in combination with microsequencing
Abstract A comprehensive human keratinocyte two‐dimensional (2‐D) gel protein database has been established to study the expression levels and properties of the thousands of pro...
Publication Info
- Year
- 2021
- Type
- article
- Volume
- 17
- Issue
- 5
- Pages
- e10016-e10016
- Citations
- 141
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.15252/msb.202010016