Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer

2021 Computational and Structural Biotechnology Journal 1,673 citations

Abstract

Extensive research is directed to uncover new biomarkers capable to stratify breast cancer patients into clinically relevant cohorts. However, the overall performance ranking of such marker candidates compared to other genes is virtually absent. Here, we present the ranking of all survival related genes in chemotherapy treated basal and estrogen positive/HER2 negative breast cancer. We searched the GEO repository to uncover transcriptomic datasets with available follow-up and clinical data. After quality control and normalization, samples entered an integrated database. Molecular subtypes were designated using gene expression data. Relapse-free survival analysis was performed using Cox proportional hazards regression. False discovery rate was computed to combat multiple hypothesis testing. Kaplan-Meier plots were drawn to visualize the best performing genes. The entire database includes 7,830 unique samples from 55 independent datasets. Of those with available relapse-free survival time, 3,382 samples were estrogen receptor-positive and 696 were basal. In chemotherapy treated ER positive/ERBB2 negative patients the significant prognostic biomarker genes achieved hazard rates between 1.76 and 3.33 with a p value below 5.8E-04. The significant prognostic genes in adjuvant chemotherapy treated basal breast cancer samples reached hazard rates between 1.88 and 3.61 with a p value below 7.2E-04. Our integrated platform was extended enabling the validation of future biomarker candidates. A reference ranking for all genes in two chemotherapy treated breast cancer cohorts is presented. The results help to neglect those with unlikely clinical significance and to focus future research on the most promising candidates.

Keywords

Breast cancerOncologyProportional hazards modelBiomarkerHazard ratioInternal medicineMedicineTranscriptomeCancerSurvival analysisBioinformaticsGeneBiologyGene expressionGenetics

Affiliated Institutions

Related Publications

Postoperative chemotherapy and tamoxifen compared with tamoxifen alone in the treatment of positive-node breast cancer patients aged 50 years and older with tumors responsive to tamoxifen: results from the National Surgical Adjuvant Breast and Bowel Project B-16.

The National Surgical Adjuvant Breast and Bowel Project (NSABP) conducted a randomized clinical trial to determine whether tamoxifen (TAM) plus chemotherapy is more effective th...

1990 Journal of Clinical Oncology 294 citations

Trastuzumab and Vinorelbine as First-Line Therapy for HER2-Overexpressing Metastatic Breast Cancer: Multicenter Phase II Trial With Clinical Outcomes, Analysis of Serum Tumor Markers as Predictive Factors, and Cardiac Surveillance Algorithm

Purpose: Trastuzumab-based therapy improves survival for women with human epidermal growth factor receptor 2 (HER2)–positive advanced breast cancer. We conducted a multicenter p...

2003 Journal of Clinical Oncology 343 citations

Pretreatment Quality of Life and Functional Status Assessment Significantly Predict Survival of Elderly Patients With Advanced Non—Small-Cell Lung Cancer Receiving Chemotherapy: A Prognostic Analysis of the Multicenter Italian Lung Cancer in the Elderly Study

Purpose To study the prognostic value for overall survival of baseline assessment of functional status, comorbidity, and quality of life (QoL) in elderly patients with advanced ...

2005 Journal of Clinical Oncology 477 citations

Publication Info

Year
2021
Type
article
Volume
19
Pages
4101-4109
Citations
1673
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1673
OpenAlex
37
Influential
762
CrossRef

Cite This

Balázs Győrffy (2021). Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer. Computational and Structural Biotechnology Journal , 19 , 4101-4109. https://doi.org/10.1016/j.csbj.2021.07.014

Identifiers

DOI
10.1016/j.csbj.2021.07.014
PMID
34527184
PMCID
PMC8339292

Data Quality

Data completeness: 86%