Abstract
Background: Laryngeal and hypopharyngeal cancers (LHCs) exhibit heterogeneous outcomes after definitive radiotherapy (RT). Large language models (LLMs) may enhance prognostic stratification by integrating complex clinical and imaging data. This study validated two pre-trained LLMs—GPT-4o-2024-08-06 and Gemma-2-27b-it—for outcome prediction in LHC. Methods: Ninety-two patients with non-metastatic LHC treated with definitive (chemo)radiotherapy at Linkou Chang Gung Memorial Hospital (2006–2013) were retrospectively analyzed. First-order and 3D radiomic features were extracted from intra- and peritumoral regions on pre- and mid-RT CT scans. LLMs were prompted with clinical variables, radiotherapy notes, and radiomic features to classify patients as high- or low-risk for death, recurrence, and distant metastasis. Model performance was assessed using sensitivity, specificity, AUC, Kaplan–Meier survival analysis, and McNemar tests. Results: Integration of radiomic features significantly improved prognostic discrimination over clinical/RT plan data alone for both LLMs. For death prediction, pre-RT radiomics were the most predictive: GPT-4o achieved a peak AUC of 0.730 using intratumoral features, while Gemma-2-27b reached 0.736 using peritumoral features. For recurrence prediction, mid-RT peritumoral features yielded optimal performance (AUC = 0.703 for GPT-4o; AUC = 0.709 for Gemma-2-27b). Kaplan–Meier analyses confirmed statistically significant separation of risk groups: pre-RT intra- and peritumoral features for overall survival (for both GPT-4o and Gemma-2-27b, p < 0.05), and mid-RT peritumoral features for recurrence-free survival (p = 0.028 for GPT-4o; p = 0.017 for Gemma-2-27b). McNemar tests revealed no significant performance difference between the two LLMs when augmented with radiomics (all p > 0.05), indicating that the open-source model achieved comparable accuracy to its proprietary counterpart. Both models generated clinically coherent, patient-specific rationales explaining risk assignments, enhancing interpretability and clinical trust. Conclusions: This external validation demonstrates that pre-trained LLMs can serve as accurate, interpretable, and multimodal prognostic engines for LHC. Pre-RT radiomic features are critical for predicting mortality and metastasis, while mid-RT peritumoral features uniquely inform recurrence risk. The comparable performance of the open-source Gemma-2-27b-it model suggests a scalable, cost-effective, and privacy-preserving pathway for the integration of LLM-based tools into precision radiation oncology workflows to enhance risk stratification and therapeutic personalization.
Affiliated Institutions
Related Publications
The prognostic value of peritumoral regulatory T cells and its correlation with intratumoral cyclooxygenase‐2 expression in clear cell renal cell carcinoma
OBJECTIVE To investigate the prognostic value of regulatory T cells (Tregs) and its correlation with cyclooxygenase‐2 (COX‐2) expression in clear cell renal cell carcinoma (RCC)...
CT-based radiomic signature predicts distant metastasis in lung adenocarcinoma
Radiomics provides opportunities to quantify the tumor phenotype non-invasively by applying a large number of quantitative imaging features. This study evaluates computed-tomogr...
Radiotherapy and Temozolomide for Newly Diagnosed Glioblastoma: Recursive Partitioning Analysis of the EORTC 26981/22981-NCIC CE3 Phase III Randomized Trial
Purpose The European Organisation for Research and Treatment of Cancer and National Cancer Institute of Canada trial on temozolomide (TMZ) and radiotherapy (RT) in glioblastoma ...
Prognostic significance of tumor grade in clinical trials of adjuvant therapy for breast cancer with axillary lymph node metastasis
The prognostic significance of histologic tumor grade has been evaluated in 1537 women entered into the Ludwig Trials I-IV of adjuvant therapy for node-positive breast cancer. T...
Clinical prognostic factors affecting survival in patients with newly diagnosed Glioblastoma Multiforme (GBM)
9599 Background:In the EORTC 26981–22981/NCIC CE3 phase III trial, 573 pts with newly diagnosed GBM were randomized to receive radiotherapy (RT) or RT plus Temozolomide (results...
Publication Info
- Year
- 2025
- Type
- article
- Volume
- 12
- Issue
- 12
- Pages
- 1345-1345
- Citations
- 0
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.3390/bioengineering12121345