Abstract
We introduce ParkMAE, a robust multilingual speech foundation model and comprehensive benchmarking system for Parkinson's disease assessment. We curated several large-scale speech datasets comprising approximately 750 h of pretraining data and 100 h of evaluation data across four languages and diverse clinical populations. Our self-supervised masked autoencoder, pretrained on this multilingual corpus, achieves a 39% F1 score for cross-linguistic diagnosis. It outperforms existing acoustic markers (eGeMAPS) by a significant margin and performs comparably to generic speech models (Whisper) while using 89% fewer parameters. ParkMAE also generalizes well to unseen languages without language-specific finetuning. Beyond diagnosis, we systematically evaluate medication state monitoring and disease staging, and find that, despite promising reports in the literature, current publicly available datasets and speech-based approaches fail to capture these clinical dimensions reliably. For cognitive assessment (MoCA), our model showed predictive capability (F1 = 0.56), suggesting potential for speech-based cognitive monitoring. This evaluation establishes both the capabilities and the current limitations of speech-based Parkinson's disease assessment and provides a reproducible framework for future clinical development.
Publication Info
- Year: 2025
- Type: article
- Citations: 0
- Access: Closed
Identifiers
- DOI: 10.1038/s41598-025-30251-7