Abstract
In this work, we present a comprehensive study on predicting the solubility of tolfenamic acid and the density of supercritical carbon dioxide (SC-CO<sub>2</sub>) using a combination of machine learning models and hyperparameter tuning. The dataset has two input features, temperature and pressure, which are used to predict two target outputs: the density of SC-CO<sub>2</sub> and the solubility of tolfenamic acid. Three models, ADA-GPR (AdaBoost on Gaussian Process Regression), ADA-SVR (AdaBoost on Support Vector Regression), and ADA-LR (AdaBoost on Linear Regression), were used to model the relationship between the inputs and outputs. The hyperparameters of these models were optimized using the Chimp Optimization Algorithm (ChOA) to enhance performance. In predicting the solubility of tolfenamic acid, ADA-GPR performed best, with an R-squared value of 0.98806, an RMSE of 0.10133, and an MAE of 0.07790. ADA-SVR and ADA-LR followed with R-squared scores of 0.96056 and 0.86815, respectively. For SC-CO<sub>2</sub> density prediction, ADA-GPR was again the best performer, with an R-squared score of 0.99265, an RMSE of 9.7870, and an MAE of 7.81506. ADA-SVR and ADA-LR achieved R-squared scores of 0.8841 and 0.87774, respectively. This study can help pharmaceutical and chemical companies predict tolfenamic acid solubility and SC-CO<sub>2</sub> density. The proposed models, combined with ChOA hyperparameter optimization, offer a practical approach to solubility and density prediction in research and industry.
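For readers unfamiliar with the three scores quoted in the abstract, the following is a minimal sketch of how R-squared, RMSE, and MAE are conventionally computed from observed and predicted values. The function name `regression_metrics` and the sample numbers are illustrative assumptions; they are not taken from the study, and the authors' own evaluation code may differ.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (R-squared, RMSE, MAE) for two equal-length sequences.

    Hypothetical helper for illustration; not the authors' code.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    mean_true = sum(y_true) / n

    # Residual sum of squares: squared gaps between observed and predicted.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: spread of the observed values around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R-squared is undefined when y_true is constant")

    r2 = 1.0 - ss_res / ss_tot      # coefficient of determination
    rmse = math.sqrt(ss_res / n)    # root mean squared error
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / n  # mean absolute error
    return r2, rmse, mae


# Made-up values for demonstration only; not data from the study.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 2.5, 4.0]
r2, rmse, mae = regression_metrics(y_true, y_pred)
print(f"R2={r2:.3f} RMSE={rmse:.3f} MAE={mae:.3f}")
```

RMSE and MAE carry the units of the target, which is why the density errors (9.7870 and 7.81506) are numerically much larger than the solubility errors even though the density R-squared is higher.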
Publication Info
- Year
- 2025
- Type
- article
- Citations
- 0
- Access
- Closed
Identifiers
- DOI
- 10.1038/s41598-025-31759-8