STI/HIV Risk Prediction Model Development-A Novel Use of Public Data To Forecast STIs/HIV Risk For Men Who Have Sex With Men
Article
Article Title | STI/HIV Risk Prediction Model Development-A Novel Use of Public Data To Forecast STIs/HIV Risk For Men Who Have Sex With Men |
---|---|
ERA Journal ID | 200526 |
Article Category | Article |
Authors | Ji, Xiaopeng, Tang, Zhaohui, Osborne, Sonya R., Nguyen, Thi Phuoc Van, Mullens, Amy B., Dean, Judith A. and Li, Yan |
Journal Title | Frontiers in Public Health |
Journal Citation | 12 |
Article Number | 1511689 |
Number of Pages | 15 |
Year | 2025 |
Publisher | Frontiers Media SA |
Place of Publication | Switzerland |
ISSN | 2296-2565 |
Digital Object Identifier (DOI) | https://doi.org/10.3389/fpubh.2024.1511689 |
Web Address (URL) | https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2024.1511689/full |
Abstract | A novel automatic framework is proposed for global sexually transmissible infections (STIs) and HIV risk prediction. Four machine learning methods, namely, Gradient Boosting Machine (GBM), Random Forest (RF), XG Boost, and Ensemble learning GBM-RF-XG Boost are applied and evaluated on the Demographic and Health Surveys Program (DHSP), with thirteen features ultimately selected as the most predictive features. Classification and generalization experiments are conducted to test the accuracy, F1-score, precision, and area under the curve (AUC) performance of these four algorithms. Two imbalanced data solutions are also applied to reduce bias for classification performance improvement. The experimental results of these models demonstrate that the Random Forest algorithm yields the best results on HIV prediction, whereby the highest accuracy, and AUC are 0.99 and 0.99, respectively. The performance of the STI prediction achieves the best when the Synthetic Minority Oversampling Technique (SMOTE) is applied (Accuracy = 0.99, AUC = 0.99), which outperforms the state-of-the-art baselines. Two possible factors that may affect the classification and generalization performance are further analyzed. This automatic classification model helps to improve convenience and reduce the cost of HIV testing. |
Keywords | human immunodeficiency virus; sexually transmissible infections; artificial intelligence; machine learning; risk prediction |
Contains Sensitive Content | Does not contain sensitive content |
ANZSRC Field of Research 2020 | 320211. Infectious diseases |
460207. Modelling and simulation | |
420603. Health promotion | |
Byline Affiliations | Centre for Health Research |
School of Mathematics, Physics and Computing | |
School of Nursing and Midwifery | |
Institute for Resilient Regions | |
School of Psychology and Wellbeing | |
University of Queensland |
https://research.usq.edu.au/item/zv1vv/sti-hiv-risk-prediction-model-development-a-novel-use-of-public-data-to-forecast-stis-hiv-risk-for-men-who-have-sex-with-men
Download files
Published Version
STI HIV Risk Prediction Model in Froniers in Public Health 2024.pdf | ||
License: CC BY 4.0 | ||
File access level: Anyone |
0
total views1
total downloads0
views this month0
downloads this month