Explainable artificial intelligence-machine learning models to estimate overall scores in tertiary preparatory general science course
Article
Article Title | Explainable artificial intelligence-machine learning models to estimate overall scores in tertiary preparatory general science course |
---|---|
ERA Journal ID | 212149 |
Article Category | Article |
Authors | Ghimire, Sujan, Abdulla, Shahab, Joseph, Lionel P., Prasad, Salvin, Murphy, Angela, Devi, Aruna, Barua, Prabal Datta, Deo, Ravinesh C., Acharya, Rajendra and Yaseen, Zaher Mundher |
Journal Title | Computers and Education: Artificial Intelligence |
Journal Citation | 7 |
Article Number | 100331 |
Number of Pages | 30 |
Year | 2024 |
Publisher | Elsevier |
Place of Publication | United Kingdom |
ISSN | 2666-920X |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.caeai.2024.100331 |
Web Address (URL) | https://www.sciencedirect.com/science/article/pii/S2666920X24001346 |
Abstract | Educational data mining is valuable for uncovering latent relationships in educational settings, particularly for predicting students’ academic performance. This study introduces an interpretable hybrid model, optimised through Tree-structured Parzen Estimation (TPE) and Support Vector Regression (SVR), to predict overall scores (OT) utilising five assignments and one examination mark as predictors. Neural Network-based, Tree-Based, Ensemble-Based, and Boosting-based methods are evaluated against the hybrid TPE-optimised SVR model for forecasting final examination grades among 492 students enrolled in the TPP7155 (General Science) course at the University of Southern Queensland, Australia, during the 2020-2021 academic year. Additionally, Local Interpretable Model-agnostic Explanations (LIME) and SHapley Additive explanations (SHAP) techniques are employed to elucidate the inner workings of these prediction models. The findings highlight the superior performance of the proposed model, exhibiting the lowest Root Mean Squared Error (𝑅𝑀𝑆𝐸) and Relative Root Mean Squared Error (𝑅𝑅𝑀𝑆𝐸), as well as the highest Willmott’s index (𝑊 𝐼), Legates–McCabe index (𝐿𝑀), and Nash–Sutcliffe Efficiency (𝑁𝑆). With assignment and examination marks identified as pivotal predictors of OT. SHAP and LIME analyses reveal the examination score (ET) as the most influential feature, impacting predicted OT by an average of ±4.93. Conversely, Assignment 1 emerges as the least informative feature, contributing merely ±0.64 to OT predictions. This research underscores the efficacy of the proposed interpretable hybrid TPE-optimised SVR model in discerning relationships among continuous learning variables, thereby empowering educators with early intervention capabilities and enhancing their ability to anticipate student performance prior to course completion. |
Keywords | Tertiary preparatory program; Student performance prediction; Students at risk; Education models; Deep learning; Machine learning; Support vector regression |
Contains Sensitive Content | Does not contain sensitive content |
ANZSRC Field of Research 2020 | 3999. Other Education |
Byline Affiliations | School of Mathematics, Physics and Computing |
UniSQ College | |
Fiji National University, Fiji | |
Academic Division | |
University of the Sunshine Coast | |
King Fahd University of Petroleum and Minerals, Saudi Arabia |
https://research.usq.edu.au/item/zq814/explainable-artificial-intelligence-machine-learning-models-to-estimate-overall-scores-in-tertiary-preparatory-general-science-course
Download files
61
total views7
total downloads24
views this month1
downloads this month