Cyanobacteria blue-green algae prediction enhancement using hybrid machine learning–based gamma test variable selection and empirical wavelet transform
Article
Article Title | Cyanobacteria blue-green algae prediction enhancement using hybrid machine learning–based gamma test variable selection and empirical wavelet transform |
---|---|
ERA Journal ID | 5827 |
Article Category | Article |
Authors | Heddam, Salim, Yaseen, Zaher Mundher, Falah, Mayadah W., Goliatt, Leonardo, Tan, Mou Leong, Sa’adi, Zulfaqar, Ahmadianfar, Iman, Saggi, Mandeep, Bhatia, Amandeep and Samui, Pijush |
Journal Title | Environmental Science and Pollution Research |
Journal Citation | 29 (51), pp. 77157-77187 |
Number of Pages | 31 |
Year | 2022 |
Publisher | Springer |
Place of Publication | Germany |
ISSN | 0944-1344 |
1614-7499 | |
Digital Object Identifier (DOI) | https://doi.org/10.1007/s11356-022-21201-1 |
Web Address (URL) | https://link.springer.com/article/10.1007/s11356-022-21201-1 |
Abstract | This study aims to evaluate the usefulness and effectiveness of four machine learning (ML) models for modelling cyanobacteria blue-green algae (CBGA) at two rivers located in the USA. The proposed modelling framework was based on establishing a link between five water quality variables and the concentration of CBGA. For this purpose, artificial neural network (ANN), extreme learning machine (ELM), random forest regression (RFR), and random vector functional link (RVFL) are developed. First, the four models were developed using only water quality variables. Second, based on the results of the first, a new modelling strategy was introduced based on preprocessing signal decomposition. Hence, the empirical mode decomposition (EMD), the variational mode decomposition (VMD), and the empirical wavelet transform (EWT) were used for decomposing the water quality variables into several subcomponents, and the obtained intrinsic mode functions (IMFs) and multiresolution analysis (MRA) components were used as new input variables for the ML models. Results of the present investigation show that (i) using single models, good predictive accuracy was obtained using the RFR model exhibiting an R and NSE values of ≈0.914 and ≈0.833 for the first station, and ≈0.944 and ≈0.884 for the second station, while the others models, i.e., ANN, RVFL, and ELM, have failed to provide a good estimation of the CBGA; (ii) the decomposition methods have contributed to a significant improvement of the individual models performances; (iii) among the thee decomposition methods, the EMD was found to be superior to the VMD and EWT; and (iv) the ANN and RFR were found to be more accurate compared to the ELM and RVFL models, exhibiting high numerical performances with R and NSE values of approximately ≈0.983, ≈0.967, and ≈0.989 and ≈0.976, respectively. |
Keywords | Modelling ; CBGA ; Water quality; ELM ; ANN ; RVFL ; RFR ; EMD ; VMD ; EWT |
Public Notes | File reproduced in accordance with the copyright policy of the publisher/author. |
Byline Affiliations | University of 20 August 1955 of Skikda, Algeria |
School of Mathematics, Physics and Computing | |
National University of Malaysia | |
Al-Ayen University, Iraq | |
Al-Mustaqbal University College, Iraq | |
Behbahan Khatam Alanbia University of Technology, Iran | |
University of Science, Malaysia | |
University of Technology Malaysia, Malaysia | |
Thapar Institute of Engineering and Technology, India | |
Thapar University, India | |
National Institute of Technology Patna, India |
https://research.usq.edu.au/item/z0209/cyanobacteria-blue-green-algae-prediction-enhancement-using-hybrid-machine-learning-based-gamma-test-variable-selection-and-empirical-wavelet-transform
Download files
62
total views86
total downloads4
views this month1
downloads this month