Streamflow and soil moisture forecasting with hybrid data intelligent machine learning approaches: case studies in the Australian Murray-Darling basin

PhD Thesis


Prasad, Ramendra. 2018. Streamflow and soil moisture forecasting with hybrid data intelligent machine learning approaches: case studies in the Australian Murray-Darling basin . PhD Thesis Doctor of Philosophy. University of Southern Queensland. https://doi.org/10.26192/5f6974e4dccd5
Title

Streamflow and soil moisture forecasting with hybrid data intelligent machine learning approaches: case studies
in the Australian Murray-Darling basin

TypePhD Thesis
Authors
AuthorPrasad, Ramendra
SupervisorDeo, Ravinesh C.
Li, Yan
Maraseni, Tek
Institution of OriginUniversity of Southern Queensland
Qualification NameDoctor of Philosophy
Number of Pages221
Year2018
Digital Object Identifier (DOI)https://doi.org/10.26192/5f6974e4dccd5
Abstract

For a drought-prone agricultural nation such as Australia, hydro-meteorological imbalances and increasing demand for water resources are immensely constraining terrestrial water reservoirs and regional-scale agricultural productivity. Two important components of the terrestrial water reservoir i.e., streamflow water level (SWL) and soil moisture (SM), are imperative both for agricultural and hydrological applications. Forecasted SWL and SM can enable prudent and sustainable decisionmaking for agriculture and water resources management. To feasibly emulate SWL and SM, machine learning data-intelligent models are a promising tool in today’s rapidly advancing data science era. Yet, the naturally chaotic characteristics of hydro-meteorological variables that can exhibit non-linearity and non-stationarity behaviors within the model dataset, is a key challenge for non-tuned machine learning models. Another important issue that could confound model accuracy or applicability is the selection of relevant features to emulate SWL and SM since the use of too fewer inputs can lead to insufficient information to construct an accurate model while the use of an excessive number and redundant model inputs could obscure the performance of the simulation algorithm.

This research thesis focusses on the development of hybridized dataintelligent models in forecasting SWL and SM in the upper layer (surface to 0.2 m) and the lower layer (0.2–1.5 m depth) within the agricultural region of the Murray-Darling Basin, Australia. The SWL quantifies the availability of surface water resources, while, the upper layer SM (or the surface SM) is important for surface runoff, evaporation, and energy exchange at the Earth-Atmospheric interface. The lower layer (or the root zone) SM is essential for groundwater recharge purposes, plant uptake and transpiration. This research study is constructed upon four primary objectives designed for the forecasting of SWL and SM with subsequent robust evaluations by means of statistical metrics, in tandem with the diagnostic plots of observed and modeled datasets.

The first objective establishes the importance of feature selection (or optimization) in the forecasting of monthly SWL at three study sites within the Murray-Darling Basin. Artificial neural network (ANN) model optimized with iterative input selection (IIS) algorithm named IIS-ANN is developed whereby the IIS algorithm achieves feature optimization. The IIS-ANN model outperforms the standalone models and a further hybridization is performed by integrating a nondecimated and advanced maximum overlap discrete wavelet transformation (MODWT) technique. The IIS selected inputs are transformed into wavelet subseries via MODWT to unveil the embedded features leading to IIS-W-ANN model. The IIS-W-ANN outperforms the comparative IIS-W-M5 Model Tree, IIS-based and standalone models.

In the second objective, improved self-adaptive multi-resolution analysis (MRA) techniques, ensemble empirical mode decomposition (EEMD) and complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) are utilized to address the non-stationarity issues in forecasting monthly upper and lower layer soil moisture at seven sites. The SM time-series are decomposed using EEMD/CEEMDAN into respective intrinsic mode functions (IMFs) and residual components. Then the partial-auto correlation function based significant lags are
utilized as inputs to the extreme learning machine (ELM) and random forest (RF) models. The hybrid EEMD-ELM yielded better results in comparison to the CEEMDAN-ELM, EEMD-RF, CEEMDAN-RF and the classical ELM and RF models.

Since SM is contingent upon many influential meteorological, hydrological and atmospheric parameters, for the third objective sixty predictor inputs are collated
in forecasting upper and lower layer soil moisture at four sites. An ANN-based ensemble committee of models (ANN-CoM) is developed integrating a two-phase feature optimization via Neighborhood Component Analysis based feature selection algorithm for regression (fsrnca) and a basic ELM. The ANN-CoM shows better predictive performance in comparison to the standalone second order Volterra, M5 Model Tree, RF, and ELM models.

In the fourth objective, a new multivariate sequential EEMD based modelling is developed. The establishment of multivariate sequential EEMD is an advancement
of the classical single input EEMD approach, achieving a further methodological improvement. This multivariate approach is developed to allow for the utilization of
multiple inputs in forecasting SM. The multivariate sequential EEMD optimized with cross-correlation function and Boruta feature selection algorithm is integrated with the ELM model in emulating weekly SM at four sites. The resulting hybrid multivariate sequential EEMD-Boruta-ELM attained a better performance in comparison with the multivariate adaptive regression splines (MARS) counterpart (EEMD-Boruta-MARS) and standalone ELM and MARS models.

The research study ascertains the applicability of feature selection algorithms integrated with appropriate MRA for improved hydrological forecasting. Forecasting at shorter and near-real-time horizons (i.e., weekly) would help reinforce scientific tenets in designing knowledge-based systems for precision agriculture and climate change adaptation policy formulations.

Keywordsstreamflow, soil moisture, forecasting, machine learning, Murray-Darling Basin, non-stationarity
ANZSRC Field of Research 2020370199. Atmospheric sciences not elsewhere classified
370105. Atmospheric dynamics
300207. Agricultural systems analysis and modelling
460207. Modelling and simulation
469999. Other information and computing sciences not elsewhere classified
490199. Applied mathematics not elsewhere classified
Byline AffiliationsSchool of Agricultural, Computational and Environmental Sciences
Permalink -

https://research.usq.edu.au/item/q541q/streamflow-and-soil-moisture-forecasting-with-hybrid-data-intelligent-machine-learning-approaches-case-studies-in-the-australian-murray-darling-basin

Download files


Published Version
Ramendra_Thesis_NOT_highlighted.pdf
File access level: Anyone

  • 371
    total views
  • 3231
    total downloads
  • 6
    views this month
  • 13
    downloads this month

Export as

Related outputs

Designing Deep-based Learning Flood Forecast Model with ConvLSTM Hybrid Algorithm
Moishin, Mohammed, Deo, Ravinesh C., Prasad, Ramendra, Raj, Nawin and Abdulla, Shahab. 2021. "Designing Deep-based Learning Flood Forecast Model with ConvLSTM Hybrid Algorithm." IEEE Access. 9, pp. 50982-50993. https://doi.org/10.1109/ACCESS.2021.3065939
Modelling and Real-time Optimisation of Air Quality Predictions for Australia through Artificial Intelligence Algorithm
Sharma, Ekta, Deo, Ravinesh C., Prasad, Ramendra and Parisi, Alfio V.. 2019. "Modelling and Real-time Optimisation of Air Quality Predictions for Australia through Artificial Intelligence Algorithm." AMSI Optimise 2019. Perth, Australia 17 - 21 Jun 2019 Perth, Australia.
Hybrid Convolutional Neural Network-Multilayer Perceptron Model for Solar Radiation Prediction
Ghimire, Sujan, Nguyen-Huy, Thong, Prasad, Ramendra, Deo, Ravinesh C., Casillas-Perez, David, Salcedo-sanz, Sancho and Bhandari, Binayak. 2023. "Hybrid Convolutional Neural Network-Multilayer Perceptron Model for Solar Radiation Prediction." Cognitive Computation. 15 (2), pp. 645-671. https://doi.org/10.1007/s12559-022-10070-y
Coupled online sequential extreme learning machine model with ant colony optimization algorithm for wheat yield prediction
Ali, Mumtaz, Deo, Ravinesh C., Xiang, Yong, Prasad, Ramendra, Li, Jianxin, Farooque, Aitazaz and Yaseen, Zaher Mundher. 2022. "Coupled online sequential extreme learning machine model with ant colony optimization algorithm for wheat yield prediction." Scientific Reports. 12 (1), pp. 1-23. https://doi.org/10.1038/s41598-022-09482-5
Forecasting Daily Flood Water Level Using Hybrid Advanced Machine Learning Based Time‑Varying Filtered Empirical Mode Decomposition Approach
Jamei, Mehdi, Ali, Mumtaz, Malik, Anurag, Prasad, Ramendra, Abdulla, Shahab and Yaseen, Zaher Mundher. 2022. "Forecasting Daily Flood Water Level Using Hybrid Advanced Machine Learning Based Time‑Varying Filtered Empirical Mode Decomposition Approach." Water Resources Management. 36 (12), p. 4637–4676. https://doi.org/10.1007/s11269-022-03270-6
Novel hybrid deep learning model for satellite based PM10 forecasting in the most polluted Australian hotspots
Sharma, Ekta, Deo, Ravinesh C., Soar, Jeffrey, Prasad, Ramendra, Parisi, Alfio V. and Raj, Nawin. 2022. "Novel hybrid deep learning model for satellite based PM10 forecasting in the most polluted Australian hotspots." Atmospheric Environment. 279, pp. 1-13. https://doi.org/10.1016/j.atmosenv.2022.119111
Advanced extreme learning machines vs. deep learning models for peak wave energy period forecasting: A case study in Queensland, Australia
Ali, Mumtaz, Prasad, Ramendra, Xiang, Yong, Sankaran, Adarsh, Deo, Ravinesh C., Xiao, Fuyuan and Zhu, Shuyu. 2021. "Advanced extreme learning machines vs. deep learning models for peak wave energy period forecasting: A case study in Queensland, Australia." Renewable Energy. 177, pp. 1033-1044. https://doi.org/10.1016/j.renene.2021.06.052
Deep Air Quality Forecasts: Suspended Particulate Matter Modeling With Convolutional Neural and Long Short-Term Memory Networks
Sharma, Ekta, Deo, Ravinesh C., Prasad, Ramendra, Parisi, Alfio and Raj, Nawin. 2020. "Deep Air Quality Forecasts: Suspended Particulate Matter Modeling With Convolutional Neural and Long Short-Term Memory Networks." IEEE Access. 8, pp. 209503-209516. https://doi.org/10.1109/ACCESS.2020.3039002
Development of Flood Monitoring Index for daily flood risk evaluation: case studies in Fiji
Moishin, Mohammed, Deo, Ravinesh C., Prasad, Ramendra, Raj, Nawin and Abdulla, Shahab. 2021. "Development of Flood Monitoring Index for daily flood risk evaluation: case studies in Fiji." Stochastic Environmental Research and Risk Assessment. 35 (7), pp. 1387-1402. https://doi.org/10.1007/s00477-020-01899-6
Short-term electrical energy demand prediction under heat island effects using emotional neural network integrated with genetic algorithm
Karalasingham, Sagthitharan, Deo, Ravinesh and Prasad, Ramendra. 2021. "Short-term electrical energy demand prediction under heat island effects using emotional neural network integrated with genetic algorithm." Deo, Ravinesh, Samui, Pijush and Roy, Sanjiban Sekhar (ed.) Predictive modelling for energy management and power systems engineering. Amsterdam, Netherlands. Elsevier. pp. 271-298
Daily flood forecasts with intelligent data analytic models: multivariate empirical mode decomposition-based modeling methods
Prasad, Ramendra, Charan, Dhrishna, Joseph, Lionel, Nguyen-Huy, Thong, Deo, Ravinesh C. and Singh, Sanjay. 2021. "Daily flood forecasts with intelligent data analytic models: multivariate empirical mode decomposition-based modeling methods." Deo, Ravinesh C., Samui, Pijush, Kisi, Ozgur and Yaseen, Zaher Mundher (ed.) Intelligent data analytics for decision-support systems in hazard mitigation: theory and practice of hazard mitigation. Singapore. Springer. pp. 359-381
Bayesian Markov Chain Monte Carlo-based copulas: factoring the role of large-scale climate indices in monthly flood prediction
Nguyen-Huy, Thong, Deo, Ravinesh C., Yaseen, Zaher Mundher, Mushtaq, Shahbaz and Prasad, Ramendra. 2021. "Bayesian Markov Chain Monte Carlo-based copulas: factoring the role of large-scale climate indices in monthly flood prediction." Deo, Ravinesh C., Samui, Pijush, Kisi, Ozgur and Yaseen, Zaher Mundher (ed.) Intelligent data analytics for decision-support systems in hazard mitigation: theory and practice of hazard mitigation. Singapore. Springer. pp. 29-47
Near real-time significant wave height forecasting with hybridized multiple linear regression algorithms
Ali, Mumtaz, Prasad, Ramendra, Xiang, Yong and Deo, Ravinesh C.. 2020. "Near real-time significant wave height forecasting with hybridized multiple linear regression algorithms." Renewable and Sustainable Energy Reviews. 132. https://doi.org/10.1016/j.rser.2020.110003
A hybrid air quality early-warning framework: an hourly forecasting model with online sequential extreme learning machines and empirical mode decomposition algorithms
Sharma, Ekta, Deo, Ravinesh C., Prasad, Ramendra and Parisi, Alfio V.. 2020. "A hybrid air quality early-warning framework: an hourly forecasting model with online sequential extreme learning machines and empirical mode decomposition algorithms." Science of the Total Environment. 709, pp. 1-23. https://doi.org/10.1016/j.scitotenv.2019.135934
Significant wave height forecasting via an extreme learning machine model integrated with improved complete ensemble empirical mode decomposition
Ali, Mumtaz and Prasad, Ramendra. 2019. "Significant wave height forecasting via an extreme learning machine model integrated with improved complete ensemble empirical mode decomposition." Renewable and Sustainable Energy Reviews. 104, pp. 281-295. https://doi.org/10.1016/j.rser.2019.01.014
Designing a multi-stage multivariate empirical mode decomposition coupled with ant colony optimization and random forest model to forecast monthly solar radiation
Prasad, Ramendra, Ali, Mumtaz, Kwan, Paul and Khan, Huma. 2019. "Designing a multi-stage multivariate empirical mode decomposition coupled with ant colony optimization and random forest model to forecast monthly solar radiation." Applied Energy. 236, pp. 778-792. https://doi.org/10.1016/j.apenergy.2018.12.034
Weekly soil moisture forecasting with multivariate sequential, ensemble empirical mode decomposition and Boruta-random forest hybridizer algorithm approach
Prasad, Ramendra, Deo, Ravinesh C., Li, Yan and Maraseni, Tek. 2019. "Weekly soil moisture forecasting with multivariate sequential, ensemble empirical mode decomposition and Boruta-random forest hybridizer algorithm approach." Catena. 177, pp. 149-166. https://doi.org/10.1016/j.catena.2019.02.012
Soil moisture forecasting by a hybrid machine learning technique: ELM integrated with ensemble empirical mode decomposition
Prasad, Ramendra, Deo, Ravinesh C., Li, Yan and Maraseni, Tek. 2018. "Soil moisture forecasting by a hybrid machine learning technique: ELM integrated with ensemble empirical mode decomposition." Geoderma. 330, pp. 136-161. https://doi.org/10.1016/j.geoderma.2018.05.035
Ensemble committee-based data intelligent approach for generating soil moisture forecasts with multivariate hydro-meteorological predictors
Prasad, Ramendra, Deo, Ravinesh C., Li, Yan and Maraseni, Tek. 2018. "Ensemble committee-based data intelligent approach for generating soil moisture forecasts with multivariate hydro-meteorological predictors." Soil and Tillage Research. 181, pp. 63-81. https://doi.org/10.1016/j.still.2018.03.021
Input selection and performance optimization of ANN-based streamflow forecasts in the drought-prone Murray Darling Basin region using IIS and MODWT algorithm
Prasad, Ramendra, Deo, Ravinesh C., Li, Yan and Maraseni, Tek. 2017. "Input selection and performance optimization of ANN-based streamflow forecasts in the drought-prone Murray Darling Basin region using IIS and MODWT algorithm." Atmospheric Research. 197, pp. 42-63. https://doi.org/10.1016/j.atmosres.2017.06.014