CCPortal
DOI10.1007/s11069-021-04939-8
An evaluation of various data pre-processing techniques with machine learning models for water level prediction
Tiu E.S.K.; Huang Y.F.; Ng J.L.; AlDahoul N.; Ahmed A.N.; Elshafie A.
发表日期2021
ISSN0921030X
英文摘要Floods are the most frequent type of natural disaster. It destroys wildlife habitat, damages bridges, railways, roads, properties, and puts millions of people at risk. As such, flood detection systems have been developed to monitor the changes of water level and raise an alarm should there be imminent danger. River water level prediction is a significant task in flood mitigation planning and floodplains management. Usually, using raw data of rainfall series directly with machine learning (ML) regression methods, does not result in sufficiently good prediction accuracy. The raw data should be pre-processed using specific techniques to enhance their quality a priori to being applied to the prediction methods. This paper serves to address the stated problem by utilizing various data pre-processing techniques such as the Variational Mode Decomposition (VMD), Bagging, Boosting, Bagging-VMD, and Boosting-VMD to enhance the quality of input data and thus culminating in improved model accuracy. The five proposed pre-processing techniques were applied to the observed daily rainfall series of the Dungun river basin, Malaysia, for the period starting from November to February (Northeast Monsoon) from 1996 to 2016. Two machine learning models, the base models (Ori), that is the artificial neural network (ANN) and the support vector regression (SVR), were used in conjunction with the data pre-processing methods. The comparison between the ML methods with and without data pre-processing was done. It was found that prediction of water levels with the two ML methods of SVR and ANN together with the Boosting-VMD was superior to those results derived with just the base original model (Ori). The advantage of the enhanced models (respectively, founded on SVR and ANN) over the original models (SVR and ANN) is best reflected in the performance statistics. Numerical results in terms of root mean square error (RMSE) of (0.42, 0.20 vs 1.85,1.82), mean absolute percentage error (MAPE) of (4.36, 2.82 vs 18.89, 22.56), mean absolute error (MAE) of (0.28,0.16 vs 1.25, 1.41), and Nash–Sutcliffe efficiency coefficient (NSE) (0.96, 0.99 vs 0.25, 0.27) were obtained for the respective models. Additionally, various data visualization graphs such as hydrographs, residual hydrographs, peak-estimates, and box and whisker plots were illustrated to compare between various data pre-processing techniques. The experimental results showed that both the Boosting and the Boosting-VMD methods showed better performance over the other techniques. The Boosting-ANN model was found to be the better model to predict river water levels with the lowest RMSE (0.19), MAPE (2.72), and MAE (0.15) and the highest NSE (0.99). © 2021, The Author(s), under exclusive licence to Springer Nature B.V.
关键词Artificial neural networkBaggingBoostingRiver water level predictionSupport vector regressionVariational Mode Decomposition
语种英语
来源期刊Natural Hazards
文献类型期刊论文
条目标识符http://gcip.llas.ac.cn/handle/2XKMVOVA/206609
作者单位Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Kajang, Cheras, Selangor 43000, Malaysia; Department of Civil Engineering, Faculty of Engineering, Technology and Built Environment, UCSI University, Cheras, Kuala Lumpur, 56000, Malaysia; Faculty of Engineering, Multimedia University, Cyberjaya, 63100, Malaysia; Institute of Energy Infrastructure (IEI), Department of Civil Engineering, College of Engineering, University Tenaga Nasional (UNITEN), Selangor, 43000, Malaysia; Department of Civil Engineering, Faculty of Engineering, University of Malaya, Kuala Lumpur, 50603, Malaysia
推荐引用方式
GB/T 7714
Tiu E.S.K.,Huang Y.F.,Ng J.L.,et al. An evaluation of various data pre-processing techniques with machine learning models for water level prediction[J],2021.
APA Tiu E.S.K.,Huang Y.F.,Ng J.L.,AlDahoul N.,Ahmed A.N.,&Elshafie A..(2021).An evaluation of various data pre-processing techniques with machine learning models for water level prediction.Natural Hazards.
MLA Tiu E.S.K.,et al."An evaluation of various data pre-processing techniques with machine learning models for water level prediction".Natural Hazards (2021).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Tiu E.S.K.]的文章
[Huang Y.F.]的文章
[Ng J.L.]的文章
百度学术
百度学术中相似的文章
[Tiu E.S.K.]的文章
[Huang Y.F.]的文章
[Ng J.L.]的文章
必应学术
必应学术中相似的文章
[Tiu E.S.K.]的文章
[Huang Y.F.]的文章
[Ng J.L.]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。