Insights into enhanced machine learning techniques for surface water quantity and quality prediction based on data pre-processing algorithms

Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-su...

Full description

Saved in:
Bibliographic Details
Published inJournal of hydroinformatics Vol. 24; no. 4; pp. 875 - 897
Main Authors Panahi, Javad, Mastouri, Reza, Shabanlou, Saeid
Format Journal Article
LanguageEnglish
Published London IWA Publishing 01.07.2022
Subjects
Online AccessGet full text
ISSN1464-7141
1465-1734
DOI10.2166/hydro.2022.022

Cover

Loading…
Abstract Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-support of water resource management and water pollution control at the watershed scale due to the nonlinear path of streamflow. In the proposed hybrid model, a new time–frequency analysis algorithm, variational mode decomposition (VMD), is implemented to deal with the nonlinearity and nonstationary of a streamflow process. The VMD is exploited to decompose the original water quality and quantity series into a series of intrinsic mode functions (IMFs) with different frequencies. Therefore, an ensemble algorithm, bootstrap aggregating (bagging) algorithm is coupled with two common ML, reduced error pruning tree (REPT) and random forest (RF), to predict all the decomposed modes using VMD. Then, in order to reduce the variance among the base classifiers of the proposed ML, a bootstrap aggregation technique was recruited. Finally, the predicting value of the original water quality and quantity series is obtained by adding up the predicting results of all the decomposed modes. The proposed hybrid decomposition–ensemble model has been applied to two stations in Karoon River, Iran. Results obtained from this study indicate that the proposed hybrid decomposition–ensemble model can capture the nonlinear characteristics of a streamflow process in terms of water quality and quantity simultaneously and thus provide more accurate predicting results compared with those models without data frequency decomposing.
AbstractList Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-support of water resource management and water pollution control at the watershed scale due to the nonlinear path of streamflow. In the proposed hybrid model, a new time–frequency analysis algorithm, variational mode decomposition (VMD), is implemented to deal with the nonlinearity and nonstationary of a streamflow process. The VMD is exploited to decompose the original water quality and quantity series into a series of intrinsic mode functions (IMFs) with different frequencies. Therefore, an ensemble algorithm, bootstrap aggregating (bagging) algorithm is coupled with two common ML, reduced error pruning tree (REPT) and random forest (RF), to predict all the decomposed modes using VMD. Then, in order to reduce the variance among the base classifiers of the proposed ML, a bootstrap aggregation technique was recruited. Finally, the predicting value of the original water quality and quantity series is obtained by adding up the predicting results of all the decomposed modes. The proposed hybrid decomposition–ensemble model has been applied to two stations in Karoon River, Iran. Results obtained from this study indicate that the proposed hybrid decomposition–ensemble model can capture the nonlinear characteristics of a streamflow process in terms of water quality and quantity simultaneously and thus provide more accurate predicting results compared with those models without data frequency decomposing.
Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-support of water resource management and water pollution control at the watershed scale due to the nonlinear path of streamflow. In the proposed hybrid model, a new time–frequency analysis algorithm, variational mode decomposition (VMD), is implemented to deal with the nonlinearity and nonstationary of a streamflow process. The VMD is exploited to decompose the original water quality and quantity series into a series of intrinsic mode functions (IMFs) with different frequencies. Therefore, an ensemble algorithm, bootstrap aggregating (bagging) algorithm is coupled with two common ML, reduced error pruning tree (REPT) and random forest (RF), to predict all the decomposed modes using VMD. Then, in order to reduce the variance among the base classifiers of the proposed ML, a bootstrap aggregation technique was recruited. Finally, the predicting value of the original water quality and quantity series is obtained by adding up the predicting results of all the decomposed modes. The proposed hybrid decomposition–ensemble model has been applied to two stations in Karoon River, Iran. Results obtained from this study indicate that the proposed hybrid decomposition–ensemble model can capture the nonlinear characteristics of a streamflow process in terms of water quality and quantity simultaneously and thus provide more accurate predicting results compared with those models without data frequency decomposing. HIGHLIGHTS This study has applied machine learning (ML) techniques simultaneously for modelling river water quality.; A new ensemble ML technique has been developed to assess and predict two important water quality and quantity parameters.; The bagging ensemble algorithm is used with ML techniques, where it significantly raises the stability of models in improving accuracy.; A VMD data pre-processing technique is recommended to enhance the model's fidelity.;
Author Panahi, Javad
Shabanlou, Saeid
Mastouri, Reza
Author_xml – sequence: 1
  givenname: Javad
  surname: Panahi
  fullname: Panahi, Javad
– sequence: 2
  givenname: Reza
  surname: Mastouri
  fullname: Mastouri, Reza
– sequence: 3
  givenname: Saeid
  surname: Shabanlou
  fullname: Shabanlou, Saeid
BookMark eNp1UU1rGzEQFSWFJmmvPQt6Xldfu1ofS2gbQyCX9CzGI8krs5YcSab4P_RHV2uXHgoFDZoZzbw3mndHbmKKjpCPnK0EH4bP09nmtBJMiFWzN-SWq6HvuJbq5uKrTnPF35G7UvaMCS5Hfkt-bWIJu6kWGmJN1MUJIjpLD4BTiI7ODnIMcUerwymG15Mr1KdMyyl7QEd_QnWZvp4g1lDPFKJdgnnxj9nZgDWkSLdQGmZzLFRYHrpjTuhKWZBh3qUc6nQo78lbD3NxH_7c9-THt68vD4_d0_P3zcOXpw4lk7VD5_txi0xz8HZtcWQgwLXQtzMoJ1RLI0gl7DBK1CAkZ8KjYiP2vLfynmyuuDbB3hxzOEA-mwTBXBIp7wzkGnB2Zuy93MJ6LVAz5fvGJK2TGpF7q5jzDevTFav9aNlONft0yrGNb4TWWunGKFvV6lqFOZWSnf_LyplZ1DMX9cyinmnWGtQ_DRgqLMusGcL8v7bfhbOlvQ
CitedBy_id crossref_primary_10_2166_wst_2023_171
crossref_primary_10_3390_app132212147
crossref_primary_10_1016_j_jenvman_2024_122721
crossref_primary_10_2166_hydro_2022_114
crossref_primary_10_1007_s13201_024_02356_3
crossref_primary_10_1016_j_ecoinf_2024_102500
crossref_primary_10_2166_wst_2024_133
crossref_primary_10_2166_wqrj_2024_045
crossref_primary_10_3390_su16072995
crossref_primary_10_3390_w16243616
crossref_primary_10_1016_j_eswa_2024_124550
ContentType Journal Article
Copyright Copyright IWA Publishing Jul 2022
Copyright_xml – notice: Copyright IWA Publishing Jul 2022
DBID AAYXX
CITATION
7QH
7UA
AEUYN
AFKRA
ATCPS
AZQEC
BENPR
BHPHI
BKSAR
C1K
CCPQU
DWQXO
F1W
GNUQQ
H96
HCIFZ
L.G
PATMY
PCBAR
PHGZM
PHGZT
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
PYCSY
DOA
DOI 10.2166/hydro.2022.022
DatabaseName CrossRef
Aqualine
Water Resources Abstracts
ProQuest One Sustainability
ProQuest Central UK/Ireland
Agricultural & Environmental Science Collection
ProQuest Central Essentials
ProQuest Central
Natural Science Collection
Earth, Atmospheric & Aquatic Science Collection
Environmental Sciences and Pollution Management
ProQuest One
ProQuest Central Korea
ASFA: Aquatic Sciences and Fisheries Abstracts
ProQuest Central Student
Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources
SciTech Premium Collection
Aquatic Science & Fisheries Abstracts (ASFA) Professional
Environmental Science Database
Earth, Atmospheric & Aquatic Science Database
ProQuest Central Premium
ProQuest One Academic (New)
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Environmental Science Collection
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
Aquatic Science & Fisheries Abstracts (ASFA) Professional
ProQuest Central Student
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
SciTech Premium Collection
ProQuest One Community College
ProQuest Central China
Water Resources Abstracts
Environmental Sciences and Pollution Management
Earth, Atmospheric & Aquatic Science Collection
ProQuest Central
ProQuest One Sustainability
Natural Science Collection
ProQuest Central Korea
Agricultural & Environmental Science Collection
ProQuest Central (New)
ProQuest One Academic Eastern Edition
Earth, Atmospheric & Aquatic Science Database
Aqualine
Environmental Science Collection
Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources
ProQuest One Academic UKI Edition
ASFA: Aquatic Sciences and Fisheries Abstracts
Environmental Science Database
ProQuest One Academic
ProQuest One Academic (New)
DatabaseTitleList Aquatic Science & Fisheries Abstracts (ASFA) Professional

CrossRef
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: BENPR
  name: ProQuest Central (New)
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Geography
EISSN 1465-1734
EndPage 897
ExternalDocumentID oai_doaj_org_article_85f3ba992c704f50a23de37cc1fd40ef
10_2166_hydro_2022_022
GeographicLocations Iran
United States--US
China
GeographicLocations_xml – name: China
– name: Iran
– name: United States--US
GroupedDBID 0R~
4.4
5GY
7XC
8CJ
8FE
8FH
AAJMC
AAJVE
AAYXX
ABFYC
ABLGR
ACGFO
AECGI
AENEX
AEUYN
AFKRA
AFRAH
AJXRC
ALMA_UNASSIGNED_HOLDINGS
ATCPS
BENPR
BHPHI
BKSAR
CCPQU
CITATION
CS3
D1J
DU5
EBS
GEUZO
GROUPED_DOAJ
H13
HCIFZ
HFPTO
HZ~
L7B
O9-
OK1
P2P
PATMY
PCBAR
PHGZM
PHGZT
PYCSY
R0Z
RHI
~02
7QH
7UA
AZQEC
C1K
DWQXO
F1W
GNUQQ
H96
L.G
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
PUEGO
ID FETCH-LOGICAL-c303t-cef58bc071afd9dc80a2ae071f71f64e24fd9ca342d683c7a23102fc408c515d3
IEDL.DBID BENPR
ISSN 1464-7141
IngestDate Wed Aug 27 01:24:34 EDT 2025
Tue Jul 01 09:22:47 EDT 2025
Thu Apr 24 23:04:45 EDT 2025
Tue Jul 01 04:10:34 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c303t-cef58bc071afd9dc80a2ae071f71f64e24fd9ca342d683c7a23102fc408c515d3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
OpenAccessLink https://doaj.org/article/85f3ba992c704f50a23de37cc1fd40ef
PQID 2777475153
PQPubID 2044529
PageCount 23
ParticipantIDs doaj_primary_oai_doaj_org_article_85f3ba992c704f50a23de37cc1fd40ef
proquest_journals_2777475153
crossref_primary_10_2166_hydro_2022_022
crossref_citationtrail_10_2166_hydro_2022_022
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2022-07-01
20220701
PublicationDateYYYYMMDD 2022-07-01
PublicationDate_xml – month: 07
  year: 2022
  text: 2022-07-01
  day: 01
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
PublicationTitle Journal of hydroinformatics
PublicationYear 2022
Publisher IWA Publishing
Publisher_xml – name: IWA Publishing
SSID ssj0021381
Score 2.3391092
Snippet Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their...
SourceID doaj
proquest
crossref
SourceType Open Website
Aggregation Database
Enrichment Source
Index Database
StartPage 875
SubjectTerms Aggregation
Algorithms
bagging algorithm
Chemical oxygen demand
Datasets
Decision trees
Decomposition
decomposition–ensemble model
Economic development
Error reduction
Forecasting
Frequency analysis
Frequency dependence
Gene expression
Hydrology
karoon river
Learning algorithms
Machine learning
Neural networks
Nonlinear systems
Nonlinearity
Pollutants
Pollution control
Resource management
Runoff
Sediments
Stream discharge
Stream flow
Surface water
Time-frequency analysis
variational mode decomposition
Water pollution
Water pollution control
Water quality
water quality and quantity
Water resources
Water resources management
Water supply
Watershed management
Watersheds
Wavelet transforms
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA7iRS_iE9cXOQieqm2StulRRVFBTwreQpqHK2h3bbvI_gd_tDNpd1FEvEhbaNOQpvkm8yCTGUIOCwN2lY9lpB0uM-KcK6VJI1eAfMm9FTIgfXuXXT2Im8f08UuqL_QJ68IDdwN3IlPPS10UzOSx8GmsGbeO58Yk0FDsPHJfkHkzY6o3tRIe0pMCG0BvQ5F04RpZkmUnw6mtcdcfY8cxY9_EUYja_4MpB0lzuUpWehWRnnZdWyMLrlonS3228uF0g3xcVw3a1A19rtoRddUwLOPT1-AY6WifCeKJzgO0NhR0U9pMaq-No--gX9b0baJxi-6U6srSbm_llI5rXLhBsCjKN0vhBp1I8UU07jYVYMv65WlUP7fD12aTPFxe3J9fRX1WhciAuGoj43wqSwOqhfa2sEbCkGoHjx7OTDgmoNhoLpjNJDe5Rg2QeSNiwDBJLd8ii9WoctuEyozbFA4gg0TktpTS5YVMdFYAFxCFG5BoNrjK9CHHMfPFiwLTA8FQAQyFYCi4BuRoXn_cBdv4teYZYjWvhUGyQwGQjupJR_1FOgOyN0Na9TO3USwHhTiHH-U7__GNXbKMfe4cfPfIYltP3D6oMW15ECj2E1NP85Q
  priority: 102
  providerName: Directory of Open Access Journals
Title Insights into enhanced machine learning techniques for surface water quantity and quality prediction based on data pre-processing algorithms
URI https://www.proquest.com/docview/2777475153
https://doaj.org/article/85f3ba992c704f50a23de37cc1fd40ef
Volume 24
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3dS90wFA9OH9zLmB9jd3OSB8Gnapukbfo05lBUUGRM8C2k-bh3oL3Xtpdx_wf_aM9Jc-8QmbSFNg0lze_kfCQn5xByUBmwq3wqE-1wmRHHXC1NnrgK5EvprZAB6avr4vxWXN7ld3HCrYtulUueGBi1nRqcIz9mJSgqJUhf_n32mGDWKFxdjSk03pENYMESKHzj5PT65tfK5Mp4SFMK7AC9DkU2hG1kWVEcTxa2xd1_jB2ljL0QSyF6_yvmHCTO2UfyIaqK9MeA7RZZc8022YxZyyeLHfJ00XRoW3f0T9NPqWsmYTmfPgQHSUdjRogxXQVq7SjoqLSbt14bR_-CntnSx7nGrboLqhtLhz2WCzprcQEHQaMo5yyFG3QmxRfJbNhcgF_W92PopH7y0O2S27PT3z_Pk5hdITEgtvrEOJ_L2oCKob2trJGpZtrBo4ezEI4JKDaaC2YLyU2pURNk3ogUsMxyyz-R9WbauM-EyoLbHA4gh0yUtpbSlZXMdFEBNxCVG5Fk2bnKxNDjmAHjXoEJgmCoAIZCMBRcI3K4qj8bgm78t-YJYrWqhcGyQ8G0Has49pTMPa91VTEDrfE5_Ce3jpfQWKDF1PkR2VsireII7tQ_evvy9uuv5D22ZnDh3SPrfTt330BR6ev9SI37wdB_Bm4e7Kc
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR3LbtQw0CrbQ7kgnmKhgA8gTqGJ7STOASEKrXZpu0KolXozjh-7ldrsNsmqyj_wLXwjM3ksQghuVRIpsS3L9oznEc-DkNeZAb3KhzLQDo8Zcc_l0sSBy4C_pN4K2UL6ZJZMzsSX8_h8i_wcfGHQrHKgiS2htkuD_8j3WAqCSgrcl39YXQeYNQpPV4cUGh1aHLnmBlS26v30M8D3DWOHB6efJkGfVSAwQK7rwDgfy9wAa9XeZtbIUDPt4NPDnQjHBBQbzQWzieQm1SgBMW9ECHOIYsuh3ztkW3AQFUZke_9g9vXbRsWLeJsWFcgPWjmKqAsTyaIk2Vs0tkRvQ8behYz9wQbbbAF_MYOWwx3eJ_d60ZR-7HDpAdlyxUOy02dJXzSPyI9pUaEuX9GLol5SVyxa8wF61RpkOtpnoJjTTWDYioJMTKt16bVx9Abk2pJerzW6BjdUF5Z2Pp0NXZV4YIRIQpGvWgovaLyKFcGqc2bAnvXlHIBSL66qx-TsVtb9CRkVy8I9JVQm3MZwAfpFIrW5lC7NZKSTDKiPyNyYBMPiKtOHOseMG5cKVB4EhmqBoRAYCp4xebtpv-qCfPyz5T7CatMKg3O3Bctyrvq9rmTsea6zjBkYjY9hntw6nsJgAfdD58dkd4C06ilGpX7j97P_V78iO5PTk2N1PJ0dPSd3cWSd-fAuGdXl2r0AIanOX_aYScn3294MvwAgqilW
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR3batRA9FC3oL6IV1ytOg-KT3GTyW3yIGJtl67VpYiFvo2TuewKbXabZCn5B7_Ir_OcXFZE9K0kgWRmGCY598y5ALzMNNpVzheesrTNSDSXCx17NkP5kjoTiRbSn-fJ0Wn08Sw-24GfQywMuVUOPLFl1Gal6R_5hKeoqKQofcOJ690iTg6m79aXHlWQop3WoZxGhyLHtrlC8616OztAWL_ifHr49cOR11cY8DSy7trT1sUi1yhmlTOZ0cJXXFl8dHgmkeURNmsVRtwkItSpIm2IOx35-D5BbEKc9wbsphQ-OoLd_cP5yZetuReEbYlUZEXk8RgFXcpIHiTJZNmYkiIPOX_jc_6HSGwrB_wlGFppN70Ld3o1lb3v8Ooe7NjiPtzqK6YvmwfwY1ZUZNdX7HtRr5gtlq0rAbtonTMt66tRLNg2SWzFUD9m1aZ0Slt2hTpuyS43isKEG6YKw7r4zoatS9o8IoRhJGMNwxtyZKUOb90FNtDM6nyBQKmXF9VDOL2W7_4IRsWqsI-BiSQ0MR6IikGUmlwIm2YiUEmGnCjK7Bi84eNK3ac9p-ob5xLNHwKGbIEhCRgSrzG83o5fdwk__jlyn2C1HUWJutuGVbmQPd1LEbswV1nGNa7GxfieobFhiotFOvCtG8PeAGnZc49K_sb1J__vfgE3kQjkp9n8-CncpoV1nsR7MKrLjX2G-lKdP-8Rk8G366aFX6roLZQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Insights+into+enhanced+machine+learning+techniques+for+surface+water+quantity+and+quality+prediction+based+on+data+pre-processing+algorithms&rft.jtitle=Journal+of+hydroinformatics&rft.au=Panahi%2C+Javad&rft.au=Mastouri%2C+Reza&rft.au=Shabanlou%2C+Saeid&rft.date=2022-07-01&rft.pub=IWA+Publishing&rft.issn=1464-7141&rft.eissn=1465-1734&rft.volume=24&rft.issue=4&rft.spage=875&rft_id=info:doi/10.2166%2Fhydro.2022.022&rft.externalDBID=HAS_PDF_LINK
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1464-7141&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1464-7141&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1464-7141&client=summon