Insights into enhanced machine learning techniques for surface water quantity and quality prediction based on data pre-processing algorithms
Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-su...
Saved in:
Published in | Journal of hydroinformatics Vol. 24; no. 4; pp. 875 - 897 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
London
IWA Publishing
01.07.2022
|
Subjects | |
Online Access | Get full text |
ISSN | 1464-7141 1465-1734 |
DOI | 10.2166/hydro.2022.022 |
Cover
Loading…
Abstract | Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-support of water resource management and water pollution control at the watershed scale due to the nonlinear path of streamflow. In the proposed hybrid model, a new time–frequency analysis algorithm, variational mode decomposition (VMD), is implemented to deal with the nonlinearity and nonstationary of a streamflow process. The VMD is exploited to decompose the original water quality and quantity series into a series of intrinsic mode functions (IMFs) with different frequencies. Therefore, an ensemble algorithm, bootstrap aggregating (bagging) algorithm is coupled with two common ML, reduced error pruning tree (REPT) and random forest (RF), to predict all the decomposed modes using VMD. Then, in order to reduce the variance among the base classifiers of the proposed ML, a bootstrap aggregation technique was recruited. Finally, the predicting value of the original water quality and quantity series is obtained by adding up the predicting results of all the decomposed modes. The proposed hybrid decomposition–ensemble model has been applied to two stations in Karoon River, Iran. Results obtained from this study indicate that the proposed hybrid decomposition–ensemble model can capture the nonlinear characteristics of a streamflow process in terms of water quality and quantity simultaneously and thus provide more accurate predicting results compared with those models without data frequency decomposing. |
---|---|
AbstractList | Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-support of water resource management and water pollution control at the watershed scale due to the nonlinear path of streamflow. In the proposed hybrid model, a new time–frequency analysis algorithm, variational mode decomposition (VMD), is implemented to deal with the nonlinearity and nonstationary of a streamflow process. The VMD is exploited to decompose the original water quality and quantity series into a series of intrinsic mode functions (IMFs) with different frequencies. Therefore, an ensemble algorithm, bootstrap aggregating (bagging) algorithm is coupled with two common ML, reduced error pruning tree (REPT) and random forest (RF), to predict all the decomposed modes using VMD. Then, in order to reduce the variance among the base classifiers of the proposed ML, a bootstrap aggregation technique was recruited. Finally, the predicting value of the original water quality and quantity series is obtained by adding up the predicting results of all the decomposed modes. The proposed hybrid decomposition–ensemble model has been applied to two stations in Karoon River, Iran. Results obtained from this study indicate that the proposed hybrid decomposition–ensemble model can capture the nonlinear characteristics of a streamflow process in terms of water quality and quantity simultaneously and thus provide more accurate predicting results compared with those models without data frequency decomposing. Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their nonstationarity and uncertainty path. This paper presented an ensemble data pre-processing-based machine learning (ML) algorithm for the decision-support of water resource management and water pollution control at the watershed scale due to the nonlinear path of streamflow. In the proposed hybrid model, a new time–frequency analysis algorithm, variational mode decomposition (VMD), is implemented to deal with the nonlinearity and nonstationary of a streamflow process. The VMD is exploited to decompose the original water quality and quantity series into a series of intrinsic mode functions (IMFs) with different frequencies. Therefore, an ensemble algorithm, bootstrap aggregating (bagging) algorithm is coupled with two common ML, reduced error pruning tree (REPT) and random forest (RF), to predict all the decomposed modes using VMD. Then, in order to reduce the variance among the base classifiers of the proposed ML, a bootstrap aggregation technique was recruited. Finally, the predicting value of the original water quality and quantity series is obtained by adding up the predicting results of all the decomposed modes. The proposed hybrid decomposition–ensemble model has been applied to two stations in Karoon River, Iran. Results obtained from this study indicate that the proposed hybrid decomposition–ensemble model can capture the nonlinear characteristics of a streamflow process in terms of water quality and quantity simultaneously and thus provide more accurate predicting results compared with those models without data frequency decomposing. HIGHLIGHTS This study has applied machine learning (ML) techniques simultaneously for modelling river water quality.; A new ensemble ML technique has been developed to assess and predict two important water quality and quantity parameters.; The bagging ensemble algorithm is used with ML techniques, where it significantly raises the stability of models in improving accuracy.; A VMD data pre-processing technique is recommended to enhance the model's fidelity.; |
Author | Panahi, Javad Shabanlou, Saeid Mastouri, Reza |
Author_xml | – sequence: 1 givenname: Javad surname: Panahi fullname: Panahi, Javad – sequence: 2 givenname: Reza surname: Mastouri fullname: Mastouri, Reza – sequence: 3 givenname: Saeid surname: Shabanlou fullname: Shabanlou, Saeid |
BookMark | eNp1UU1rGzEQFSWFJmmvPQt6Xldfu1ofS2gbQyCX9CzGI8krs5YcSab4P_RHV2uXHgoFDZoZzbw3mndHbmKKjpCPnK0EH4bP09nmtBJMiFWzN-SWq6HvuJbq5uKrTnPF35G7UvaMCS5Hfkt-bWIJu6kWGmJN1MUJIjpLD4BTiI7ODnIMcUerwymG15Mr1KdMyyl7QEd_QnWZvp4g1lDPFKJdgnnxj9nZgDWkSLdQGmZzLFRYHrpjTuhKWZBh3qUc6nQo78lbD3NxH_7c9-THt68vD4_d0_P3zcOXpw4lk7VD5_txi0xz8HZtcWQgwLXQtzMoJ1RLI0gl7DBK1CAkZ8KjYiP2vLfynmyuuDbB3hxzOEA-mwTBXBIp7wzkGnB2Zuy93MJ6LVAz5fvGJK2TGpF7q5jzDevTFav9aNlONft0yrGNb4TWWunGKFvV6lqFOZWSnf_LyplZ1DMX9cyinmnWGtQ_DRgqLMusGcL8v7bfhbOlvQ |
CitedBy_id | crossref_primary_10_2166_wst_2023_171 crossref_primary_10_3390_app132212147 crossref_primary_10_1016_j_jenvman_2024_122721 crossref_primary_10_2166_hydro_2022_114 crossref_primary_10_1007_s13201_024_02356_3 crossref_primary_10_1016_j_ecoinf_2024_102500 crossref_primary_10_2166_wst_2024_133 crossref_primary_10_2166_wqrj_2024_045 crossref_primary_10_3390_su16072995 crossref_primary_10_3390_w16243616 crossref_primary_10_1016_j_eswa_2024_124550 |
ContentType | Journal Article |
Copyright | Copyright IWA Publishing Jul 2022 |
Copyright_xml | – notice: Copyright IWA Publishing Jul 2022 |
DBID | AAYXX CITATION 7QH 7UA AEUYN AFKRA ATCPS AZQEC BENPR BHPHI BKSAR C1K CCPQU DWQXO F1W GNUQQ H96 HCIFZ L.G PATMY PCBAR PHGZM PHGZT PKEHL PQEST PQQKQ PQUKI PRINS PYCSY DOA |
DOI | 10.2166/hydro.2022.022 |
DatabaseName | CrossRef Aqualine Water Resources Abstracts ProQuest One Sustainability ProQuest Central UK/Ireland Agricultural & Environmental Science Collection ProQuest Central Essentials ProQuest Central Natural Science Collection Earth, Atmospheric & Aquatic Science Collection Environmental Sciences and Pollution Management ProQuest One ProQuest Central Korea ASFA: Aquatic Sciences and Fisheries Abstracts ProQuest Central Student Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources SciTech Premium Collection Aquatic Science & Fisheries Abstracts (ASFA) Professional Environmental Science Database Earth, Atmospheric & Aquatic Science Database ProQuest Central Premium ProQuest One Academic (New) ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Environmental Science Collection DOAJ Directory of Open Access Journals |
DatabaseTitle | CrossRef Aquatic Science & Fisheries Abstracts (ASFA) Professional ProQuest Central Student ProQuest One Academic Middle East (New) ProQuest Central Essentials SciTech Premium Collection ProQuest One Community College ProQuest Central China Water Resources Abstracts Environmental Sciences and Pollution Management Earth, Atmospheric & Aquatic Science Collection ProQuest Central ProQuest One Sustainability Natural Science Collection ProQuest Central Korea Agricultural & Environmental Science Collection ProQuest Central (New) ProQuest One Academic Eastern Edition Earth, Atmospheric & Aquatic Science Database Aqualine Environmental Science Collection Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources ProQuest One Academic UKI Edition ASFA: Aquatic Sciences and Fisheries Abstracts Environmental Science Database ProQuest One Academic ProQuest One Academic (New) |
DatabaseTitleList | Aquatic Science & Fisheries Abstracts (ASFA) Professional CrossRef |
Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: BENPR name: ProQuest Central (New) url: https://www.proquest.com/central sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Geography |
EISSN | 1465-1734 |
EndPage | 897 |
ExternalDocumentID | oai_doaj_org_article_85f3ba992c704f50a23de37cc1fd40ef 10_2166_hydro_2022_022 |
GeographicLocations | Iran United States--US China |
GeographicLocations_xml | – name: China – name: Iran – name: United States--US |
GroupedDBID | 0R~ 4.4 5GY 7XC 8CJ 8FE 8FH AAJMC AAJVE AAYXX ABFYC ABLGR ACGFO AECGI AENEX AEUYN AFKRA AFRAH AJXRC ALMA_UNASSIGNED_HOLDINGS ATCPS BENPR BHPHI BKSAR CCPQU CITATION CS3 D1J DU5 EBS GEUZO GROUPED_DOAJ H13 HCIFZ HFPTO HZ~ L7B O9- OK1 P2P PATMY PCBAR PHGZM PHGZT PYCSY R0Z RHI ~02 7QH 7UA AZQEC C1K DWQXO F1W GNUQQ H96 L.G PKEHL PQEST PQQKQ PQUKI PRINS PUEGO |
ID | FETCH-LOGICAL-c303t-cef58bc071afd9dc80a2ae071f71f64e24fd9ca342d683c7a23102fc408c515d3 |
IEDL.DBID | BENPR |
ISSN | 1464-7141 |
IngestDate | Wed Aug 27 01:24:34 EDT 2025 Tue Jul 01 09:22:47 EDT 2025 Thu Apr 24 23:04:45 EDT 2025 Tue Jul 01 04:10:34 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c303t-cef58bc071afd9dc80a2ae071f71f64e24fd9ca342d683c7a23102fc408c515d3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
OpenAccessLink | https://doaj.org/article/85f3ba992c704f50a23de37cc1fd40ef |
PQID | 2777475153 |
PQPubID | 2044529 |
PageCount | 23 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_85f3ba992c704f50a23de37cc1fd40ef proquest_journals_2777475153 crossref_primary_10_2166_hydro_2022_022 crossref_citationtrail_10_2166_hydro_2022_022 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2022-07-01 20220701 |
PublicationDateYYYYMMDD | 2022-07-01 |
PublicationDate_xml | – month: 07 year: 2022 text: 2022-07-01 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | London |
PublicationPlace_xml | – name: London |
PublicationTitle | Journal of hydroinformatics |
PublicationYear | 2022 |
Publisher | IWA Publishing |
Publisher_xml | – name: IWA Publishing |
SSID | ssj0021381 |
Score | 2.3391092 |
Snippet | Quality and quantity of streamflow are crucial components in the management and control of water resources according which are challenging due to their... |
SourceID | doaj proquest crossref |
SourceType | Open Website Aggregation Database Enrichment Source Index Database |
StartPage | 875 |
SubjectTerms | Aggregation Algorithms bagging algorithm Chemical oxygen demand Datasets Decision trees Decomposition decomposition–ensemble model Economic development Error reduction Forecasting Frequency analysis Frequency dependence Gene expression Hydrology karoon river Learning algorithms Machine learning Neural networks Nonlinear systems Nonlinearity Pollutants Pollution control Resource management Runoff Sediments Stream discharge Stream flow Surface water Time-frequency analysis variational mode decomposition Water pollution Water pollution control Water quality water quality and quantity Water resources Water resources management Water supply Watershed management Watersheds Wavelet transforms |
SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA7iRS_iE9cXOQieqm2StulRRVFBTwreQpqHK2h3bbvI_gd_tDNpd1FEvEhbaNOQpvkm8yCTGUIOCwN2lY9lpB0uM-KcK6VJI1eAfMm9FTIgfXuXXT2Im8f08UuqL_QJ68IDdwN3IlPPS10UzOSx8GmsGbeO58Yk0FDsPHJfkHkzY6o3tRIe0pMCG0BvQ5F04RpZkmUnw6mtcdcfY8cxY9_EUYja_4MpB0lzuUpWehWRnnZdWyMLrlonS3228uF0g3xcVw3a1A19rtoRddUwLOPT1-AY6WifCeKJzgO0NhR0U9pMaq-No--gX9b0baJxi-6U6srSbm_llI5rXLhBsCjKN0vhBp1I8UU07jYVYMv65WlUP7fD12aTPFxe3J9fRX1WhciAuGoj43wqSwOqhfa2sEbCkGoHjx7OTDgmoNhoLpjNJDe5Rg2QeSNiwDBJLd8ii9WoctuEyozbFA4gg0TktpTS5YVMdFYAFxCFG5BoNrjK9CHHMfPFiwLTA8FQAQyFYCi4BuRoXn_cBdv4teYZYjWvhUGyQwGQjupJR_1FOgOyN0Na9TO3USwHhTiHH-U7__GNXbKMfe4cfPfIYltP3D6oMW15ECj2E1NP85Q priority: 102 providerName: Directory of Open Access Journals |
Title | Insights into enhanced machine learning techniques for surface water quantity and quality prediction based on data pre-processing algorithms |
URI | https://www.proquest.com/docview/2777475153 https://doaj.org/article/85f3ba992c704f50a23de37cc1fd40ef |
Volume | 24 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3dS90wFA9OH9zLmB9jd3OSB8Gnapukbfo05lBUUGRM8C2k-bh3oL3Xtpdx_wf_aM9Jc-8QmbSFNg0lze_kfCQn5xByUBmwq3wqE-1wmRHHXC1NnrgK5EvprZAB6avr4vxWXN7ld3HCrYtulUueGBi1nRqcIz9mJSgqJUhf_n32mGDWKFxdjSk03pENYMESKHzj5PT65tfK5Mp4SFMK7AC9DkU2hG1kWVEcTxa2xd1_jB2ljL0QSyF6_yvmHCTO2UfyIaqK9MeA7RZZc8022YxZyyeLHfJ00XRoW3f0T9NPqWsmYTmfPgQHSUdjRogxXQVq7SjoqLSbt14bR_-CntnSx7nGrboLqhtLhz2WCzprcQEHQaMo5yyFG3QmxRfJbNhcgF_W92PopH7y0O2S27PT3z_Pk5hdITEgtvrEOJ_L2oCKob2trJGpZtrBo4ezEI4JKDaaC2YLyU2pURNk3ogUsMxyyz-R9WbauM-EyoLbHA4gh0yUtpbSlZXMdFEBNxCVG5Fk2bnKxNDjmAHjXoEJgmCoAIZCMBRcI3K4qj8bgm78t-YJYrWqhcGyQ8G0Has49pTMPa91VTEDrfE5_Ce3jpfQWKDF1PkR2VsireII7tQ_evvy9uuv5D22ZnDh3SPrfTt330BR6ev9SI37wdB_Bm4e7Kc |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR3LbtQw0CrbQ7kgnmKhgA8gTqGJ7STOASEKrXZpu0KolXozjh-7ldrsNsmqyj_wLXwjM3ksQghuVRIpsS3L9oznEc-DkNeZAb3KhzLQDo8Zcc_l0sSBy4C_pN4K2UL6ZJZMzsSX8_h8i_wcfGHQrHKgiS2htkuD_8j3WAqCSgrcl39YXQeYNQpPV4cUGh1aHLnmBlS26v30M8D3DWOHB6efJkGfVSAwQK7rwDgfy9wAa9XeZtbIUDPt4NPDnQjHBBQbzQWzieQm1SgBMW9ECHOIYsuh3ztkW3AQFUZke_9g9vXbRsWLeJsWFcgPWjmKqAsTyaIk2Vs0tkRvQ8behYz9wQbbbAF_MYOWwx3eJ_d60ZR-7HDpAdlyxUOy02dJXzSPyI9pUaEuX9GLol5SVyxa8wF61RpkOtpnoJjTTWDYioJMTKt16bVx9Abk2pJerzW6BjdUF5Z2Pp0NXZV4YIRIQpGvWgovaLyKFcGqc2bAnvXlHIBSL66qx-TsVtb9CRkVy8I9JVQm3MZwAfpFIrW5lC7NZKSTDKiPyNyYBMPiKtOHOseMG5cKVB4EhmqBoRAYCp4xebtpv-qCfPyz5T7CatMKg3O3Bctyrvq9rmTsea6zjBkYjY9hntw6nsJgAfdD58dkd4C06ilGpX7j97P_V78iO5PTk2N1PJ0dPSd3cWSd-fAuGdXl2r0AIanOX_aYScn3294MvwAgqilW |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR3batRA9FC3oL6IV1ytOg-KT3GTyW3yIGJtl67VpYiFvo2TuewKbXabZCn5B7_Ir_OcXFZE9K0kgWRmGCY598y5ALzMNNpVzheesrTNSDSXCx17NkP5kjoTiRbSn-fJ0Wn08Sw-24GfQywMuVUOPLFl1Gal6R_5hKeoqKQofcOJ690iTg6m79aXHlWQop3WoZxGhyLHtrlC8616OztAWL_ifHr49cOR11cY8DSy7trT1sUi1yhmlTOZ0cJXXFl8dHgmkeURNmsVRtwkItSpIm2IOx35-D5BbEKc9wbsphQ-OoLd_cP5yZetuReEbYlUZEXk8RgFXcpIHiTJZNmYkiIPOX_jc_6HSGwrB_wlGFppN70Ld3o1lb3v8Ooe7NjiPtzqK6YvmwfwY1ZUZNdX7HtRr5gtlq0rAbtonTMt66tRLNg2SWzFUD9m1aZ0Slt2hTpuyS43isKEG6YKw7r4zoatS9o8IoRhJGMNwxtyZKUOb90FNtDM6nyBQKmXF9VDOL2W7_4IRsWqsI-BiSQ0MR6IikGUmlwIm2YiUEmGnCjK7Bi84eNK3ac9p-ob5xLNHwKGbIEhCRgSrzG83o5fdwk__jlyn2C1HUWJutuGVbmQPd1LEbswV1nGNa7GxfieobFhiotFOvCtG8PeAGnZc49K_sb1J__vfgE3kQjkp9n8-CncpoV1nsR7MKrLjX2G-lKdP-8Rk8G366aFX6roLZQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Insights+into+enhanced+machine+learning+techniques+for+surface+water+quantity+and+quality+prediction+based+on+data+pre-processing+algorithms&rft.jtitle=Journal+of+hydroinformatics&rft.au=Panahi%2C+Javad&rft.au=Mastouri%2C+Reza&rft.au=Shabanlou%2C+Saeid&rft.date=2022-07-01&rft.pub=IWA+Publishing&rft.issn=1464-7141&rft.eissn=1465-1734&rft.volume=24&rft.issue=4&rft.spage=875&rft_id=info:doi/10.2166%2Fhydro.2022.022&rft.externalDBID=HAS_PDF_LINK |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1464-7141&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1464-7141&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1464-7141&client=summon |