Dimensionality Reduction for Water Quality Prediction from a Data Mining Perspective
Biochemical oxygen demand (BOD) is the measurement of the amount of dissolved oxygen used by aerobic microbes for oxidizing organic matter in water bodies and used for analyzing the water quality. The actual BOD prediction method is cumbersome. Instead an automatic prediction model is required that...
Saved in:
Published in | Modelling, Simulation and Intelligent Computing Vol. 659; pp. 255 - 263 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Singapore
Springer Singapore Pte. Limited
2020
Springer Singapore |
Series | Lecture Notes in Electrical Engineering |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Biochemical oxygen demand (BOD) is the measurement of the amount of dissolved oxygen used by aerobic microbes for oxidizing organic matter in water bodies and used for analyzing the water quality. The actual BOD prediction method is cumbersome. Instead an automatic prediction model is required that is accurate, faster and less expensive. This paper presents a data-driven model for predicting BOD, in a lower-dimensional space obtained using dimensionality reduction techniques that help remove irrelevant properties of high-dimensional data. Machine learning algorithms, namely decision stump, SVM, MLP, linear regression (LR), and instance-based learner (IBK), were trained with the full dataset with 11 parameters. The training set was later transformed into a lower-dimensional space using principal component analysis (PCA) and correlation-based feature selection (CFS). The performance of the learners on the full training set and transformed dataset was analyzed using correlation coefficient, RMSE, and MAE. The algorithms are able to preserve their predictive accuracy on the lower-dimensional space. |
---|---|
ISBN: | 9811547742 9789811547744 |
ISSN: | 1876-1100 1876-1119 |
DOI: | 10.1007/978-981-15-4775-1_28 |