Data potential and feasibility study with Grid Mean Algorithm

The Grid Mean Algorithm is a computational approach designed to evaluate regression metrics such as coefficient of determination (R2), mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) directly on tabular data without the need to train machine learni...

Full description

Saved in:
Bibliographic Details
Published inMathematical Modeling and Computing Vol. 12; no. 1; pp. 331 - 341
Main Authors Holdovanskyi, V. A., Alieksieiev, V. I.
Format Journal Article
LanguageEnglish
Published 2025
Online AccessGet full text

Cover

Loading…
More Information
Summary:The Grid Mean Algorithm is a computational approach designed to evaluate regression metrics such as coefficient of determination (R2), mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) directly on tabular data without the need to train machine learning (ML) models. This method enables researchers and practitioners to assess the potential of data for regression tasks, estimate the feasibility of ML projects, and make informed decisions about resource allocation. Additionally, the algorithm allows for estimating the approximate accuracy limit achievable with the given data, making it a valuable criterion for determining the optimality of a model. By addressing whether further research stages are necessary or redundant, it provides a practical tool for planning ML experiments and evaluating the economic viability of investing in such models. Experiments on synthetic datasets demonstrate the method's capability to produce accurate metric estimates across various functional forms and noise levels, making it a robust choice for initial data exploration and ML project planning.
ISSN:2312-9794
2415-3788
DOI:10.23939/mmc2025.01.331