Data potential and feasibility study with Grid Mean Algorithm

The Grid Mean Algorithm is a computational approach designed to evaluate regression metrics such as coefficient of determination (R2), mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) directly on tabular data without the need to train machine learni...

Full description

Saved in:

Bibliographic Details
Published in	Mathematical Modeling and Computing Vol. 12; no. 1; pp. 331 - 341
Main Authors	Holdovanskyi, V. A., Alieksieiev, V. I.
Format	Journal Article
Language	English
Published	2025
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The Grid Mean Algorithm is a computational approach designed to evaluate regression metrics such as coefficient of determination (R2), mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) directly on tabular data without the need to train machine learning (ML) models. This method enables researchers and practitioners to assess the potential of data for regression tasks, estimate the feasibility of ML projects, and make informed decisions about resource allocation. Additionally, the algorithm allows for estimating the approximate accuracy limit achievable with the given data, making it a valuable criterion for determining the optimality of a model. By addressing whether further research stages are necessary or redundant, it provides a practical tool for planning ML experiments and evaluating the economic viability of investing in such models. Experiments on synthetic datasets demonstrate the method's capability to produce accurate metric estimates across various functional forms and noise levels, making it a robust choice for initial data exploration and ML project planning.
ISSN:	2312-9794 2415-3788
DOI:	10.23939/mmc2025.01.331