Neural network systems with an integrated coefficient of variation-based feature selection for stock price and trend prediction

Stock market forecasting has been a subject of interest for many researchers; the essential market analyses can be integrated with historical stock market data to derive a set of features. It is crucial to select features with useful information about the specific aspect. In this article, we propose...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 219; p. 119527
Main Authors Chaudhari, Kinjal, Thakkar, Ankit
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Stock market forecasting has been a subject of interest for many researchers; the essential market analyses can be integrated with historical stock market data to derive a set of features. It is crucial to select features with useful information about the specific aspect. In this article, we propose coefficient of variation (CV)-based feature selection for stock prediction. The unitless statistical method, CV, is widely used to obtain variability among data distributions. We calculate CV for each feature and integrate an existing method, k-means algorithm, as well as proposed methods, median range and top-M, to select a set of features with specific characteristics such as features belonging to the largest cluster, the defined range, and with the highest CV values, respectively. We apply the set of selected features to models such as backpropagation neural network (BPNN), long short-term memory (LSTM), gated recurrent unit (GRU), and convolutional neural network (CNN) for stock price and trend prediction. We demonstrate the applicability of our proposed approach using five of the existing feature selection methods, namely, correlation coefficient, Chi2, mutual information, principal component analysis, and variance threshold; comparison indicates remarkable performance enhancement using several accuracy-based, as well as error-based, metrics and the same is statistically supported using Wilcoxon signed-rank test. •Statistical method, coefficient of variation (CV) is proposed for feature selection.•k-means algorithm, median range, and top-M are applied on the CV values of features.•Selected features are applied to neural network for stock price and trend prediction.•Comparative analysis with five existing feature selection methods is demonstrated.•Statistical significance using Wilcoxon signed-rank test is provided.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2023.119527