A label learning approach using competitive population optimization algorithm feature selection to improve multi-label classification algorithms

One of the crucial pre-processing stages in data mining and machine learning is feature selection, which is used to choose a subset of representative characteristics and decrease dimensions. By eliminating unnecessary and redundant features, feature selection can improve machine learning tasks’ accu...

Full description

Saved in:

Bibliographic Details
Published in	Journal of King Saud University. Computer and information sciences Vol. 36; no. 5; p. 102083
Main Author	Cui, Lianhe
Format	Journal Article
Language	English
Published	Elsevier B.V 01.06.2024 Elsevier
Subjects	Competitive swarm optimizer Feature selection Multi-label data Reconstruction error Sparse representation Sparse representation Feature selection Competitive swarm optimizer Multi-label data Reconstruction error
Online Access	Get full text

Cover

Loading…

More Information
Summary:	One of the crucial pre-processing stages in data mining and machine learning is feature selection, which is used to choose a subset of representative characteristics and decrease dimensions. By eliminating unnecessary and redundant features, feature selection can improve machine learning tasks’ accuracy. This work presents a novel multi-label classification (MLC) model utilizing a combination of stack regression (RR) and original label space transformation (IPLST) called RR-IPLST (original label space transformation-ridge regression). A novel embedded technique is implemented, utilizing competitive crowding optimizer (CSO) for multi-label feature selection. Particles are first created using this procedure, after which they are split into two equal groups and compete in pairs. The winners advance to the next iteration, while the losers pick up tips from the victors. At the conclusion of each iteration, the objective function for every particle is determined. A local search technique inspired by the gradient descent algorithm is used to find the local structure of the data, and half of the initial population is produced by the similarity between features and labels in order to boost the convergence rate. Ultimately, feature selection is carried out depending on the best particle. Six popular and sophisticated multi-label feature selection techniques are evaluated to see how well the suggested approach performs. According to the simulation results, the application of the suggested solution performs better than comparable techniques in terms of stability, accuracy, precision, convergence, error measurement, and other criteria that have been examined on various data sets. In 93.35% of cases, the test results demonstrate superiority over traditional algorithms.
ISSN:	1319-1578 2213-1248
DOI:	10.1016/j.jksuci.2024.102083