GA-SVM based feature selection and parameter optimization in hospitalization expense modeling

Bibliographic Details
Published in Applied soft computing Vol. 75; pp. 323 - 332
Main Authors Tao, Zhou; Huiling, Lu; Wenwen, Wang; Xia, Yong
Format Journal Article
Language English
Published Elsevier B.V. 01.02.2019
Summary: Feature selection and parameter optimization are two important ways to improve classifier performance. A novel approach based on the genetic algorithm (GA) for feature selection and parameter optimization of the support vector machine (SVM) is proposed to improve the prediction accuracy of a hospitalization expense model. First, the hospitalization expense data are preprocessed, including data cleaning, discretization, and normalization. Second, k-means clustering is used to obtain two category labels. Third, the penalty factor c, the kernel function parameter γ, and the feature mask are used to construct the chromosome. Fourth, a weighted combination of classification accuracy and feature number is taken as the fitness function, and GA is used to optimize the SVM parameters while simultaneously selecting the optimal feature subset. Finally, single-parameter optimization is performed using GA and particle swarm optimization (PSO), and the resulting optimization performance is compared with that of GA-PCA and PSO-PCA. Experimental results show that the proposed algorithm can quickly obtain suitable feature subsets and SVM parameters, thereby achieving a better classification result.
• k-means clustering is used to obtain two category labels: low expense and high expense.
• A weighted combination of classification accuracy and feature number is taken as the fitness function.
• The penalty factor c, the kernel function parameter γ, and the feature mask are used to construct the chromosome.
• A GA-SVM based method for feature selection and parameter optimization in hospitalization expense modeling is proposed.
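The chromosome layout and fitness function described in the summary can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the weights (0.8 and 0.2), the search ranges for c and γ, the truncation selection, the single-point crossover, and every function name are assumptions made for the example. The `accuracy_fn` callback is hypothetical and stands in for whatever classifier evaluation is used; here a toy function replaces a real SVM so that the sketch runs with the standard library only.

```python
import random

# Assumed search ranges; the record does not state the ones used in the paper.
C_RANGE = (0.1, 100.0)
GAMMA_RANGE = (0.001, 10.0)


def random_chromosome(n_features, rng):
    """Chromosome = (penalty factor c, kernel parameter gamma, feature mask)."""
    c = rng.uniform(*C_RANGE)
    gamma = rng.uniform(*GAMMA_RANGE)
    mask = tuple(rng.randint(0, 1) for _ in range(n_features))
    return (c, gamma, mask)


def fitness(chrom, accuracy_fn, w_acc=0.8, w_feat=0.2):
    """Weighted combination of accuracy and feature count (weights assumed)."""
    c, gamma, mask = chrom
    if not any(mask):
        return 0.0  # a classifier needs at least one feature
    acc = accuracy_fn(c, gamma, mask)
    feat_score = 1.0 - sum(mask) / len(mask)  # fewer features scores higher
    return w_acc * acc + w_feat * feat_score


def crossover(a, b, rng):
    """Single-point crossover on the mask; c from one parent, gamma from the other.

    Assumes at least two features so that a cut point exists.
    """
    point = rng.randrange(1, len(a[2]))
    return (a[0], b[1], a[2][:point] + b[2][point:])


def mutate(chrom, rng, rate=0.1):
    """Flip mask bits and resample c or gamma, each with probability `rate`."""
    c, gamma, mask = chrom
    mask = tuple(bit ^ 1 if rng.random() < rate else bit for bit in mask)
    if rng.random() < rate:
        c = rng.uniform(*C_RANGE)
    if rng.random() < rate:
        gamma = rng.uniform(*GAMMA_RANGE)
    return (c, gamma, mask)


def run_ga(n_features, accuracy_fn, pop_size=20, generations=30, seed=0):
    """Evolve chromosomes, keeping the better half each generation (elitism)."""
    rng = random.Random(seed)
    pop = [random_chromosome(n_features, rng) for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=lambda ch: fitness(ch, accuracy_fn), reverse=True)
        elite = ranked[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        pop = elite + children
    return max(pop, key=lambda ch: fitness(ch, accuracy_fn))


def toy_accuracy(c, gamma, mask):
    """Stand-in for SVM accuracy: only features 0 and 1 are informative."""
    return 0.5 + 0.25 * mask[0] + 0.25 * mask[1]


if __name__ == "__main__":
    best_c, best_gamma, best_mask = run_ga(6, toy_accuracy)
    print("c =", best_c, "gamma =", best_gamma, "mask =", best_mask)
```

In a real setting, `accuracy_fn` would presumably train an SVM with the given c and γ on only the columns selected by the mask and return a held-out or cross-validated accuracy; the toy function above exists only to make the sketch self-contained.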
ISSN: 1568-4946; 1872-9681
DOI: 10.1016/j.asoc.2018.11.001