Sensitive data classification method based on improved ID3 decision tree

The invention discloses a sensitive data classification method based on an improved ID3 decision tree, and the method comprises the steps: 1, collecting sensitive data with the same condition attribute to form a sample data set, dividing the sample data set into a sample training set and a sample te...

Full description

Saved in:
Bibliographic Details
Main Authors ZHONG QI, GAO SHICHANG, BAI JINGWEI, HE QIANG, YI BO, CHENG ZIQIU
Format Patent
LanguageChinese
English
Published 06.09.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention discloses a sensitive data classification method based on an improved ID3 decision tree, and the method comprises the steps: 1, collecting sensitive data with the same condition attribute to form a sample data set, dividing the sample data set into a sample training set and a sample testing set according to a certain proportion, and carrying out the data preprocessing; step 2, based on an information gain theory, selecting a condition attribute with a maximum weighted information gain of sample data in the sample training set; 3, taking the condition attribute with the maximum weighted information gain as a split node attribute of the decision tree, constructing an improved ID3 decision tree, and carrying out model verification; and 4, carrying out sensitive data classification by utilizing the improved ID3 decision tree. According to the method, sensitive data are classified by improving the ID3 decision tree model, the improved ID3 decision tree model can well integrate data features in multip
Bibliography:Application Number: CN202210792299