Sensitive data classification method based on improved ID3 decision tree
The invention discloses a sensitive data classification method based on an improved ID3 decision tree, and the method comprises the steps: 1, collecting sensitive data with the same condition attribute to form a sample data set, dividing the sample data set into a sample training set and a sample te...
Saved in:
Main Authors | , , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
06.09.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The invention discloses a sensitive data classification method based on an improved ID3 decision tree, and the method comprises the steps: 1, collecting sensitive data with the same condition attribute to form a sample data set, dividing the sample data set into a sample training set and a sample testing set according to a certain proportion, and carrying out the data preprocessing; step 2, based on an information gain theory, selecting a condition attribute with a maximum weighted information gain of sample data in the sample training set; 3, taking the condition attribute with the maximum weighted information gain as a split node attribute of the decision tree, constructing an improved ID3 decision tree, and carrying out model verification; and 4, carrying out sensitive data classification by utilizing the improved ID3 decision tree. According to the method, sensitive data are classified by improving the ID3 decision tree model, the improved ID3 decision tree model can well integrate data features in multip |
---|---|
Bibliography: | Application Number: CN202210792299 |