Method, system and device for sensitive data identification
Main Authors | , , , |
---|---|
Format | Patent |
Language | Chinese, English |
Published | 22.05.2020 |
Subjects | |
Summary: | The embodiment of the invention provides a method, a system and a device for sensitive data identification. The method comprises the following steps: parsing unstructured data to obtain text data corresponding to the unstructured data, wherein the text data comprises a plurality of words; inputting the text data into a sensitive data recognition model to obtain a first annotation sequence with the maximum joint distribution probability for the sensitive entity attributes of each word, wherein the sensitive data recognition model comprises a deep-learning-based language model, a fully connected layer and a CRF (conditional random field); and determining the position of the sensitive data in the text data according to the first annotation sequence. A deep-learning-based language model can effectively learn and represent each word in the text data, and the annotation sequence with the maximum joint distribution probability for the sensitive entity attribute of each word in the text data is solved in combination with the CRF, so that the p |
---|---|
Bibliography: | Application Number: CN201911194236 |
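
The last two steps of the summary (solving for the annotation sequence with the maximum joint probability, then locating the sensitive data from that sequence) can be illustrated with a minimal sketch. It is not the patented implementation: the BIO tag set, the `ID` entity type, the score values, and the function names `viterbi_decode` and `sensitive_spans` are all assumptions made for illustration. The per-word scores stand in for the output of the language model and fully connected layer, and Viterbi decoding is used here as the standard way to find the highest-scoring tag sequence under a linear-chain CRF.

```python
from typing import List, Tuple


def viterbi_decode(emissions: List[List[float]],
                   transitions: List[List[float]]) -> List[int]:
    """Return the tag index sequence with the highest total score.

    emissions[t][j]: score of tag j for word t (assumed to come from the
    language model followed by the fully connected layer).
    transitions[i][j]: score of moving from tag i to tag j (CRF parameters).
    """
    if not emissions:
        return []
    n_tags = len(emissions[0])
    score = list(emissions[0])           # best score of any path ending in each tag
    backptrs: List[List[int]] = []       # best previous tag for each (word, tag)
    for t in range(1, len(emissions)):
        new_score, ptrs = [], []
        for j in range(n_tags):
            best_i = max(range(n_tags),
                         key=lambda i: score[i] + transitions[i][j])
            new_score.append(score[best_i] + transitions[best_i][j]
                             + emissions[t][j])
            ptrs.append(best_i)
        score = new_score
        backptrs.append(ptrs)
    # Follow the back-pointers from the best final tag to recover the path.
    best = max(range(n_tags), key=lambda j: score[j])
    path = [best]
    for ptrs in reversed(backptrs):
        best = ptrs[best]
        path.append(best)
    path.reverse()
    return path


def sensitive_spans(tags: List[str]) -> List[Tuple[int, int, str]]:
    """Convert BIO tags to (start, end_exclusive, entity_type) word spans."""
    spans: List[Tuple[int, int, str]] = []
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):   # sentinel closes a trailing span
        inside = tag.startswith("I-") and label == tag[2:]
        if start is not None and not inside:
            spans.append((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


# Hypothetical tag set and scores for a three-word text.
TAGS = ["O", "B-ID", "I-ID"]
emissions = [
    [2.0, 0.5, 0.1],
    [0.2, 1.5, 1.6],
    [0.3, 0.2, 2.0],
]
transitions = [
    [0.0, 0.0, -10.0],   # from O: an I-ID tag directly after O is penalised
    [0.0, 0.0, 1.0],     # from B-ID
    [0.0, 0.0, 0.5],     # from I-ID
]

path = viterbi_decode(emissions, transitions)
annotation = [TAGS[i] for i in path]
print(annotation)                   # ['O', 'B-ID', 'I-ID']
print(sensitive_spans(annotation))  # [(1, 3, 'ID')]
```

In this toy input the second word scores slightly higher for `I-ID` than for `B-ID` when considered alone, but decoding the sequence jointly selects `B-ID` because an `I-ID` tag cannot plausibly follow `O`. This is the kind of benefit the summary attributes to combining the language model with the CRF.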