Method, system and device for sensitive data identification

The embodiment of the invention provides a method, a system and a device for sensitive data identification. The method comprises the following steps: parsing unstructured data to obtain text data corresponding to the unstructured data, wherein the text data comprises a plurality of words; inputting...

Full description

Saved in:
Bibliographic Details
Main Authors HAN PEIYI, DUAN SHAOMING, LIU CHUANYI, FANG BINXING
Format Patent
LanguageChinese
English
Published 22.05.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The embodiment of the invention provides a method, a system and a device for sensitive data identification. The method comprises the following steps: parsing unstructured data to obtain text data corresponding to the unstructured data, wherein the text data comprises a plurality of words; inputting the text data into a sensitive data recognition model to obtain a first annotation sequence with themaximum joint distribution probability for sensitive entity attributes of each word, wherein the sensitive data recognition model comprises a language model based on deep learning, a full connectionlayer and CRF; and determining the position of the sensitive data in the text data according to the first annotation sequence. A language model based on deep learning can well learn and represent eachword in the text data, and meanwhile, the annotation sequence with the maximum joint distribution probability for the sensitive entity attribute of each word in the text data is solved in combinationwith the CRF, so that the p
Bibliography:Application Number: CN201911194236