USE OF WORD EMBEDDINGS TO LOCATE SENSITIVE TEXT IN COMPUTER PROGRAMMING SCRIPTS

Bibliographic Details
Main Authors ABDI TAGHI ABAD, Fardin, GOODSITT, Jeremy Edward, WATSON, Mark Louis, PHAM, Vincent, WALTERS, Austin Grant, TRUONG, Anh, FARIVAR, Reza, TAYLOR, Kenneth
Format Patent
Language English
Published 24.06.2021
Summary: Exemplary embodiments may use word embeddings to enhance scanning of programming code scripts for sensitive subject matter, such as confidential subject matter. In some exemplary embodiments, the scanning may be performed by a neural network. The neural network may initially be trained on a corpus of programming code scripts to identify keywords relating to sensitive subject matter, such as passwords, tokens, or credentials. The neural network may identify not only instances of the keywords but also related terms. The output of the scan may be a ranked list of terms in the programming code script that may relate to sensitive subject matter.
Bibliography: Application Number: US202016992371
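
The ranked-list output described in the summary can be illustrated with a minimal Python sketch. This is not the patented method: the record does not give the network architecture or training procedure, so the sketch substitutes simple co-occurrence count vectors and cosine similarity for trained neural word embeddings. The seed list, the sample script, and all function names (`tokenize`, `cooccurrence_vectors`, `rank_terms`) are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical seed keywords; the summary names passwords, tokens and credentials.
SEED_KEYWORDS = {"password", "token", "credential"}

# Hypothetical script to scan.
SAMPLE = """
password = read_secret(path)
passwd = read_secret(path)
retries = max(limit, 3)
"""


def tokenize(script):
    """Split a script into lowercase identifier-like tokens."""
    return [t.lower() for t in re.findall(r"[A-Za-z_][A-Za-z0-9_]*", script)]


def cooccurrence_vectors(tokens, window=2):
    """Map each token to a sparse vector counting its neighbours within `window`."""
    vectors = defaultdict(Counter)
    for i, tok in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                vectors[tok][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank_terms(script, seeds=SEED_KEYWORDS, window=2):
    """Return (term, score) pairs, highest score first.

    A seed keyword scores 1.0; any other term scores its best cosine
    similarity to a seed keyword that occurs in the script.
    """
    vectors = cooccurrence_vectors(tokenize(script), window)
    present_seeds = [s for s in seeds if s in vectors]
    scores = {}
    for term, vec in vectors.items():
        if term in seeds:
            scores[term] = 1.0
        else:
            scores[term] = max(
                (cosine(vec, vectors[s]) for s in present_seeds), default=0.0
            )
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    for term, score in rank_terms(SAMPLE):
        print(f"{score:.3f}  {term}")
```

In this sketch, `passwd` is not a seed keyword, yet it ranks near the top because it appears in the same context as `password`. This mirrors the summary's point that related terms may be surfaced alongside exact keyword matches. An embedding model trained on a large corpus of scripts would replace the count vectors used here.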