Use of word embeddings to locate sensitive text in computer programming scripts

Exemplary embodiments may use word embeddings to enhance scanning of programming code scripts for sensitive subject matter, such as confidential subject matter. The scanning may be performed by a neural network in some exemplary embodiments. The neural network initially may be trained on a corpus of...

Full description

Saved in:
Bibliographic Details
Main Authors Pham, Vincent, Walters, Austin Grant, Truong, Anh, Watson, Mark Louis, Goodsitt, Jeremy Edward, Taylor, Kenneth, Abdi Taghi Abad, Fardin, Farivar, Reza
Format Patent
LanguageEnglish
Published 22.09.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Exemplary embodiments may use word embeddings to enhance scanning of programming code scripts for sensitive subject matter, such as confidential subject matter. The scanning may be performed by a neural network in some exemplary embodiments. The neural network initially may be trained on a corpus of programming code scripts to identify keywords relating to sensitive subject matter, such as passwords, tokens or credentials. The neural network may not only identify instances of the keywords but also may identify related terms as well. The output of the scan may be a ranked list of terms in the programming code script that may relate to sensitive subject matter.
Bibliography:Application Number: US201916722867