NL2Type: Inferring JavaScript Function Types from Natural Language Information

JavaScript is dynamically typed and hence lacks the type safety of statically typed languages, leading to suboptimal IDE support, difficult to understand APIs, and unexpected runtime behavior. Several gradual type systems have been proposed, e.g., Flow and TypeScript, but they rely on developers to...

Full description

Saved in:

Bibliographic Details
Published in	2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE) pp. 304 - 315
Main Authors	Malik, Rabee Sohail, Patra, Jibesh, Pradel, Michael
Format	Conference Proceeding
Language	English
Published	IEEE 01.05.2019
Subjects	comments Data mining Deep learning identifiers JavaScript Manuals Natural languages Predictive models Semantics type inference
Online Access	Get full text

Cover

Loading…

More Information
Summary:	JavaScript is dynamically typed and hence lacks the type safety of statically typed languages, leading to suboptimal IDE support, difficult to understand APIs, and unexpected runtime behavior. Several gradual type systems have been proposed, e.g., Flow and TypeScript, but they rely on developers to annotate code with types. This paper presents NL2Type, a learning-based approach for predicting likely type signatures of JavaScript functions. The key idea is to exploit natural language information in source code, such as comments, function names, and parameter names, a rich source of knowledge that is typically ignored by type inference algorithms. We formulate the problem of predicting types as a classification problem and train a recurrent, LSTM-based neural model that, after learning from an annotated code base, predicts function types for unannotated code. We evaluate the approach with a corpus of 162,673 JavaScript files from real-world projects. NL2Type predicts types with a precision of 84.1% and a recall of 78.9% when considering only the top-most suggestion, and with a precision of 95.5% and a recall of 89.6% when considering the top-5 suggestions. The approach outperforms both JSNice, a state-of-the-art approach that analyzes implementations of functions instead of natural language information, and DeepTyper, a recent type prediction approach that is also based on deep learning. Beyond predicting types, NL2Type serves as a consistency checker for existing type annotations. We show that it discovers 39 inconsistencies that deserve developer attention (from a manual analysis of 50 warnings), most of which are due to incorrect type annotations.
ISSN:	1558-1225
DOI:	10.1109/ICSE.2019.00045