Part of Speech Tagging Java Method Names

Bibliographic Details
Published in: 2016 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 483-487
Main Authors: Olney, Wyatt; Hill, Emily; Thurber, Chris; Lemma, Bezalem
Format: Conference Proceeding
Language: English
Published: IEEE, 01.10.2016
Summary: Numerous software engineering tools for evolution and comprehension, including code search, comment generation, and analyzing bug reports, make use of part-of-speech (POS) information. However, many POS taggers are developed for, and trained on, natural language. In this paper, we investigate the accuracy of 9 POS taggers on over 200 source code identifiers taken from method names in open source Java programs. The set of taggers includes traditional POS taggers for English as well as some tuned to source code identifiers. Our results indicate that taggers tailored for source code are significantly more effective.
DOI: 10.1109/ICSME.2016.80