DigBug—Pre/post-processing operator selection for accurate bug localization

Bug localization is a recurrent maintenance task in software development. It aims at identifying relevant code locations (e.g., code files) that must be inspected to fix bugs. When such bugs are reported by users, the localization process become often overwhelming as it is mostly a manual task due t...

Full description

Saved in:
Bibliographic Details
Published inThe Journal of systems and software Vol. 189; p. 111300
Main Authors Kim, Kisub, Ghatpande, Sankalp, Liu, Kui, Koyuncu, Anil, Kim, Dongsun, Bissyandé, Tegawendé F., Klein, Jacques, Traon, Yves Le
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Bug localization is a recurrent maintenance task in software development. It aims at identifying relevant code locations (e.g., code files) that must be inspected to fix bugs. When such bugs are reported by users, the localization process become often overwhelming as it is mostly a manual task due to incomplete and informal information (written in natural languages) available in bug reports. The research community has then invested in automated approaches, notably using Information Retrieval techniques. Unfortunately, reported performance in the literature is still limited for practical usage. Our key observation, after empirically investigating a large dataset of bug reports as well as workflow and results of state-of-the-art approaches, is that most approaches attempt localization for every bug report without considering the different characteristics of the bug reports. We propose DigBug as a straightforward approach to specialized bug localization. This approach selects pre/post-processing operators based on the attributes of bug reports; and the bug localization model is parameterized in accordance as well. Our experiments confirm that departing from “one-size-fits-all” approaches, DigBug outperforms the state-of-the-art techniques by 6 and 14 percentage points, respectively in terms of MAP and MRR on average. [Display omitted] •The role of pre/post-processing operators for better localization performance.•DigBug, a technique that selects a best-performing operator combination.•Empirical evaluation against the state-of-the-art.
ISSN:0164-1212
1873-1228
DOI:10.1016/j.jss.2022.111300