A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction

In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective...

Full description

Saved in:
Bibliographic Details
Published in2008 ACM/IEEE 30th International Conference on Software Engineering Vol. 2008; no. 24; pp. 181 - 190
Main Authors Moser, R., Pedrycz, W., Succi, G.
Format Conference Proceeding Journal Article
LanguageEnglish
Published IEEE 01.01.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective defect-free. Classification models are built using three common machine learners: logistic regression, naive Bayes, and decision trees. To allow different costs for prediction errors we perform cost-sensitive classification, which proves to be very successful: >75% percentage of correctly classified files, a recall of >80%, and a false positive rate <30%. Results indicate that for the Eclipse data, process metrics are more efficient defect predictors than code metrics.
Bibliography:SourceType-Scholarly Journals-2
ObjectType-Feature-2
ObjectType-Conference Paper-1
content type line 23
SourceType-Conference Papers & Proceedings-1
ObjectType-Article-3
ISBN:1424444861
9781424444861
1605580791
9781605580791
ISSN:0270-5257
1558-1225
DOI:10.1145/1368088.1368114