A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction
In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective...
Saved in:
Published in | 2008 ACM/IEEE 30th International Conference on Software Engineering Vol. 2008; no. 24; pp. 181 - 190 |
---|---|
Main Authors | , , |
Format | Conference Proceeding Journal Article |
Language | English |
Published |
IEEE
01.01.2008
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective defect-free. Classification models are built using three common machine learners: logistic regression, naive Bayes, and decision trees. To allow different costs for prediction errors we perform cost-sensitive classification, which proves to be very successful: >75% percentage of correctly classified files, a recall of >80%, and a false positive rate <30%. Results indicate that for the Eclipse data, process metrics are more efficient defect predictors than code metrics. |
---|---|
Bibliography: | SourceType-Scholarly Journals-2 ObjectType-Feature-2 ObjectType-Conference Paper-1 content type line 23 SourceType-Conference Papers & Proceedings-1 ObjectType-Article-3 |
ISBN: | 1424444861 9781424444861 1605580791 9781605580791 |
ISSN: | 0270-5257 1558-1225 |
DOI: | 10.1145/1368088.1368114 |